Toxicity Forecaster (ToxCastTM) Assay Description Documentation


v>EPA

United States
Environmental
Protection Agency

EPA Publication Number 601B24001 | September 2024



§¦ H



I

|C •



PCt#

W' J

1 1$

-1 -

/>



z.:m



# JIT

J V

I

Office of Research and Development

Center for Computational Toxicology and Exposure

Toxicity Forecaster

(ToxCast™)
Assay Description
Documentation

-------
EPA Publication Number 601B24001

TOXICITY FORECASTER (TOXCAST)
ASSAY DESCRIPTION DOCUMENTATION

September 2024
Madison Feshuk1, Ashley Ko12, Manasvinee Mayil Vahanan13,
Kelly Cartsens1, Alison Harrill1, Katie Paul Friedman1

1Center for Computational Toxicology and Exposure, Office of Research and Development, US EPA, Research Triangle Park, NC
2Oak Ridge Associated Universities (ORAU) National Student Services Contract at US EPA
3Oak Ridge Institute for Science and Education (ORISE) at US EPA

-------
Overview

The Toxicity Forecaster fToxCast™) program at the US Environmental Protection Agency (US EPA)'s
makes in vitro medium- and high-throughput screening assay data publicly available for prioritization and
hazard characterization of thousands of chemicals. Please review the vignette for the ToxCast Data Analysis
Pipeline (tcpl) R package for comprehensive documentation describing ToxCast data processing, retrieval,
and interpretation. Given ToxCast includes a heterogeneous set of assays across a diverse biological space,
annotations in the database help users flexibly aggregate and differentiate processed data whereas assay
documentation aligned with international standardization efforts can make ToxCast data more useful and
interpretable for use in decision-making.

This documentation for the ToxCast assay endpoints is in a format outlined by the OECD Guidance
Document 211 (GD211) for describing non-guideline in vitro test methods and their interpretation. The
intent of GD 211 is to harmonize non-guideline, in vitro method descriptions to allow assessment of the
relevance of the test method for biological responses of interest and the quality of the data produced.

This document contains reports for 809 assay endpoints accompanying the invitrodb v4.2 release
(September 2024). Please utilize the Table of Contents, Ctrl+F, or the Bookmarks panel to navigate to
specific assay sources and endpoints of interest. These reports are a work in progress and will be
iteratively updated as more information becomes available.

For additional questions or concerns, please contact Madison Feshuk (feshuk.madison@epa.gov).

Disclaimer

This report does not reflect the views or policies of the US Environmental Protection Agency. Company
or product names do not constitute endorsement by US EPA.

duration Summary

For this effort, existing database information from ToxCast's invitrodb was first reviewed to populate as
many GD211 stipulated fields. Assay element and auxiliary annotations were leveraged, though this
information is often short in a standardized format or using controlled vocabulary. Missing fields were
identified and selected for curation. This curated information has no character limit therefore can
provide users with the most robust description of the assay technology and its relevance. All curated
information has also been databased in the updated "assay descriptions" table of invitrodb. Fields and
their descriptions, modeled after their corresponding GD211 sections, are provided in the table below,
but also available in invitrodb's data dictionary:

Field Description

aeid Assay component endpoint ID

assay_title 1.1 Assay Name (title): Short and descriptive title for the assay

assay_objectives 2.1 Purpose of the test method: Inserted after assay_component_target_desc; The

claimed purpose and rationale for intended use of the method (e.g. alternative to an
existing method, screening, provision of novel information in regulatory decision-making,
mechanistic information, adjunct test, replacement, etc.) should be explicitly described
and documented. The response measured in the assay should be put in the context of
the biology/physiology leading to the in vivo response or effect.

-------
If the biological activity or response refers to a key event or molecular initiating
event (MIE), provide a short description indicating firstly what key event within
an existing or developing AOP, or in relation to a mechanism or mode of action,
the assay is aiming to characterize (i.e. which level of biological organization the
assay may be attributed (e.g. sub-cellular, cellular, tissue, organ or individual),
and secondly where the assay might fit in the context of an existing regulatory
hazard (i.e. adverse outcome).

In the absence of any AOP, provide an indication of the plausible linkage
between the mechanism(s) the assay is measuring and the resulting hazard
endpoint.

assay_ throughput 1.10 Information about the throughput of the assay: Information about the throughput
of the assay: indicate the throughput of the assay to provide an indication of likely
resource intensity e.g. low (manual assay, one chemical tested at a time), low-moderate,
moderate, moderate-high, high throughput (e.g. in 96 well-plate and higher) and qualify
with e.g. approximate number of chemicals/concentrations per run. If appropriate
indicate whether a manual assay could be run in a higher throughput mode
scientific_ principles 2.2 Scientific principle of the method: provide the scientific rationale, supported by
bibliographic references to articles, for the development of the assay. A summary
description of the scientific principle including the biological/physiological basis and
relevance (e.g. modeling of a specific organ) and/or mechanistic basis (e.g. modeling a
particular mechanism by biochemical parameters) should be described. If possible,
indicate what the anchor point is within an AOP.

2.6 Response and Response Measurement: response here refers to any biological effect,
process or activity that can be measured. Specify precisely and describe the response
and its measurement.

3.2 Data analysis: Comment on the response value in terms of a boundary or range to
provide a context for interpretation. E.g. putting into context what a negative value or
>100% value might represent in a binding inhibition assay.

2.5 Description of the experimental system exposure regime: provide a summary
description of the essential information pertaining to the exposure regime (dosage and
exposure time including observation frequency) of the test compounds to the
experimental system including information on metabolic competence if appropriate;
number of doses/concentrations tested or testing range, number of replicates, the use of
control(s) and vehicle. Also, describe any specialized equipment needed to perform the
assay and measure the response. Indicate whether there might be potential solubility
issues with the test system, and solutions proposed to address the issue.

2.3 Tissue, cells or extracts utilized in the assay and the species source: indicate the
experimental system for the activity or response being measured. Provide information on
whether materials are readily available commercially or whether materials are developed
in the laboratory (e.g. cell suspensions from tissue). Indicate source/manufacturer of
biological material used. Indicated whether cryopreserved biological material can be
used or only freshly prepared.

2.4 Metabolic competence of the test system: describe and discuss the extent to which
the test system can be considered metabolically competent, either by itself, or with the
addition of an enzymatic fraction, if appropriate. Provide reference if available.
1.9 Availability of information about the assay in relation to proprietary elements:

indicate whether the assay is proprietary or non-proprietary (to what extent is the assay
method transferable or contains proprietary elements) and specify (if possible) what kind
of information about the assay cannot be disclosed or is not available (e.g., chemical
reference sets (training or test sets), prediction model).

* Key information has been emboldened.

biologica l_
responses

analytical
description

basic_procedures

experimental
system

xenobiotic_
biotransformation

proprietary_
elements

-------
Interpretation of Robustness Metrics

To assess test method performance, quantitative metrics were derived for the processed multi-
concentration response data to examine the assay's performance relative to controls. This function is
available in the tcpl vignette, under Data Retrieval in invitrodb>Review MC assay quality. A summary of
each of the metrics and their interpretation is provided below:

NEUTRAL CONTROL
(well type = "n")

Neutral control well median response
value, by plate: nmed

Neutral control median absolute
deviation, by plate: nmad

Coefficient of variation (CV%) in
neutral control wells:
(nmad/nmed)*100

POSITIVE CONTROL
(well type = "p")

Positive control well median response
value, by plate: pmed

Positive control well median absolute
deviation, by plate: pmad
Z-Prime Factor for median positive and
neutral control across all plates:

1 — ((3 * (pmad + nmad))

abs(pmed — nmed)

DESCRIPTION

This is a robust measure of central tendency in the neutral control wells
(i.e., vehicle control or wells not expected to cause biological change).
Often serves as baseline or background response observed without
chemical treatment. Value should be considered in context of the
endpoint's response units and is calculated as the median of responses
in neutral control wells.

This is a robust measure of the variability in neutral control wells. Value
should be considered with response units and is calculated as the
median of the absolute deviations (from the median), multiplied by the
scaling factor constant of 1.4826:

1.4826*median(|yi-y|)
where y, is the ith observation of all wells within neutral control wells
and y is the median across all yi's

The coefficient of variation (CV), expressed as a percentage, compares
the relative variability of neutral control responses against the median
of neutral controls wells. CV% >20% may indicate high variability
however interpretation is assay dependent.

This is a measure of central tendency in the positive control wells. Value
should be considered in context of the endpoint's normalized response
units.

This is a measure of the variability in positive control wells. Value
should be considered with response units.

Z-prime factor is a robust measure of signal-to-background difference
(Zhang et al, 1999). Measuring the degree of separation between
neutral and positive controls, each with their own variability, can be
indicative of likelihood of false positives or negatives. The Z'-factor has

the range of -°° to 1, and is traditionally interpreted as follows:

• Z =l: Ideal. This is approached when the assay has wide
dynamic range with a small median absolute deviation across
controls. In this situation, the separation band is almost as long
as the dynamic range.

• 1.0 > T > 0.5: Excellent. Assay shows good separation between
controls.

• 0.5 > T > 0: Marginal. Assay shows an acceptable degree of
separation between controls.

• Z'=0: Nominal: Good only for a yes/no response

• Z'<0: Unacceptable. Use caution with given overlap in
response between controls.

Note that these categories are not imposed in presented metrics.

-------
Strictly standardized mean difference
(SSMD) for positive compared to
neutral control wells:

(pmed — nmed)

yjpmad2 + nmad2

Strictly standardized mean difference (SSMD, often denoted as P) is a
robust measure of effect size and was developed to address limitations
in the Z1 factor for experiments using controls of moderate strength.
Acceptable screening values for SSMD depend on the strength of the
positive controls used. A higher SSMD may correspond to stronger
controls. Table of suggested interpretation of values by control strength
from Advanced Assay Development Guidelines for Image-based High
Content Screening and Analysis

Quality Type

1 Moderate Control

2 Strong Control

3 Very Strong Control

•1 Extremely Strong Control

Excellent

p>2

|!>3



IV

-J

Good

2> p> 1

3>P42

5>p>3

7>p>5

Inferior

1 > p > 0 S

2 > pa l

3>p>2

S>p>3

Poor

p<0.5

p T > 0.5: Excellent. Assay shows good separation between
controls.

•	0.5 > T > 0: Marginal. Assay shows an acceptable degree of
separation between controls.

•	Z'=0: Nominal: Good only for a yes/no response

•	Z'<0: Unacceptable. Use caution with given overlap in
response between controls.

Note that these categories are not imposed in presented metrics.
Strictly standardized mean difference (SSMD, often denoted as P) is a
robust measure of effect size and was developed to address limitations
in the Z1 factor for experiments using controls of moderate strength.

-------
(mmed — nmed)
\lmmad2 + nmad2

Acceptable screening values for SSMD depend on the strength of the
positive controls used. Table of suggested interpretation of values by
control strength from Advanced Assay Development Guidelines for

Image-based High Content Screening and Analysis.

Signal-to-noise (median across all
plates, using negative control wells):
(mmed-nmed)/nmad)
Signal-to-background (median across
all plates, using negative control
wells): (mmed/nmed)

The signal-to-noise ratio (S/N) gives a measure of the degree of
confidence that a difference in signal noise in negative controls
compared to background response is real.

The signal-to-background ratio (S/B) is a simple comparison of the
median negative control signal to the median neutral controls, i.e.
background response. It does not contain any information about
variability of the data.

-------
Table of Contents

The following endpoints are included in this iteration of the ToxCast Assay Description Documentation.

Selecting a hyperlinked endpoint from this Table of Contents will direct users to the individual endpoint-

specific PDF. Please utilize Ctrl+F or the Bookmarks panel to navigate to specific assay sources and

endpoints of interest within this document.

AEID8 APR HepG2 MicrotubuleCSK Ihr ~

AEID26 APR	HepG2 Cell Loss 24 hr ~

AEID12 APR	HepG2 MitoMembPot Ihr ~

AEID24 APR	HepG2 CellCvcleArrest 24hr ~

AEID6 APR HepG2 CellLoss Ihr ~

AEID30 APR	HepG2 MitoMass 24hr ~

AEID16 APR	HepG2 NuclearSize Ihr ~

AEID14 APR	HepG2 MitoticArrest Ihr ~

AEID20 APR	HepG2 p53Act Ihr ~

AE1D32 APR	HepG2 MitoMembPot 24hr ~

AEID22 APR	HepG2 StressKinase Ihr ~

AEID10 APR	HepG2 MitoMass Ihr ~

AEID28 APR	HepG2 MicrotubuleCSK 24hr ~

AE1D18 APR	HepG2 P-H2AX Ihr ~

AEID52. APR	HepG2 MitoMembPot ?2hr ~

AEID2 ACEA ER 80hr ~

AEID46 APR	HepG2 CellLoss ?2hr ~

AEID56 APR	HepG2 NuclearSize ?2hr ~

AEID4 APR HepG2 CellCvcleArrest Ihr ~

AEID40 APR	HepG2 p53Act 24hr ~

AEID34 APR	HepG2 MitoticArrest 24hr ~

AEID36 APR	HepG2 NuclearSize 24hr ~

AEID67 ATG	C EBP CIS ~

AEID63 ATG	Ahr CIS ~

AE1D48 APR	HepG2 MicrotubuleCSK 72hr ~

AEID65 ATG	AP 2. CIS ~

AEID64 ATG	AP 1 CIS ~

AEID44 APR	HepG2 CellCvcleArrest ?2hr ~

AEID54 APR	HepG2 MitoticArrest ?2.hr ~

AEID58 APR	HepG2 P-H2AX ?2hr ~

AEID50 APR	HepG2 MitoMass ?2hr ~

AE1D42 APR	HepG2 StressKinase 24hr ~

AEID60 APR	HepG2 p53Act ?2hr ~

AEID69 ATG	CRE CIS ~

AEID62 APR	HepG2 StressKinase ?2hr ~

AE1D72 ATG	E Box CIS ~

AEID66 ATG	BRE CIS ~

-------
AEIDE	)G2 P-H2AX 24 hr ~

5 RAR t

AE1D75 ATG ERE CIS ~

AE1D74 ATG EGR CIS ~

AEID68 ATG CMV CIS ~

AEID82. ATG HIFla CIS ~

AEID79 ATG GATA CIS ~

AEID76 ATG Ets CIS ~

AEID80 ATG GLI CIS ~

AEID78 ATG FoxO CIS ~

AEID77 ATG FoxA2 CIS ~

AEID70 ATG DR4 LXR CIS ~

AEID73 ATG E2.F CIS ~

AEID86 ATG ISRE CIS ~

AEID85 ATG IR1 CIS ~

AEID83 ATG HNF6 CIS ~

AEID81 ATG GRE CIS ~

AEID84 ATG HSE CIS ~

AEID88 ATG M 19 CIS ~

AEID91 ATG MRE CIS ~

AEID90 ATG M 61 CIS ~

AEID89 ATG M 32. CIS ~

AEID87 ATG M 06 CIS ~

AEID92. ATG Mvb CIS ~

AEID94 ATG NF kB CIS ~

AEID97 ATG NRF2 ARE CIS ~
AEID98 ATG Oct MLP CIS ~

AEID93 ATG Mvc CIS ~

AEID100 ATG Pax6 CIS ~

AEID96 ATG NRF1 CIS ~

AEID95 ATG NFI CIS ~

AEID101 ATG PBREM CIS ~

AEID103 ATG PXRE CIS ~

AEID99 ATG p53 CIS ~

AEID106 ATG Sol CIS ~

AEID108 ATG STAT3 CIS ~

AEID104 ATG RORE CIS ~

AEID109 ATG TA CIS ~

AEID107 ATG SREBP CIS ~

AEID105 ATG Sox CIS ~

AEID111 ATG TCF b cat CIS ~
AEID110 ATG TAL CIS ~

AEID102. ATG PPRE CIS ~

AEID113 ATG VDRE CIS ~

-------
AEID116

ATG

*,NS ~

AEID118

ATG

ERRa TRANS ~

AEID114 ATG Xbol CIS ~

AEID117

ATG

ERa TRANS ~

AEID112

ATG

TGFb CIS ~

AEID115

ATG

AR TRANS ~

AEID119 ATG ERRg TRANS ~

AEID123

ATG

HNF4a TRANS ~

AEID120

ATG

FXR TRANS ~

AEID122

ATG

GR TRANS ~

AEID126

ATG

LXRb TRANS ~

AEID124 ATG Hpa5 TRANS ~

AEID125

ATG

LXRa TRANS ~

AEID127

ATG

M 06 TRANS ~

AEID130

ATG

M 61 TRANS ~

AEID129

ATG

M 32 TRANS ~

AEID128

ATG

M 19 TRANS ~

AEID121

ATG

GAL4 TRANS ~

AEID131

ATG

NURR1 TRANS ~

AEID133

ATG

PPARd TRANS ~

AEID134 ATG PPARg TRANS ~

AEID132

ATG

PPARa TRANS ~

AEID137

ATG

RARb TRANS ~

AEID135

ATG

PXR TRANS ~

AEID136

ATG

RARa TRANS ~

AEID143

ATG

THRal TRANS ~

AEID139

ATG

RORb TRANS ~

AEID141

ATG

RXRa TRANS ~

AEID144

ATG

VDR TRANS ~

AEID140 ATG RORe TRANS ~

AEID138 ATG RARg TRANS ~

AEID142

ATG

RXRb TRANS ~

AEID150

BSK

3C ICAM i ~

AEID154

BSK

3C MCP1 ~

AEID146

BSK

3C Eselectin ~

AEID148

BSK

3C HLADR ~

AEID152

BSK

3C IL8 ~

AEID160

BSK

3C SRB ~

AEID156

BSK

3C MIG ~

AEID162

BSK

3C Thrombomodulin ~

AEID166

BSK

3C uPAR ~

AEID164

BSK

3C TissueFactor ~

AEID158

BSK

3C Proliferation ~

AEID168

BSK

3C VCAM1 ~

-------
AEID170

BSK

3C

Vis ~

AEID172

BSK

4H

Eotaxin3 ~

AEID174

BSK

4H

MCP1 ~

AEID180

BSK

4H

uPAR ~

AEID182

BSK

4H

VCAM1 ~

AEID184

BSK

4H

VEGFRII ~

AEID176

BSK

4H

Pselectin ~

AEID186

BSK

BE3C HLADR~

AEID178

BSK

4H

SRB ~

AEID188

BSK

BE3C Ilia ~

AEID194

BSK

BE3C MMP1 ~

AEID196	BSK BE3C PA11 ~

AEID198	BSK BE3C SRB ~

AEID190	BSK BE3C IP10 ~

AEID218	BSK CASM3C MCSF ~

AEID214	BSK CASM3C LDLR ~

AEID204	BSK BE3C tiPA ~

AEID200	BSK BE3C TGFbl ~

AEID206	BSK BE3C uPAR ~

AEID192	BSK BE3C M1G ~

AEID212	BSK CASM3C 1L8 ~

AEID210	BSK CASM3C 1L6 ~

AEID202	BSK BE3C tPA ~

AEID224	BSK CASM3C SAA ~

AEID230	BSK CASM3C TissueFactor ~

AEID220	BSK CASM3C MIG ~

AEID234	BSK CASM3C VCAM1 ~

AEID232	BSK CASM3C uPAR ~

AEID246	BSK hDFCGF MIG ~

AEID250	BSK hDFCGF PA11 ~

AEID222	BSK CASM3C Proliferation ~

AEID238	BSK hDFCGF EGFR ~

AEID264	BSK KF3CT IP 10 ~

AEID228	BSK CASM:	mbomodulin ~

AEID244	BSK hDFCGF MCSI

AEID266	BSK KF3CT MCP1 ~

AEID276	BSK KF3CT uPA ~

AEID280

BSK

LPS Eselectin ~

AEID282

BSK

LPS ILla ~

AEID270

BSK

KF3CT SRB ~

AEID272

BSK

KF3CT TGFbl ~

AEID268

BSK

KF3CT MMP9 ~

AEID262

BSK

KF3CT ILla ~

AEID292

BSK

LPS SRB ~

-------
AEID286 BSK LPS MCP1 ~

AEID290 BSK LPS PGE2 ~

AEID304 BSK SAg CD69 ~

AEID298 BSK LPS VCAM1 ~

AEID300 BSK SAg CD38 ~

AEID739 OT AR ARELUC AG 1440 ~

AEID308 BSK SAg IL8 ~

AEID302 BSK SAg CD40 ~

AEID1913 ATG chAR XSP1 ~

AE1D751 OT ERa GFPERaERE 0480 ~

AEID306 BSK SAg Eselectin ~

AEID296 BSK LPS IN Fa ~

AEID756 OT NURR1 NURR1RXI

AEID316 BSK SAg Proliferation ~

AE1D310 BSK SAg MCP1 ~

AE1D744 OT ER ERaERb 0480 ~

AEID795 TOX21 GR BLA Antagonist viability ~

AE1D746 OT ER ERbERb 0480 ~

AEID314 BSK SAg PBMCCytotoxicity ~

AE1D740 OT AR ARSRC1 0480 ~

AEID318 BSK SAg SRB ~

AE1D743 OT ER ERaERa 1440 ~

AEID312 BSK SAg MIG ~

AEID755 OT NURR1 NURRlRXRa 0480 ~

AEID784 TOX21 ERa BLA Agonist ch2. ~

AE1D750 OT ERa GFPERaERE 0120 ~

AE1D753 OT FXR FXRSRC1 0480 ~

AE1D745 OT ER ERaERb 1440 ~

AEID782 TOX21 ELG1 LUC Agonist viability ~

AE1D742. OT ER ERaERa 0480 ~

AEID783 TOX21 ERa BLA Agonist chl ~

AEID800 TOX21 PPARg BLA Agonist chl ~

AEID794 TOX21 GR BLA Antagonist ratio ~

AEID786 TOX21 ERa BLA Antagonist ratio ~

AEID899 CEETOX H295R CORTIC ~

AEID909 CEETOX H295R ESTRONE ~

AEID790 TOX21 ERa LUC VM7 Antagonist 0.5nM E2 viability ~
AE1D897 CEETOX H295R ANDR ~

AEID801 TOX21 PPARg BLA Agonist ch2. ~

AEID240

BSK

hDFCGF 11.8 ~

AEID242

BSK

hDFCGF IP1G ~

AEID260

BSK

KF3CT ICAM1 ~

AEID256

BSK

hDFCGF TIM PI ~

-------
AEIDE	(21 AhR LUC Agonist viability ~

AEID252 BSK hDFCGF Proliferation ~

AEID236 BSK hDFCGF Collagenlll ~

AEID893 CEETOX H295R OHPREG ~

AEID248 BSK hDFCGF MMP1 ~

AEID907 CEETOX H295R ESTRADIOL ~

AE1D785 TOX21 ERa BLA Agonist ratio ~

AE1D901 CEETOX H295R CORTISOL ~

AEID254 BSK hDFCGF SRB ~

AEID913 CEETOX H295R PROG ~

AE1D1U7 TOX21 FXR BLA Agonist chl ~

AEID915 CEETOX H295R TESTO ~

AEID891 CEETOX H295R 11DCORT ~

AEID905 CEETOX H295R DOC ~

AEID1109 TOX21 ARE BLA Agonist ch2. ~

AEID1108 TOX21 ARE BLA Agonist chl ~

AEID1122 TOX21 PPARd BLA Agonist chl ~
AE1D895 CEETOX H295R OH PROG ~

AEID1120 TOX21 FXR BLA Antagonist ratio ~
AEID1125 TOX21 PPARd BLA Antagonist ratio ~
AEID1123 TOX21 PPARd BLA Agonist ch2. ~
AEID1110 TOX21 ARE BLA Agonist ratio ~
AE1D1U8 TOX21 FXR BLA Agonist ch2. ~

AEID1119 TOX21 FXR BLA Agonist ratio ~
AEID1124 TOX21 PPARd BLA Agonist ratio ~
AEID1193 TOX21 GR BLA Antagonist chl ~
AEID1121 TOX21 FXR BLA Antagonist viability ~
AEID1188 TOX21 FXR BLA agonist viability ~
AEID1189 TOX21 ERa BLA Antagonist chl ~
AEID1196 TOX21 PPARd BLA Antagonist chl ~
AEID1128 TOX21 PPARg BLA Antagonist viabi
AEID1126 TOX21 PPARd BLA Antagonist viability ~
AEID1191 TOX21 FXR BLA Antagonist chl ~
AEID1192 TOX21 FXR BLA Antagonist ch2. ~
AEID1190 TOX21 ERa BLA Antagonist ch2. ~
AEID1343 TOX21 ESRE BLA Agonist viability ~
AEID1194 TOX21 GR BLA Antagonist ch2. ~
AEID1195 TOX21 PPARd BLA Agonist viability ~
AEID1198 TOX21 PPARg BLA Antagonist chl ~
AE1D3	atoxicity ~

AEID1199 TOX21 PPARg BLA Antagonist ch2. ~
AEID1185 TOX21 ARE BLA agonist viability ~
AE1DU97 TOX21 PPARd BLA Antagonist ch2 ~
AEID1354 ATG HNF4g TRANS2 ~

-------
AEID1359	ATG TR4 TRANS2 ~

AEID1350	ATG COUP TF2 TRANS2 ~

AEID1348 ATG NUR77 TRANS2 ~

AEID1202	TOX21 AR BLA Antagonist chl ~

AEID1341	TOX2.1 ESRE BLA Agonist ch2. ~

AEID1370	ATG EAR2 TRANS2. ~

AEID1361	ATG Rev ERB B TRANS2 ~

AEID1203	TOX21 AR BLA Antagonist ch2 ~

AEID1365	ATG SF 1 TRANS2. ~

AEID1356	ATG MR TRANS2 ~

AEID1351	ATG PNR TRANS2. ~

AEID1357	ATG COUP TF1 TRANS2 ~

AEID1352	ATG LRH1 TRANS2. ~

AEID1349	ATG GCNF TRANS2. ~

AEID1355	ATG ERRb TRANS2. ~

AEID1360	ATG DAX1 TRANS2. ~

AEID1358 ATG NOR1 TRANS2. ~

AEID1374 Tanguav ZF 120hpf AXIS legacy ~

AEID1369	ATG THRb TRANS2. ~

AEID1366	ATG SHP TRANS2. ~

AEID1368 ATG TLX TRANS2 ~

AEID1363	ATG PR TRANS2 ~

AEID1383	Tanguav ZF 120hpf CFIN legacy ~

AEID1389	Tanguay ZF 120hpf TR legacy ~

AEID1372	Tanguav ZF 120hpf MORT legacy ~

AEID1385	Tanguav ZF 120hpf CIRC legacy ~

AEID1375	Tanguav ZF 120hpf EYE legacy ~

AEID1376	Tanguav ZF 120hpf SNOU legacy ~

hpf TERATOSCORE ~

AE1D162.6	CLP UGT1A1 6hr ~

AEID1630	CLP ACT IN 24hr ~

AEIP2.74 BSK KF3CT TIMP2 ~

AEIP1613	CLP ABCG2. 6hr ~

AE1P1614 CLP ACT IN 6hr ~

AEIP1623	CLP HMGCS2 6hr ~

AE1P162.8 CLP ABCB11 24hr ~

AE1P162.4 CLP SLCQ1B1 6hr ~

AEID1629	CLP ABCG2 24hr ~

AEID1388 Tanguav ZF 120hpf NC legacy ~

AEID284 BSK LPS 1L8 ~

AEIP1611	CLP ABCB1 6hr ~

AEIP1621	CLP GAPDH 6hr ~

AE1P1618 CLP CYP2C19 8hr~

-------
AEID1616 CLP CYP1A2 6hr ~

AEID1617 CLP CYP2B6 6hr ~
AE1P1643 CLP ABCB1 48hr ~
AE1P1640 CLP SLC01B1 24 hr ~
AEID294 BSK LPS TissueFactor ~

6hr ~

AEIP1635 CLP CYP2.C9 24hr ~
AEID288 BSK LPS MCSF ~

AEID278 BSK LPS CP40 ~

AE IP1652

CLP

CYP3A4 48hr~

AEIP1644

CLP

ABCB11 48hr~

AE IP 1642

CLP

UGT1A1 24hr ~

AEIP1653

CLP

GAPPH 48hr~

AEIP1631

CLP

CYP1A1 24hr ~

AE IP 1647

CLP

CYP1A1 48hr~

AEIP1634

CLP

CYP2C19 24hr ~

AEIP1639

CLP

HMGCS2 24hr ~

AEIP1650

CLP

CYP2C19 48hr ~

AEIP1651

CLP

CYP2C9 48hr ~

AEIP1654

CLP

GSTA2 48hr ~

AEIP1636

CLP

CYP3A4 24hr ~

AEIP1648

CLP

CYP1A2 48hr ~

AEIP1646

CLP

ACTIN 48hr ~

AEIP1638

CLP

GSTA2 24hr ~

AEIP1633

CLP

CYP2B6 24hr ~

AEIP1641

CLP

SULT2A 24hr ~

AE IP 1746

ATG

GPCR APORA2A TRANS ~

AE IP1752

ATG

GPCR APRA2B TRANS ~

AE IP 1748

ATG

GPCR APORA2B TRANS ~

AEIP1649

CLP

CYP2B6 48hr ~

AEID1664 CEETOX H295	1 viabili

3H 24hr ~

AEID1750 ATG GPCR ADRA1A TRANS ~
AEID1660 TOX2.1 RAR LUC Agonist viability ~
AEID1756 ATG GPCR ADRB3 TRANS ~

AE IP 1764 ATG GPCR EDNRA TRANS ~
AEIP1659 TOX2.1 RAR LUC Agonist ~
AEIP1658 CLP UGT1A1 48hr ~

AEIP1682

STM

H9 CvstinelSnorm perc ~

AEiP1655

CLP

HMGCS2 48hr ~

AEIP1688

STM

H9 OmithinelSnorm perc ~

AEIP1657

CLP

SULT2A 48hr ~

AE IP1796

ATG

GPCR PTGIR TRANS ~

AEIP1758

ATG

GPCR CHRM3 TRANS ~

-------
AEID 1844 T0X21 API BLA Agonist chl ~

AEID1827 ArunA Migration hNP ~

AEID1835 ArunA NOG NeuritesPerNeuron ~

AEID1846 TOX2.1 API BLA Agonist ratio ~

AEID1831 ArunA NOG NucleusCount ~

AEID1826 ArunA CellTiter hNC ~

AE ID 1847 TOX21 API BLA Agonist viability ~

AEID1823 TOX21 AR LUC MDAKB2 Agonist 3uM Nilutamide viabilil

)X21 API BLA Agonist
AEID1840 TOX21 RAR i I l> Antagonist viability ~

AEID1933 ATG chERa XSP1 ~

AEID1855 ACEA AR agonist 80hr ~

AEID1858 SIM H9 NormalizedViabilitv ~

AEID1925 ATG zfERl XSP1 ~

AEID1921 ATG zfAR XSP1 ~

AEID1856 AC	antagonist 80hr ~

AEID1923 ATG frERl XSPl ~

AE ID 185 2. ACEA ER AUC viability ~

AEID1939 ATG HERb XSPl ~

AE1D741 OT AR ARSRC1 0960 ~

AEID1937 ATG trERa XSPl ~

AE1D1957 ATG mPXR XSPl ~

AEID1955 ATG zfPPARg XSPl ~

AE1D1951 ATG hPPARg XSPl ~

AE1D1941 ATG GAL4 XSPl ~

AEID1963 ATG trTRa XSPl ~

AEID1943 ATG M 06 XSPl ~

AE1D1953 ATG mPPARg XSPl ~

AE ID 1947 ATG M 32. XSPl ~

AE1D1949 ATG M 61 XSPl ~

AEID1945 ATG M 19 XSPl ~

AE1D1959 ATG frTRa XSPl ~

AE ID1961 ATG hTRa XSPl ~

AE1D754 OT FXR FXRSRC1 1440 ~

AE1D747 OT ER ERbERb 1440 ~

AE1D759 TOX2.1 AR BLA Agonist chl ~

AE1D757 OT PPARg PPARgSRCl 0480 ~

AE1D760 TOX2.1 AR BLA Agonist ch2. ~

AE1D758 OT PPARg PPARgSRCl 1440 ~

AE1D791 TOX2.1 GR BLA Agonist chl ~

AE1D762. TOX2.1 AR BLA Antagonist ratio ~

AEID765 TOX21 AR LUC MDAKB2. Antagonist IQnM R1881 ~
AEID1971 ATG XTT Cytotoxicity XSPl ~

AE1D761 TOX2.1 AR BLA Agonist ratio ~

-------
AEID1969 ATG zfTRb XSP1 ~

AE1D788 T0X21 ERa LUC VM7 Agonist ~

AEID763 TOX21 AR BLA Antagonist viability ~

AE1D766 TOX21 AR LUC MDAKB2. Antagonist IQnM R1881 viability ~

AEID 1965 ATG zfT'Ra XSP1 ~

AEID208 BSK CASM3C HLADR ~

AE ID1967 ATG hTRb XSP1 ~

AEID1981 ATG GAL4 XSP2 ~

AEID764 TOX21 AR LUC MDAKB2 Agonist ~

AEID1973 ATG M 06 XSP2 ~

AE ID 1987 ATG zfER2a XSP2 ~

AE ID1977 ATG hERb XSP2 ~

AEID1989 ATG HAR XSP2 ~

AEID1991 ATG chAR XSP2 ~

AE ID 1979 ATG trAR XSP2 ~

AEID1975 ATG trERa XSP2 ~

AEID1985 ATG chERa XSP2 ~

AEID2001 ATG HERa XSP2 ~

AEID1993 ATG frERl XSP2 ~

AEID2011 ATG zfPPARg XSP2 ~

AEID2005 ATG M 32 XSP2 ~

AE ID 1997 ATG zfAR XSP2. ~

AE1D781 TOX21 ELG1 LUC Agonist ~

AEID2003 ATG M 19 XSP2. ~

AEID1995 ATG frAR XSP2. ~

AEID216 BSK CASM3C MCP1 ~

AElDi	(21 GR BLA Agonist ratio ~

AMI1 v I >><21 ERa BLA Antagonist viability ~

AE1D789 TOX21 ERa LUC VM7 Antagonist 0.5nM E2 ~

AEID2023 ATG M 61 XSP2 ~

AE1D2025 ATG frTRa XSP2 ~

AEID2029 ATG hTRb XSP2 ~

AE ID 2.017 ATG trTRa XSP2 ~

AE ID 2.031 ATG XTT Cytotoxicity XSP2. ~

AEID226 BSK CASM3C SRB ~

AEID258 BSK HDFCGF VCAM1 ~

AEID806 TOX2.1 AhR LUC Agonist ~

AEID2055 TOX21 ERR LUC Agonist ~

AEID2118 TOX21 ERb BLA Antagonist ch2 ~

AEID2059 TOX21 ERR LUC viability ~

AEID2116 TOX21 ERb BLA Agonist viability ~

AE1D2U7 TOX2.1 ERb BLA Antagonist chl ~

AEID2120 TOX21 ERb BLA Antagonist viability ~

AEID2053 TOX21 ERa LUC VM7 Antagonist O.lnM E2. ~

-------
AEID2114 T0X2.1 ERb BLA Agonist ch2 ~

AEID2113 TOX2.1 ERb BLA Agonist chl ~

AEID2126 TOX21 PR BLA Antagonist ch2 ~

AEID2128 TOX21 PR BLA Antagonist viability ~

AEID2221 TOX21 PR BLA Followup Antagonist viability ~

AEID2222 TOX21 PR LUC Followup Agonist ~

AEID2225 TOX21 PR LUC Followup Antagonist viability ~

AEID2224 TOX21 PR LUC Followup Agonist viability ~

AEID2125 TOX21 PR BLA Antagonist chl ~

AEID2123 TOX21 PR BLA Agonist ratio ~

AEID2223 TOX21 PR LUC Followup Antagonist ~

AEID2124 TOX21 PR BLA Agonist viability ~

AEID2470 CCTE Shafer MEA acute per network burst spike number mean ~

AEID2472 CCTE Shafer MEA acute per network burst spike number stci ~

AEID2309 CCTE GLTED hDIOl ~

AEID2220 TOX21 PR BLA Followup Agonist viability ~

AEID2127 TOX2.1 PR BLA Antagonist ratio ~

AEID2151 CEETOX H295R CORTIC noMTC ~

AEID2121 TOX21 PR BLA Agonist chl ~

AEID2167 CEETOX H295R TESTO noMTC ~

AEID2165 CEETOX H295R PROG noMTC ~

AEID2161 CEETOX H295R ESTRONE noMTC ~

AEID2153 CEETOX H295R CORTISOL noMTC ~

AEID2157 CEETOX H295R DOC noMTC ~

AE1D2.2.18 TOX2.1 PR BLA Followup Agonist ratio ~

AEID2219 TOX21 PR BLA Followup Antagonist ratio ~

AEID2159 CEETOX H295R ESTRADIOL noMTC ~

AEID1127 TOX2.1 PPARg BLA Antagonist ratio ~

AEID2149 CEETOX H295R ANDR noMTC ~

AEID2122 TOX21 PR BLA Agonist ch2. ~

AEID2468 CCTE Shafer MEA acute burst percentage std ~

AEID2464 CCTE Shafer MEA acute interburst interval mean ~

AEID2460 CCTE Shafer MEA acute burst duration mean ~

AEID2462 CCTE Shafer MEA acute per burst spike number mean ~

AEID2466 CCTE Shafer MEA acute burst percentage mean ~

AEID2363 TOX21 PXR LUC Agonist ~

A	' fer MEA acute per network burst electrodes number mean ~

AEID2458 CCTE Shafer MEA acute burst number ~

AEID2476 CCTE Shafer MEA acute network burst percentage ~

AE ID 2480 CCTE Shafer MEA acute cross correlation HWHM ~

AEID2478 CCTE Shafer MEA acute cross correlation area ~

AE ID 2486 CCTE Deisenroth AIME 96WELL LUC Inactive ~

AEID2484 CCTE Deisenroth AIME 96WELL LUC Active ~

AEID2496 CCTE Shafer MEA dev burst rate ~

-------
AEID2490 CCTE Deisenroth AIME 384WELL LUC Inactive ~

AEID2494	CCTE Shafer MEA dev firing rate mean ~

AEID2491	CCTE Deisenroth AIME 384WELL CTox Inactive ~

AEID2492	CCTE Deisenroth AIME 384WELL LUC Shift ~

AEID2488	CCTE Deisenroth AIME 384WELL LUC Active ~

AEID2487	CCTE Deisenroth AIME 96WELL CTox Inactive ~

AEID2489	CCTE Deisenroth AIME 384WELL CTox Active ~

AEID2500	CCTE Shafer MEA dev bursting electrodes number ~

AEID2504	CCTE Shafer MEA dev per burst spike percent ~

AEID2502	CCTE Shafer MEA dev per burst interspike interval ~

AEID2498	CCTE Shafer MEA dev active electrodes number ~

AEID2506	CCTE Shafer MEA dev burst duration mean ~

AEID2508	CCTE Shafer MEA dev interburst interval mean ~

AEID2510	CCTE Shafer MEA dev network spike number ~

AEID2512	CCTE Shafer MEA dev network spike peak ~

AEID2520	CCTE Shafer MEA dev per network spike spike number mean ~

AEID2.518	CCTE Shafer MEA dev inter network spike interval mean ~

AEID2516	CCTE Shafer MEA dev network spike duration std ~

AEID2532	CCTE GLTED hDIQ2. ~

AEID2540	CCTE Shafer MEA acute LDH ~

AEID2514 CCTE Shafer MEA dev spike duration mean ~

AEID2530

CCTE

Shafer

MEA dev AB ~

AEID2529

CCTE

Shafer

MEA dev LDH ~

AEID2533

CCTE

GLTED

hDI03 ~

AEID2526

CCTE

Shafer

MEA dev mutual information norm i

AEID2541

CCTE

Shafer

MEA acute AB ~

AEID2697 UKN2 HCS IMR90 neural migration ~

AEID2522 CCTE Shafer MEA dev per network spike spike percent ~

AEID2524 CCTE Shafer MEA dev correlation coefficient mean ~

AE ID 2779 CCTE Mundv HCI Cortical NOG NeuriteLength ~

AEID2782 CCTE Mundy HCI Cortical Synap Neur Matur CellBodySpotCount ~

AEID2778 CCTE Mundv HCI Cortical NOG NeuriteCount ~

AEID2699 UKN2 HCS IMR90 cell viability ~

AEID2547 UKN5 HCS SBAD2 cell viability ~

AE ID 2.777 CCTE Mundv HCI Cortical NOG BPCount ~

AEID2545 UKN5 HCS SBAD2 neurite outgrowth ~

AEID2783 CCTE Mundv HCI Cortical Synap Neur Matur NeuriteCount ~

AEID2791 CCTE Mundv HCI hN2 NOG NeuriteLength ~

AE ID 2.773 IUF NPCla proliferation area 72hr ~

AEID2703 UKN4 HCS LUHMES cell viability ~

AEID2786 CCTE Mundv HCI Cortical Synap Neur Matur NeuriteSpotCountPerNeun
AEID2789 CCTE Mundv HCI hN2 NOG BPCou
AEID2701 UKN4 HCS LUHMES neurite outgrowth ~

AEID2797 CCTE Mundv HCI hNPl Pro ResponderAvglnten ~

-------
AEID2.788 CCTE Muiidv HCI Cortical Svnap Neur Matur SvnapseCount ~

AEID2790 CCTE Mundy HCI HN2 NOG NeuriteCount ~

AEID2841 BSK MvoF MMP1 ~

AEID2837 BSK MvoF 1L8 ~

AEID2847 BSK MvoF TIMP1 ~

AEID2817 BSK BT xltlt/T ~

AEID2821 BSK BT xlL6 ~

AEID2831 BSK MvoF Collagen! ~

AE ID 2.771 1UF NPClb proliferation BrdU 72hr ~

AEID2835 BSK MvoF CollagenlV ~

AEID2855 BSK BF4T ICAM1 ~

AEID2873 BSK BF4T SRB ~

AEID2839 BSK MvoF Decorin ~

AE ID 2.775 1UF NPC1 viability 72hr ~

AEID2853 BSK BF4T VCAM1 ~

AEID2865 BSK BF4T MMP1 ~

AE ID 2.877 BSK BF4T uPA ~

AEID2863 BSK BF4T Keratin818 ~

AEID2849 BSK BF4T MCP1 ~

AEID2843 BSK MvoF PA11 ~

AEID2871 BSK BF4T PA11 ~

AEID2897 BSK hDFCGF Collagenl ~

AEID2851 BSK BF4T Eotaxin3 ~

AE ID 2.913

BSK IMphe VCAMi ~

AEID2879

BSK

BE3C

AEID2869

BSK

BF4T MMPf) ~

AEID2891

BSK

CASM3C PAI1 ~

AEID2867

BSK

BF4T MMP3 ~

AEID2883

BSK

BE3C 11.8 ~

AEID2885

BSK

BE3C EGFR ~

AEID2889

BSK

BE3C MMPf) ~

AEID2861

BSK

BF4T Ilia ~

AEID2893

BSK

hDFCGF MCP1 ~

AEID2857

BSK

BF4T CD90 ~

AEID2911	BSK IMphe MlPla ~

AEID2887	BSK BE3C KeratinS 18 ~

AEID2859	BSK BF4T 1L8 ~

AEID2925	BSK IMphe MCSF ~

AEID2921	BSK IMphe 118 ~

AEID2919	BSK IMphg CD69 ~

AEID2895	BSK hDFCGF 1CAM1 ~

AEID2935	BSK LPS CD69 ~

AEID3068 CCTE Mundy HCI iCellGluta NOG NeuriteCount ~

AEID2899	BSK hDFCGF ITAC ~

-------
AEID2907 BSK KF3CT PA11 ~

AEID2903 BSK KF3CT IL8 ~

AEID2942 1UF NPC2b neuronal migration 120hr ~

AEID2944 1UF NPC2c oligodendrocyte migration 120hr ~

AEID2.915 BSK IMphg CD40 ~

AEID2901 BSK hDFCGF TIMP2 ~

AE ID 2.917 BSK IMphg ESelectin ~

AE ID 2.931 BSK IMphg SRB.Mphg ~

AEID3025 VALA TUBIPS Antagonist CellCount ~

AEID3022. VALA TUBHUV Antagonist TubuleLength ~

AEID3069 CCTE Mundv HCI iCellGluta NOG NeuriteLength ~

AEIDE	3HUV1 ScratchOnlv CellCount ~

AEID3070 CCTE Mundv HCI iCellGluta NOG NeuronCount ~

AEID2946 IUF NPC3 neuronal differentiation 120hr ~

AEID2954 IUF NPC2-5 cytotoxicity 72hr ~

AEID2960 IUF-NPC2-5 viability 120hr ~

AEID2956 IUF NPC2-5 cytotoxicity 120hr ~

AEID2958 IUF NPC2-5 cell number 120hr ~

AEID3074 CCTE Deisenroth 5AR NBTE donor ~

AEID2948 IUF NPC4 neurite length 120hr ~

AEID3067 CCTE Mundv HCI iCellGluta NOG BPCount ~

AEID3031 VALA MIGHUV2 WoundArea ~

AEID3030 VALA MIGHUV2 Bcatenin ~

AEID3028 VALA MIGHUV1 ScratchOnlv WoundArea ~

AEID3029 VALA MIGHUV2. CellCount ~

AEID3032 CCTE GLTED hlYD ~

AEID3078 CCTE Deisenroth 5AR NBTE ratio ~

AEID3087 IUF NPC1 cytotoxicity 72hr ~



AEID3090 CCTE GLTED hTPO ~



AEID3095 CCTE Deisenroth DEVTOX-GLR

legacy Sox2 ~

AEID31Q1 ATG rtGR EcoTox2 ~



AEID3096 CCTE Deisenroth DEVTOX-GLR

legacy Bra ~

AEID3091 CCTE GLTED xDI03 ~



AEID31G3 ATG imGR EcoTox2. ~



AEID3098 CCTE Deisenroth DEVTOX-GLR

legacy CellCount ~

AEID3105 ATG zfGR EcoTox2. ~



AEID3111 ATG rtPPARa EcoTox2. ~



AEID3099 ATG frGR EcoTox2. ~



AEID3092 CCTE GLTED xlYD ~



Aj	:	onist r;

AEID3115 ATG zfPPARa EcoTox2 ~
AEID3109 ATG frPPARa EcoTox2. ~
AEID3119 ATG frPPARg EcoTox2 ~

Aj	mPPARa EcoTo

-------
AEID3129 ATG frRXRb EcoTox2 ~

AEID3131

ATG

rtRXRb EcoTox2 ~

AEID3127

ATG

HRXRb EcoTox2 ~

AEID3135

ATG

zfRXRb EcoTox2 ~

AEID3117 ATG hPPARg EcoTox2 ~

AEID3141

ATG

frERl EcoTox2 ~

AEID3123

ATG

imPPARg EcoTo)

AEID3145

ATG

zfAR EcoTox2 ~

AEID3133 ATG imRXRb EcoTox2 ~

AEID3137

ATG

hERa EcoTox2 ~

AEID3125 ATG zfPPARg EcoTox2 ~

AEID1367

ATG

ERb TRANS2 ~

AEID3139

ATG

zfERl EcoTok2 ~

AEID1371

ATG

TR2 TRANS2 ~

AEID1362

ATG

RORa TRANS2, ~

AEID3147

ATG

M 61 EcoTox2 ~

AE1D1384 Tanguav ZF 120hpf PIG legacy ~
AEID1373 Tanguav ZF 120hpf YSE legacy ~
AEID1378 Tanguav ZF 120hpf OTIC legacy ~
AEID1377 Tanguav ZF 120hpf JAW legacy ~
AEID1380 Tanguav ZF 120hpf BRAI legacy ~
AEID1381 Tanguav ZF 120hpf SOMI legacy ~
AEID1387 Tanguav ZF 120hpf SWIM legacy ~
AEID1379 Tanguav ZF 120hpf PE legacy ~
AEID1619 CLD CYP2C9 8hr~

AEID3155

ATG

HGR EcoTox2 ~

AEID3151

ATG

M 32 EcoTox2 ~

AEID1386	Tanguav ZF 120hpf TRUN legacy ~

AEID3167	CCTE Mundv HCI iCellGABA NOG NeuronCount ~

AE1D162.7	CLD ABCB1 24hr ~

AEID3163	CCTE Deisenroth H295R-HTRF 384WELL CTOX ~

AEID3162	CCTE Deisenroth H295R-HTRF 384WELL TESTOSTERONE ~

AEID3161	CCTE Deisenroth H295R-HTRF 384WELL ESTRADIOL ~

AEID3196	Tanguav ZF 120hpf SM2.4 ~

AEID3153	ATG M 19 EcoTox2. ~

AEID3165	CCTE Mundv HCI iCellGABA NOG NeuriteCount ~
AEID3164 CCTE Mundv HCI iCellGABA NOG BPCount ~

AEID3166	CCTE Mundv HCI iCellGABA NOG NeuriteLength ~
AEID3194 Tanguav ZF 120hpf MQ2.4 ~

AEID3202	Tanguav ZF 120hpf LTRK ~

AEID3149	ATG M 06 EcoTox2. ~

AEID3197	Tanguav ZF 120hpf MORT ~

AEID3198 Tanguav ZF 120hpf CRAN ~

-------
AEIDE	iieuav ZF 120hpf ANY ~

AEID3204 Tanguav ZF 120hpf SKIN ~

AEID3195 Tanguav ZF 120hpf DP24 ~

AEID 1632. CLP CYP1A2 24hr ~

AEID3203 Tanguav ZF 120hpf BRN ~

AEID3211 CCTE Padilla ZF Score.Living ~

AEID3216 CCTE Padilla ZF Score.Edema ~

AEID3200 Tanguav ZF 120hpf EDEM ~

AEiD32.15 CCTE Padilla ZF Score.Craniofacial ~

AEID3206 Tanguav ZF 120hpf TCHR ~

AE1D162.5 CLP SULT2.A 6hr ~

AEID3228 CCTE Deisenroth DEVTOX-GLR Meso SOX2 ~

AE ID 1766 ATG GPCR GCGR TRANS ~

AEID3222 CCTE Padilla ZF Score.Anv ~

AEID3219 CCTE Padilla ZF Score.Position ~

AEID3214 CCTE Padilla ZF Score.Swirri bladder ~

AEID3229 CCTE Deisenroth DEVTOX-GLR Meso BRA ~

AE ID 1768 ATG GPCR GPBAR1 TRANS ~

AEID3220 CCTE Padilla ZF Score.Tail ~

AEID3217 CCTE Padilla ZF Score.Spine ~

AEID3224 CCTE Deisenroth DEVTOX-GLR Endo SOX2 ~

AE ID 1790 ATG GPCR MC3R TRANS ~

AEID3226 CCTE Deisenroth DEVTOX-GLR Endo CellCount ~

AEID3199 Tanguav ZF 120hpf AXIS ~

AE ID 1776 ATG GPCR GS TRANS ~

AEID1772 ATG GPCR GQ TRANS ~

AE ID 1770 ATG GPCR GPR40 TRANS ~

AE1D32.18 CCTE Padilla ZF Score.Pigmentation ~

AEID3264 CCTE GLTED hTTR 0.125uM ~

AE1D32.2.7 CCTE Deisenroth DEVTOX-GLR Meso SC «M ' ^

AEID1762 ATG GPCR DRD5 TRANS ~

AEID3223 CCTE Deisenroth DEVTOX-GLR Endo SOX17 ~

AE ID 1786 ATG GPCR MC1R TRANS ~

AE ID 1780 ATG GPCR HTR6 TRANS ~

AE ID 1784 ATG GPCR LPAR4 TRANS ~

AEID1792 ATG GPCR MC4R TRANS ~

AE1D32.37 CCTE Deisenroth DEVTOX-GLR Pluri BRx t."

AE ID1760 ATG GPCR DRD1 TRANS ~

AEID3236 CCTE Deisenroth DEVTOX-GLR Pluri SOX2. ~

AE1D1816 TOX21 AR LUC MDAKB2 Antagonist 0.5nM R1881 ~

AE1D32.31 CCTE Deisenroth DEVTOX-GLR Ecto SOX17 ~

)X21 AR LUC MDAKB2 Antagonist 0.5nM R1881 viabili
AEID3230 CCTE Deisenr	X-GLR Meso CellCount ~

AEID1690 STM H9 OrnCvsslSnorm ratio ~

-------
AE ID 1645

CLD

ABCG2 48hr ~





AEID3233

CCTE

Deisenroth DEVTOX-GLR

Ecto

B

AE1D1656

CLD

SLCOIBI 48hr~





AEID3238

CCTE

Deisenroth DEVTOX-GLR

Pluri

CellCount ~

AE ID1794

ATG

GPCR PTGDR TRANS ~





AEID3234

CCTE

Deisenroth DEVTOX-GLR

Ecto

CellCount ~

AE ID1778

ATG

GPCR HRH1 TRANS ~





AE ID1774

ATG

GPCR GS1 TRANS ~





AE ID 1782

ATG

GPCR HTR7 TRANS~





AEID1825 AruiiA CellTiter hNP ~

AEID1838 ArunA NOG BranchPointsPerNeurite ~

AEID1931 ATG frER2 XSP1 ~

AEID1822 TOX21 AR LUC MDAKB2 Agonist 3uM Nilutamide ~
AEID1839 TC	Antagonist ~

AEID1829 ArunA Migration hNC ~

AEiD1915 ATG frAR XSP1 ~

AEID1927

ATG

zfER2a XSPI ~

AE ID1917

ATG

hAR XSPi ~

AEID1919

ATG

trAR XSPI ~

AEID1850 AC I A AR agonist AUC viability ~

AEID1929 ATG zfER2b XSP1 ~

AEID1999 ATG zfER2b XSP2 ~

AEID2007 ATG frER2 XSP2 ~

AEID1983 ATG zfERl XSP2 ~

AEID1857 AC I A AR antagonist AUC viability ~

AE ID 2.015 ATG rriPXR XSP2 ~

AE1D2.009 ATG mPPARg XSP2 ~

AE ID 2.013 ATG hPPARg XSP2 ~

AE ID 2.019 ATG zfT'Ra XSP2. ~

AE1D2.02.1 ATG zfTRb XSP2. ~

AE1D2.02.7 ATG hTRa XSP2. ~

AE1D2.057 TOX2.1 ERR LUC Antagonist ~

AEID2054 TOX21 ERa LUC VM7 Antagonist O.lnM E2 viability ~

AE1D2U5 TOX2.1 ERb BLA Agonist ratio ~

AEID2143 CEETOX H295R 11DCORT noMTC ~

AE1D2.2.14 TOX2.1 PR BLA Followup Agonist chl ~

AEID2119 TOX21 ERb BLA Antagonist ratio ~

AEID2216 TOX21 PR BLA Followup Agonist ch2. ~

AEID2212 TOX21 ERa LUC VM7 Agoni	VI ICI182780 viabil

AEID2456 CCTE Shafer MEA acute firing rate mean ~

AEID2454 CCTE Shafer MEA acute spike number ~

AE1D2.2.15 TOX2.1 PR BLA Followup Antagonist chl ~

AEID2147 CEETOX H295R OHPROG noMTC ~

AEID1340 TOX21 ES	onis

-------
AEID2211 T0X21 ERa LUC VM7 Agonist IQiiM ICI182780 ~

AEID2217 TOX2.1 PR BLA Followup Antagonist ch2. ~

AEID2482 CCTE Shafer MEA acute synchrony index ~

AEID2485 CCTE Deisenroth AIME 96WELL CTox Active ~

AEID2.793 CCTE Mundv HCI hNPl Casp3 7 ~

AEiD2780 CCTE Mundv HCI Cortical NOG NeuronCount ~

AE1D2.792. CCTE Mundv HCI HN2 NOG NeuronCount ~

AEID2362 TOX21 PXR LUC Agonist viability ~

AE1D2.781 CCTE Mundv HCI Cortical Synap Neur Matur BPCount ~

AE ID 2.794 CCTE Mundv HCI hNPl CellTiter ~

A	> CCTE Mundv HCI Cortical Synap Neur Matur NeuriteSpotCountPerNeuriteLength

~

AEID2813 BSK BT slgG ~

AE1D2.784 CCTE Mundv HCI Cortical Synap Neur Matur NeuriteLength ~

AEID2823 BSK BT xTNFa ~

AEID2809 BSK BT Bcell Proliferation ~

AEID2811 BSK BT PBMCCytotoxicity ~

AEID2827 BSK MvoF bFGF ~

AEID2819 BSK BT x!L2 ~

AEID2825 BSK MvoF ACTA1 ~

AEID2815 BSK BT xfLUAj»

AEID2845 BSK MvoF SRB ~

AEID2833 BSK MvoF Collagenlll ~

AEID2829 BSK MvoF VCAM1 ~

AEID2875 BSK BF4T tPA ~

AEID2905 BSK KF3CT MIG ~

AEID2929 BSK IMphg SRB ~

AEID2938 1UF NPC2a radial glia migration 72hr ~

AEID2909 BSK IMphg MCPl ~

AE1D2.92.7 BSK IMphg IL10 ~

AEID2933 BSK LPS Thrombomodulin ~

AEID2923 BSK IMphg ILla ~

AE1D3019 VALA TUBHUV Agonist CellCount ~

AE1D302.1 VALA TUBHUV Antagonist CellCount ~

AE1D302.3 VALA TUBIPS Agonist CellCount ~

AEID3088 CCTE GLTED hTBG ~

AEID2950 IUF NPC4 neurite area 120hr ~

AEID2940 IUF NPC2a radial glia migration 120hr ~

AEID302.4 VALA TUBIPS Agonist TubuleLength ~

AEID302.0 VALA TUBHUV Agonist TubuleLength ~

AEID3107 ATG hPPARa EcoTox2. ~

AEID2952 IUF N	godendrocvte differentiation 120hr ~

AEID3089 CCTE GLTED hTTR O.SuM ~

AEID3143 ATG hAR EcoTox2. ~

-------
AEID3076 CCTE Deisenroth 5AR NBTE acceptor ~

AEID1612 CLP ABCB11 6hr ~

AEID3072 CCTE Deisenroth 5AR NBTE autofluor ~

AEID1364 ATG RXRg TRANS2. ~

AEID3094 CCTE Deisenroth DEVTOX-GLR legacy Soxl7 ~

AEID1382 Taneuav ZF 120hpf PFIN legacy ~

AE1D162.0 CLP CYP3A4 6hr ~

AEID3205 Taneuav ZF 120hpf NC ~

AE1D3235 CCTE Deisenroth DEVTOX-GLR Pluri SOX17 ~

AEID3221 CCTE Padilla ZF Score.Blood pooling ~

AEID3213 CCTE Padilla ZF Score.General

~

AEID3225 CCTE Deisenroth DEVTOX-GLR

Endo BRA ~

AEID 1754 ATG GPCR ADRB2 TRANS ~
AEID3232 CCTE Deisenroth DEVTOX-GLR

Ecto SOX2 ~

AE ID 1788 ATG GPCR MC2R TRANS ~
AEID1833 ArunA NOG NeuriteLength ~



AEID1797 Tanguay ZF 12Qhpf ActivityScore legacy ~

At H iO<21 GR BLA Agonist ch2. ~

AEID2145	CEETOX H295R OHPREG noMTC ~

AEID1935	ATG hERa XSP1 ~

AEID2787 CCTE Mundv HCI Cortical Synap Neur Matur NeuronCount ~

AEID1353	ATG Rev ERB A TRANS2. ~

AEID2881 BSK BE3C 1TAC ~

AEID3212 CCTE Padilla ZF Score.Hatched ~

AEID3201 Taneuav ZF 120hpf MUSC ~

-------
Assay EndpointlD:2

ACEA_ER_80hr

1. General Information

1.1 Assay Title: ACEA Biosciences xCELLigence Real-Time Cell Analysis on Estrogen Receptor Agonism for
Proliferation

1.2 Assay Summary: ACEA_ER is a cell-based, single-readout assay that uses T47D, a human breast cell line, with
measurements taken at 80 hours after chemical dosing in a 96-well plate, although T02 (mcO.srcf) used a 384-
well plate. Differences in plate size can be ignored given data normalization. ACEA_ER_80hr is one of two assay
component(s) measured or calculated from the ACEA_ER assay. It is designed to make measurements of real-
time cell-growth kinetics, a form of growth reporter, as detected with electrical impedance signals by Real-Time
Cell Electrode Sensor (RT-CES) technology. Data from the assay component ACEA_ER_80hr was analyzed into 1
assay endpoint. This assay endpoint, ACEA_ER_80hr_Positive, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of growth reporter,
measures of the cells for gain-of-signal activity can be used to understand the signaling at the pathway-level as
they relate to the gene ESR1. Furthermore, this assay endpoint can be referred to as a primary readout, because
this assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is steroidal.

1.3 Date of Document Creation: September 05 2024

1.4 Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5 Assay Source: ACEA Biosciences, Inc. (ACEA) is a privately owned biotechnology company that developed a
realtime, label free, cell growth assay system called xCELLigence based on a microelectronic impedance readout.

1.6 Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7 References: For complete list of references, see Section 6: Bibliography.

1.8 Proprietary Elements: Assay is non-proprietary; xCELLigence RTCA software and biosensor technology are
available from ACEA Biosciences, Inc. and T-47D cells are commercially available from American Type Culture
Collection (ATCC HTB-133) with signed Material Transfer Agreement (MTA).

1.9 Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing positive
controls for proliferation (17 beta-estradiol) and cytotoxicity (MG132), negative controls (assay media, RPMI
1640), and two concentrations (0.5 percent and 0.125 percent) of DMSO solvent controls. Following a 24-hour
incubation period, the cells are exposed to test chemicals for 80 hours and response is monitored no less than
once per hour.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

2.1 Purpose: Electrical impedance is used to quantify changes to the growth of the cells where increase
impedance is positively correlated with increased cell growth.

The ACEA_ER assay exposed human breast carcinoma cell (T-47D) cultures to the ToxCast library of diverse
environmental chemicals using an eight-point, 1:4 dilution series concentration-response format (starting at a
maximum final concentration of lOOuM), using MG132 (cytotoxicity) and Estradiol (E2) (proliferation) as positive

-------
controls and assay media and DMSO as a negative control and solvent control, respectively. All control chemicals
were tested in quadruplicate on each plate. The ACEA_ER assay analyzed changes in cell adhesion and
morphology at the electrode: solution interface (located on the bottom of culture wells) using electronic
microsensors. Changes in electrical impedance were monitored in real-time at the plate surface to investigate
the potential activation of the estrogen signaling pathway and subsequent increases in growth or changes in
cell structure following 80-hour incubation with the test chemicals. The electrical signal produced by the
experimental system can be used to detect changes in cell number, morphology and adhesion which occur in
response to xenoestrogenic activation of ER-mediated pathways, and concentration-response curves were
modeled for each chemical to determine half-maximal activity levels.

2.2 Scientific Principles: Endocrine disrupting chemicals (EDCs) interfere with normal hormone biosynthesis,
signaling or metabolism and impact regulatory pathways in humans and wildlife. Many EDCs interfere with
normal steroidal activity by impacting estrogenic signaling pathways. The estrogen receptor mediates gene
expression in response to estrogen exposure, and modulates the activity for a wide variety of physiological
processes. The activity of estrogenic chemicals is generally probed in vitro by monitoring ligand-binding in
experimental systems, however estrogenic potency is also a function of interaction with transcriptional
machinery and other signaling pathways. This assay was designed to identify chemical perturbagens which can
affect a cell proliferation response in human breast carcinoma cells by acting as xenoestrogenic compounds
which impact estrogen signaling pathways. While cell proliferation rates can be altered via multiple pathways,
growth responses in T47D cells are considered to be particularly reliable indicators of estrogenic activation. This
assay is intended for use as a part of an integrated testing strategy, to screen a large structurally diverse chemical
library for compounds which potentially affect endocrine systems in exposed populations by interacting with
estrogen receptor mediated signaling pathways. There is strong evidence that estrogen receptor activity in early
life is a molecular initiating event (MIE) in a developing Adverse Outcome Pathways (AOP) leading to breast
cancer in both animal and human models and to endometrial carcinoma in the mouse, and ER agonism is the
leading to reproductive dysfunction in oviparous vertebrates, and there is some evidence that estrogen receptor
activation is the MIE for putative adverse outcome pathways leading to reduced survival due to renal failure
and leading to skewed sex ratios due to altered sexual differentiation in males. ER antagonism has strong
evidence as the MIE for an AOP describing reduction of vitellogenin synthesis in liver, which can lead to reduced
cumulative fecundity in repeat-spawning fish species. Chemical-activity profiles derived from this assay can
inform prioritization decisions for compound selection in more resource intensive in vivo studies to further
investigate the involvement of ER interference in pathways leading to hazardous outcomes in biological systems.

2.3 Experimental System: adherent T47D cell line used. T-47D human breast carcinoma ductal cell line, originally
derived in 1974 from pleural effusion of a 57-year-old patient, which exhibits epithelial-like morphology
(Horwitz et al. 1978, Keydar et al. 1979).

2.4 Metabolic Competence: T-47D cells contain specific high affinity receptors for estradiol, progesterone,
glucocorticoid and androgen (Horwitz et al. 1978). Some potential for P450 mediated metabolism is present,
e.g. CYP1A1, CYP1A2, CYP1B1 (Angus et al. 1999, Hevir et al. 2011, MacPherson and Matthews 2010, Spink et
al. 2002, Spink et al. 1998), CYP2B6 (Lo et al. 2010), CYP3A4 (Nagaoka et al. 2006) and CYP2C8(Mitra et al. 2011),
as well as some experimental evidence for the capacity to retain expression of some phase II metabolizing
enzymes, e.g., UGTs (Harrington et al. 2006, Hevir et al. 2011), GSTs (Hevir et al. 2011) and sulphotransferases
e.g., SULTlA3(Miki et al. 2006), SULT1E1, SULT2B1 (Hevir et al. 2011).

2.5 Exposure Regime: The xCELLigence system Multi-E-Plate stations were used to measure the time-dependent
response to chemicals. Each compound was tested in an eight-point, 1:4 serial dilution series starting at a
maximum final concentration of 100 uM. A maximum starting concentration of 0.5% DMSO was present in the
100 uM chemical samples and was diluted along with the test article dilution series. The screen was performed
in biological duplicate using two separate, 96-well, E-Plates 96 for each dilution series (n = 2). Positive controls
(MG132 and E2) and a negative control (assay media) were tested in quadruplicate on each testing plate. Then,
0.5% and 0.125% DMSO were tested in duplicates in each plate to serve as solvent controls for the 2 highest
concentrations of testing compounds: 100 uM and 25 uM. Reference compounds were tested with 8

-------
concentrations with 1:5 serial dilutions. All screening was carried out by ACEA Biosciences, Inc. (San Diego, CA).
T-47D cells purchased from ATCC were maintained in RPMI1640 media supplemented with 10% characterized
fetal bovine serum (FBS). Before screening, T-47D cells were preconditioned in assay medium: phenol red-free
RPMI1640 supplemented with 10% charcoal-stripped FBS. Cells were then detached and seeded in E-Plates 96
in assay medium. After overnight monitoring of growth once every hour, compounds were added toT-47D cells
and remained in the medium until the end of the experiment. Cellular responses were then recorded once every
5 min for the first 5 h, and once every hour for an additional 100 h.

Baseline median absolute deviation for the assay (bmad): 8.497
Response cutoff threshold used to determine hit calls: 25.492
Detection technology used: RT-CES (Label Free Technology)

2.6 Response: Increased cell proliferation in response to xenoestrogenic interference with ER-mediated pathways
as measured by monitoring electrical impedance at the cell-plate interface. One possible effect of endocrine
disrupting chemicals is increased cell growth through perturbation of pathways linked to cell cycle regulation.
Activation of the estrogen receptor (ER) signaling pathway, for example, is one possible mechanism that
underlies cell proliferation in hormonally sensitive tissues such as mammary and endometrial tissue. The role of
steroid hormones in the regulation of some mammary tumors has been well established (Russo and Russo 2006,
Yager and Davidson 2006) and has motivated the development of estrogen pathway-based chemotherapeutics.
This assay was designed to identify those chemicals with the potential to affect cell growth by activating the
estrogen receptor-mediated cell proliferation pathway. The assay uses electronic microsensors located at the
bottom of the cell culture well to detect changes in cell number, morphology, and adhesion through electrical
impedance measurement at the electrode-solution interface following 80-hour incubation with test chemicals.

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8 Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast ER Pathway Model: Estrogen receptor assays used in ToxCast ER Pathway model

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

25 nM
Key positive control:

17b-Estradiol

Target (nominal) number of replicates:

Standard maximum concentration tested:

250 nM
Neutral vehicle control:

DMSO

-------
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data were collected from the xCELLigence system which converts raw impedance values into the
Cell Index (CI) value; this is a measure of adhesion where CI = (impedance at time point n - impedance in the
absence of cells)/nominal impedance value. These data were then converted to a Normalized Cell Index
according to the equation NCI(Ti) = CI(Ti)/CI(Tk), where i = 1,2,3,....N where CI(Tk) is the last time point before
chemical addition, CI(Ti) is the cell index at the i-th measured time point, and N is the total number of time
points. Data were grouped by chemical and smoothed to combine replicates using a simple moving average (as
the replicates were assessed in duplicate on separate plates so the time points were not identical). DMSO
controls were considered as baseline for activity, and 17-beta-Estradiol was used as a positive control and 100
percent activity for all the test chemicals on that plate. For cell loss, the NCI value at the time of compound
administration was considered to represent complete (100%) viability. MG132 (2 uM), a proteasome inhibitor
and known cytotoxic agent, was used as the positive control for cell loss and was tested in quadruplicate on
each plate. The minimum average response on each plate was used as a positive control for cell loss for all the
test chemicals on the corresponding plate. If a chemical sample was run on two different plates, then the
minimum NCI values for MG132 were averaged. If an NCI value for MG132 fell below zero, the response was
considered to be below the limit of detection and was replaced with the minimum value greater than zero across
all plates. All smoothened NCI values were then converted to a percentage of positive control, which was
considered to represent no (0%) viability. Concentration response curves were generated using smoothed NCI
values and all statistical analyses were conducted using R programming language, employing tcpl package to
generate model parameters and confidence intervals.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise maximum,
by assay plate ID (apid), of the medians of the corrected values (cval) forgain-of-signal single- or multiple-
concentration negative control wells (wilt = m or o) by apid, well type, and concentration.), 5: resp.pc
(Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference between
the corrected (cval) and baseline (bval) values divided the difference between the positive control (pval)
and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 18: resp.shiftneg.3bmad (Shift all the normalized response
values (resp) less than -3 multiplied by the baseline median absolute deviation (bmad) to 0; if resp < -
3*bmad, resp = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response

-------
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 3395	Number of chemicals tested: 3183

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
401

Inactive hit count: 0
-------
power(pow) model:
linear-polynomial (polyl) model:

939

116

quadratic-polynomial(poly2) model: 421

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

4

774

17

147

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-------
1.645

Neutral control median absolute deviation, by plate: nmad	0.109

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	6.98%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	2.993

Positive control well median absolute deviation, by plate: pmad	0.203

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	5.416

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	1

Negative control well median absolute deviation value, by plate: mmad	0

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-5.314

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 147.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Xing JZ, Zhu L, Gabos S, Xie L. Microelectronic cell sensor assay for detection of cytotoxicity and
prediction of acute toxicity. Toxicol In Vitro. 2006 Sep;20(6):995-1004. Epub 2006 Feb 14. PubMed PMID:
16481145., Rotroff DM, Dix DJ, Houck KA, Kavlock RJ, Knudsen TB, Martin MT, Reif DM, Richard AM, Sipes NS,
Abassi YA, Jin C, Stampfl M, Judson RS. Real-time growth kinetics measuring hormone mimicry for ToxCast
chemicals in T-47D human ductal carcinoma cells. Chem Res Toxicol. 2013 Jul 15;26(7):1097-107.
doi:10.1021/tx400117y. Epub 2013 Jun 10. PubMed PMID: 23682706.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1850

ACEA_AR_agon ist_AU C_via bi lity

1.	General Information

1.1	Assay Title: ACEA Biosciences xCELLigence Real-Time Cell Analysis on Androgen Receptor Agonism for Viability

1.2	Assay Summary: ACEA_AR_agonist is a cell-based, single-readout assay that uses 22Rvl, a human prostate
cancer cell line, with measurements taken at 80 hours after chemical dosing in a 384-well plate, although T05
and T06 (mcO.srcf) used a 96-well plate. Differences in plate size can be ignored given data normalization.
ACEA_AR_80hr is one of two assay component(s) measured or calculated from the ACEA_AR assay. It is designed
to make measurements of real-time cell-growth kinetics, a form of growth reporter, as detected with electrical
impedance signals by Real-Time Cell Electrode Sensor (RT-CES) technology. Data from the assay component
ACEA_AR_AUC_viability was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of growth reporter, loss-of-signal activity can be used to
understand changes in the viability. Furthermore, this assay endpoint can be referred to as a secondary readout,
because this assay has produced multiple assay endpoints where this one serves a viability function. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell cycle
intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: ACEA Biosciences, Inc. (ACEA) is a privately owned biotechnology company that developed a
realtime, label free, cell growth assay system called xCELLigence based on a microelectronic impedance readout.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; xCELLigence RTCA software and biosensor technology are
available from ACEA Biosciences, Inc. and 22Rvl cells are commercially available from American Type Culture
Collection (ATCC HTB-133) with signed Material Transfer Agreement (MTA).

1.9	Assay Throughput: 384-well plate. The assay is conducted on 96-well plates with each plate containing positive
controls for proliferation (testosterone) and cytotoxicity (MG132), negative controls (assay media, RPMI1640),
and two concentrations (0.5 percent and 0.125 percent) of DMSO solvent controls. Following a 24-hour
incubation period, the cells are exposed to test chemicals for 80 hours and response is monitored no less than
once per hour.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Electrical impedance is used to quantify changes to the growth of the cells where increase
impedance is positively correlated with increased cell growth

The ACEA_AR assay exposed human prostate cell (22Rvl) cultures to the ToxCast library of diverse
environmental chemicals using an eight-point, 1:4 dilution series concentration-response format (starting at a
maximum final concentration of lOOuM), using MG132 (cytotoxicity) and testosterone (proliferation) as positive
controls and assay media and DMSO as a negative control and solvent control, respectively. All control chemicals
were tested in quadruplicate on each plate. The ACEA_AR assay analyzed changes in cell adhesion and

-------
morphology at the electrode: solution interface (located on the bottom of culture wells) using electronic
microsensors. Changes in electrical impedance were monitored in real-time at the plate surface to investigate
the potential activation of the androgen signaling pathway and subsequent increases in growth or changes in
cell structure following 80-hour incubation with the test chemicals. The electrical signal produced by the
experimental system can be used to detect changes in cell number, morphology and adhesion which occur in
response to xenoestrogenic activation of AR-mediated pathways, and concentration-response curves were
modeled for each chemical to determine half-maximal activity levels.

2.2	Scientific Principles: Endocrine disrupting chemicals (EDCs) interfere with normal hormone biosynthesis,
signaling or metabolism and impact regulatory pathways in humans and wildlife. Androgens, such as
testosterone, are widely recognized for their importance in sexual development and differentiation but also
play roles in metabolism, growth, development, and behavior and act as an intercellular signal (Bhasin et al.,
2007; Monks & Holmes, 2018; Sumpter, 2005). Agonism of the androgen receptor is listed as a molecular
initiating event in AOP #23, leading to reproductive dysfunction in fish (Villeneuve, 2021).

2.3	Experimental System: adherent 22Rvl cell line used. 22Rvl is a human prostate carcinoma epithelial cell line
derived from a xenograft that was serially propagated in mice.

2.4	Metabolic Competence: The 22Rvl cell line expresses androgen receptor (AR) and prostate-specific antigen
(PSA), both of which are markers of prostate cancer. The presence of these markers in 22Rvl cells confirms their
origin from prostate cancer tissue and highlights their relevance in studying the disease. Importantly, the 22Rvl
cell line is unique in that it expresses both full-length and truncated forms of ARs. This mixed expression pattern
is commonly observed in androgen deprivation resistant prostate cancers, making the 22Rvl cell line a valuable
model for studying the mechanisms underlying resistance to hormonal therapies. Morphologically, 22Rvl cells
exhibit epithelial characteristics and are cultured as adherent monolayers, providing a convenient system for in
vitro experimentation.

2.5	Exposure Regime: The xCELLigence system Multi-E-Plate stations were used to measure the time-dependent
response to chemicals. Each compound was tested in an eight-point, 1:4 serial dilution series starting at a
maximum final concentration of 100 uM. A maximum starting concentration of 0.5% DMSO was present in the
100 uM chemical samples and was diluted along with the test article dilution series. The screen was performed
in biological duplicate using two separate, 96-well, E-Plates 96 for each dilution series (n = 2). Positive controls
(MG132 and testosterone) and a negative control (assay media) were tested in quadruplicate on each testing
plate. Then, 0.5% and 0.125% DMSO were tested in duplicates in each plate to serve as solvent controls for the
2 highest concentrations of testing compounds: 100 uM and 25 uM. Reference compounds were tested with 8
concentrations with 1:5 serial dilutions. All screening was carried out by ACEA Biosciences, Inc. (San Diego, CA).
22Rvl cells purchased from ATCC were maintained in media supplemented with 10% fetal bovine serum (FBS).
Before screening, 22Rvl cells were preconditioned in assay medium. Cells were then detached and seeded in
E-Plates 96 in assay medium. After overnight monitoring of growth once every hour, compounds were added to
T-47D cells and remained in the medium until the end of the experiment. Cellular responses were then recorded
once every 5 min for the first 5 h, and once every hour for an additional 100 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.144 uM
Key positive control:

MG 132

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

105 uM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 8.956
Response cutoff threshold used to determine hit calls: 26.869
Detection technology used: RT-CES (Label Free Technology)

-------
2.6	Response: Increased cell proliferation in response to xenoestrogenic interference with AR-mediated pathways
as measured by monitoring electrical impedance at the cell-plate interface. One possible effect of endocrine
disrupting chemicals is increased cell growth through perturbation of pathways linked to cell cycle regulation.
Activation of the androgen receptor (AR) signaling pathway, for example, is one possible mechanism that
underlies cell proliferation in hormonally sensitive tissues such as mammary and endometrial tissue. This assay
was designed to identify those chemicals with the potential to affect cell growth by activating the androgen
receptor-mediated cell proliferation pathway. The assay uses electronic microsensors located at the bottom of
the cell culture well to detect changes in cell number, morphology, and adhesion through electrical impedance
measurement at the electrode-solution interface following 80-hour incubation with test chemicals.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data were collected from the xCELLigence system which converts raw impedance values into the
Cell Index (CI) value; this is a measure of adhesion where CI = (impedance at time point n - impedance in the
absence of cells)/nominal impedance value. These data were then converted to a Normalized Cell Index
according to the equation NCI(Ti) = CI(Ti)/CI(Tk), where i = 1,2,3,....N where CI(Tk) is the last time point before
chemical addition, CI(Ti) is the cell index at the i-th measured time point, and N is the total number of time
points. Data were grouped by chemical and smoothed to combine replicates using a simple moving average (as
the replicates were assessed in duplicate on separate plates so the time points were not identical). DMSO
controls were considered as baseline for activity, and testosterone was used as a positive control and 100
percent activity for all the test chemicals on that plate. For cell loss, the NCI value at the time of compound
administration was considered to represent complete (100%) viability. MG132 (2 uM), a proteasome inhibitor
and known cytotoxic agent, was used as the positive control for cell loss and was tested in quadruplicate on
each plate. The minimum average response on each plate was used as a positive control for cell loss for all the
test chemicals on the corresponding plate. If a chemical sample was run on two different plates, then the
minimum NCI values for MG132 were averaged. If an NCI value for MG132 fell below zero, the response was
considered to be below the limit of detection and was replaced with the minimum value greater than zero across
all plates. All smoothened NCI values were then converted to a percentage of positive control, which was
considered to represent no (0%) viability. Concentration response curves were generated using smoothed NCI
values and all statistical analyses were conducted using R programming language, employing tcpl package to
generate model parameters and confidence intervals.

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 15:
pval.apid.medncbyconc.min (Calculate the positive control value (pval) as the plate-wise minimum, by
assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single- or multiple-
concentration negative control wells (wilt = m or o) by apid, well type, and concentration.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 27:
ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis
direction. Typically used for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag

-------
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1845	Number of chemicals tested: 1830

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	173.792

Neutral control median absolute deviation, by plate: nmad	6.742

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	4.03%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	223.133

Positive control well median absolute deviation, by plate: pmad	20.616

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	2.438

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	73.087

Negative control well median absolute deviation value, by plate: mmad	2.686

-------
Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

NA

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

-12.678

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

NA

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 188.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Judson R, Houck K, Paul Friedman K, Brown J, Browne P, Johnston PA, Close DA, Mansouri K,
Kleinstreuer N. Selecting a minimal set of androgen receptor assays for screening chemicals. Regul Toxicol
Pharmacol. 2020 Nov; 117:104764. doi: 10.1016/j.yrtph.2020.104764. Epub 2020 Aug 14. PMID: 32798611;
PMCID: PMC8356084.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1852

ACEA_ER_AUC_viability

1.	General Information

1.1	Assay Title: ACEA Biosciences xCELLigence Real-Time Cell Analysis on Estrogen Receptor Agonism for Viability

1.2	Assay Summary: ACEA_ER is a cell-based, single-readout assay that uses T47D, a human breast cell line, with
measurements taken at 80 hours after chemical dosing in a 96-well plate, although T02 (mcO.srcf) used a 384-
well plate. Differences in plate size can be ignored given data normalization. ACEA_ER_AUC_viability is one of
two assay component(s) measured or calculated from the ACEA_ER assay. It is designed to make measurements
of real-time cell-growth kinetics, a form of growth reporter, as detected with electrical impedance signals by
Real-Time Cell Electrode Sensor (RT-CES) technology. Data from the assay component ACEA_ER_AUC_viability
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of growth reporter, loss-of-signal activity can be used to understand changes in the
viability. Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has
produced multiple assay endpoints where this one serves a viability function. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the
subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: ACEA Biosciences, Inc. (ACEA) is a privately owned biotechnology company that developed a
realtime, label free, cell growth assay system called xCELLigence based on a microelectronic impedance readout.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; xCELLigence RTCA software and biosensor technology are
available from ACEA Biosciences, Inc. and T-47D cells are commercially available from American Type Culture
Collection (ATCC HTB-133) with signed Material Transfer Agreement (MTA).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing positive
controls for proliferation (17 beta-estradiol) and cytotoxicity (MG132), negative controls (assay media, RPMI
1640), and two concentrations (0.5 percent and 0.125 percent) of DMSO solvent controls. Following a 24-hour
incubation period, the cells are exposed to test chemicals for 80 hours and response is monitored no less than
once per hour.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Electrical impedance is used to quantify changes to the growth of the cells where increase
impedance is positively correlated with increased cell growth

The ACEA_ER assay exposed human breast carcinoma cell (T-47D) cultures to the ToxCast library of diverse
environmental chemicals using an eight-point, 1:4 dilution series concentration-response format (starting at a
maximum final concentration of lOOuM), using MG132 (cytotoxicity) and Estradiol (E2) (proliferation) as positive
controls and assay media and DMSO as a negative control and solvent control, respectively. All control chemicals
were tested in quadruplicate on each plate. The ACEA_ER assay analyzed changes in cell adhesion and

-------
morphology at the electrode: solution interface (located on the bottom of culture wells) using electronic
microsensors. Changes in electrical impedance were monitored in real-time at the plate surface to investigate
the potential activation of the estrogen signaling pathway and subsequent increases in growth or changes in
cell structure following 80-hour incubation with the test chemicals. The electrical signal produced by the
experimental system can be used to detect changes in cell number, morphology and adhesion which occur in
response to xenoestrogenic activation of ER-mediated pathways, and concentration-response curves were
modeled for each chemical to determine half-maximal activity levels.

2.2	Scientific Principles: Endocrine disrupting chemicals (EDCs) interfere with normal hormone biosynthesis,
signaling or metabolism and impact regulatory pathways in humans and wildlife. Many EDCs interfere with
normal steroidal activity by impacting estrogenic signaling pathways. The estrogen receptor mediates gene
expression in response to estrogen exposure, and modulates the activity for a wide variety of physiological
processes. The activity of estrogenic chemicals is generally probed in vitro by monitoring ligand-binding in
experimental systems, however estrogenic potency is also a function of interaction with transcriptional
machinery and other signaling pathways. This assay was designed to identify chemical perturbagens which can
affect a cell proliferation response in human breast carcinoma cells by acting as xenoestrogenic compounds
which impact estrogen signaling pathways. While cell proliferation rates can be altered via multiple pathways,
growth responses in T47D cells are considered to be particularly reliable indicators of estrogenic activation. This
assay is intended for use as a part of an integrated testing strategy, to screen a large structurally diverse chemical
library for compounds which potentially affect endocrine systems in exposed populations by interacting with
estrogen receptor mediated signaling pathways. There is strong evidence that estrogen receptor activity in early
life is a molecular initiating event (MIE) in a developing Adverse Outcome Pathways (AOP) leading to breast
cancer in both animal and human models and to endometrial carcinoma in the mouse, and ER agonism is the
leading to reproductive dysfunction in oviparous vertebrates, and there is some evidence that estrogen receptor
activation is the MIE for putative adverse outcome pathways leading to reduced survival due to renal failure
and leading to skewed sex ratios due to altered sexual differentiation in males. ER antagonism has strong
evidence as the MIE for an AOP describing reduction of vitellogenin synthesis in liver, which can lead to reduced
cumulative fecundity in repeat-spawning fish species. Chemical-activity profiles derived from this assay can
inform prioritization decisions for compound selection in more resource intensive in vivo studies to further
investigate the involvement of ER interference in pathways leading to hazardous outcomes in biological systems.

2.3	Experimental System: adherent T47D cell line used. T-47D human breast carcinoma ductal cell line, originally
derived in 1974 from pleural effusion of a 57-year-old patient, which exhibits epithelial-like morphology
(Horwitz et al. 1978, Keydar et al. 1979).

2.4	Metabolic Competence: T-47D cells contain specific high affinity receptors for estradiol, progesterone,
glucocorticoid and androgen (Horwitz et al. 1978). Some potential for P450 mediated metabolism is present,
e.g. CYP1A1, CYP1A2, CYP1B1 (Angus et al. 1999, Hevir et al. 2011, MacPherson and Matthews 2010, Spink et
al. 2002, Spink et al. 1998), CYP2B6 (Lo et al. 2010), CYP3A4 (Nagaoka et al. 2006) and CYP2C8(Mitra et al. 2011),
as well as some experimental evidence for the capacity to retain expression of some phase II metabolizing
enzymes, e.g., UGTs (Harrington et al. 2006, Hevir et al. 2011), GSTs (Hevir et al. 2011) and sulphotransferases
e.g., SULTlA3(Miki et al. 2006), SULT1E1, SULT2B1 (Hevir et al. 2011).

2.5	Exposure Regime: The xCELLigence system Multi-E-Plate stations were used to measure the time-dependent
response to chemicals. Each compound was tested in an eight-point, 1:4 serial dilution series starting at a
maximum final concentration of 100 uM. A maximum starting concentration of 0.5% DMSO was present in the
100 uM chemical samples and was diluted along with the test article dilution series. The screen was performed
in biological duplicate using two separate, 96-well, E-Plates 96 for each dilution series (n = 2). Positive controls
(MG132 and E2) and a negative control (assay media) were tested in quadruplicate on each testing plate. Then,
0.5% and 0.125% DMSO were tested in duplicates in each plate to serve as solvent controls for the 2 highest
concentrations of testing compounds: 100 uM and 25 uM. Reference compounds were tested with 8
concentrations with 1:5 serial dilutions. All screening was carried out by ACEA Biosciences, Inc. (San Diego, CA).
T-47D cells purchased from ATCC were maintained in RPMI1640 media supplemented with 10% characterized

-------
fetal bovine serum (FBS). Before screening, T-47D cells were preconditioned in assay medium: phenol red-free
RPMI1640 supplemented with 10% charcoal-stripped FBS. Cells were then detached and seeded in E-Plates 96
in assay medium. After overnight monitoring of growth once every hour, compounds were added to T-47D cells
and remained in the medium until the end of the experiment. Cellular responses were then recorded once every
5 min for the first 5 h, and once every hour for an additional 100 h.

Baseline median absolute deviation for the assay (bmad): 5.861
Response cutoff threshold used to determine hit calls: 20
Detection technology used: RT-CES (Label Free Technology)

2.6	Response: Increased cell proliferation in response to xenoestrogenic interference with ER-mediated pathways
as measured by monitoring electrical impedance at the cell-plate interface. One possible effect of endocrine
disrupting chemicals is increased cell growth through perturbation of pathways linked to cell cycle regulation.
Activation of the estrogen receptor (ER) signaling pathway, for example, is one possible mechanism that
underlies cell proliferation in hormonally sensitive tissues such as mammary and endometrial tissue. The role of
steroid hormones in the regulation of some mammary tumors has been well established (Russo and Russo 2006,
Yager and Davidson 2006) and has motivated the development of estrogen pathway-based chemotherapeutics.
This assay was designed to identify those chemicals with the potential to affect cell growth by activating the
estrogen receptor-mediated cell proliferation pathway. The assay uses electronic microsensors located at the
bottom of the cell culture well to detect changes in cell number, morphology, and adhesion through electrical
impedance measurement at the electrode-solution interface following 80-hour incubation with test chemicals.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region, ToxCast ER Pathway Model: Estrogen
receptor assays used in ToxCast ER Pathway model

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

25 nM
Key positive control:

MG 132

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

250 nM
Neutral vehicle control:

DMSO

-------
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data were collected from the xCELLigence system which converts raw impedance values into the
Cell Index (CI) value; this is a measure of adhesion where CI = (impedance at time point n - impedance in the
absence of cells)/nominal impedance value. These data were then converted to a Normalized Cell Index
according to the equation NCI(Ti) = CI(Ti)/CI(Tk), where i = 1,2,3,....N where CI(Tk) is the last time point before
chemical addition, CI(Ti) is the cell index at the i-th measured time point, and N is the total number of time
points. Data were grouped by chemical and smoothed to combine replicates using a simple moving average (as
the replicates were assessed in duplicate on separate plates so the time points were not identical). DMSO
controls were considered as baseline for activity, and 17-beta-Estradiol was used as a positive control and 100
percent activity for all the test chemicals on that plate. For cell loss, the NCI value at the time of compound
administration was considered to represent complete (100%) viability. MG132 (2 uM), a proteasome inhibitor
and known cytotoxic agent, was used as the positive control for cell loss and was tested in quadruplicate on
each plate. The minimum average response on each plate was used as a positive control for cell loss for all the
test chemicals on the corresponding plate. If a chemical sample was run on two different plates, then the
minimum NCI values for MG132 were averaged. If an NCI value for MG132 fell below zero, the response was
considered to be below the limit of detection and was replaced with the minimum value greater than zero across
all plates. All smoothened NCI values were then converted to a percentage of positive control, which was
considered to represent no (0%) viability. Concentration response curves were generated using smoothed NCI
values and all statistical analyses were conducted using R programming language, employing tcpl package to
generate model parameters and confidence intervals.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 15:
pval.apid.medncbyconc.min (Calculate the positive control value (pval) as the plate-wise minimum, by
assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single- or multiple-
concentration negative control wells (wilt = m or o) by apid, well type, and concentration.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -

-------
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 27:
ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis
direction. Typically used for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 3395	Number of chemicals tested: 3183

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
895

Inactive hit count: Oihitc 0.9
1432

WINING MODEL SELECTION

NA hit count: hitc^O
1068

Number of sample-assay endpoints with winning hill model:
gain-loss (gnls) model:

115
418

-------
power(pow) model:
linear-polynomial (polyl) model:

251

918

quadratic-polynomialfpoly2) model: 732

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

246

39

545

126

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

-------
Neutral control well median response value, by plate: nmed

108.849

Neutral control median absolute deviation, by plate: nmad	4.323

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	4.2%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	146.774

Positive control well median absolute deviation, by plate: pmad	7.057

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	4.208

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	18.516

Negative control well median absolute deviation value, by plate: mmad	3.052

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-14.527

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 246.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Xing JZ, Zhu L, Gabos S, Xie L. Microelectronic cell sensor assay for detection of cytotoxicity and
prediction of acute toxicity. Toxicol In Vitro. 2006 Sep;20(6):995-1004. Epub 2006 Feb 14. PubMed PMID:
16481145., Rotroff DM, Dix DJ, Houck KA, Kavlock RJ, Knudsen TB, Martin MT, Reif DM, Richard AM, Sipes NS,
Abassi YA, Jin C, Stampfl M, Judson RS. Real-time growth kinetics measuring hormone mimicry for ToxCast
chemicals in T-47D human ductal carcinoma cells. Chem Res Toxicol. 2013 Jul 15;26(7):1097-107.
doi:10.1021/tx400117y. Epub 2013 Jun 10. PubMed PMID: 23682706.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1855

ACEA_AR_agon ist_80h r

1.	General Information

1.1	Assay Title: ACEA Biosciences xCELLigence Real-Time Cell Analysis on Androgen Receptor Agonism for
Proliferation

1.2	Assay Summary: ACEA_AR_agonist is a cell-based, single-readout assay that uses 22Rvl, a human prostate
cancer cell line, with measurements taken at 80 hours after chemical dosing in a 384-well plate, although T05
and T06 (mcO.srcf) used a 96-well plate. Differences in plate size can be ignored given data normalization.
ACEA_AR_agonist_80hr is one of two assay component(s) measured or calculated from the ACEA_AR assay. It
is designed to make measurements of real-time cell-growth kinetics, a form of growth reporter, as detected
with electrical impedance signals by Real-Time Cell Electrode Sensor (RT-CES) technology. Data from the assay
component ACEA_AR_agonist_80hr was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of growth reporter, measures of the cells for gain-of-
signal activity can be used to understand the signaling at the pathway-level as they relate to the geneAR
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the
subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: ACEA Biosciences, Inc. (ACEA) is a privately owned biotechnology company that developed a
realtime, label free, cell growth assay system called xCELLigence based on a microelectronic impedance readout.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; xCELLigence RTCA software and biosensor technology are
available from ACEA Biosciences, Inc. and 22Rvl cells are commercially available from American Type Culture
Collection (ATCC HTB-133) with signed Material Transfer Agreement (MTA).

1.9	Assay Throughput: 384-well plate. The assay is conducted on 96-well plates with each plate containing positive
controls for proliferation (testosterone) and cytotoxicity (MG132), negative controls (assay media, RPMI1640),
and two concentrations (0.5 percent and 0.125 percent) of DMSO solvent controls. Following a 24-hour
incubation period, the cells are exposed to test chemicals for 80 hours and response is monitored no less than
once per hour.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Electrical impedance is used to quantify changes to the growth of the cells where increase
impedance is positively correlated with increased cell growth

The ACEA_AR assay exposed human prostate cell (22Rvl) cultures to the ToxCast library of diverse
environmental chemicals using an eight-point, 1:4 dilution series concentration-response format (starting at a
maximum final concentration of lOOuM), using MG132 (cytotoxicity) and testosterone (proliferation) as positive

-------
controls and assay media and DMSO as a negative control and solvent control, respectively. All control chemicals
were tested in quadruplicate on each plate. The ACEA_AR assay analyzed changes in cell adhesion and
morphology at the electrode: solution interface (located on the bottom of culture wells) using electronic
microsensors. Changes in electrical impedance were monitored in real-time at the plate surface to investigate
the potential activation of the androgen signaling pathway and subsequent increases in growth or changes in
cell structure following 80-hour incubation with the test chemicals. The electrical signal produced by the
experimental system can be used to detect changes in cell number, morphology and adhesion which occur in
response to xenoestrogenic activation of AR-mediated pathways, and concentration-response curves were
modeled for each chemical to determine half-maximal activity levels.

2.2	Scientific Principles: Endocrine disrupting chemicals (EDCs) interfere with normal hormone biosynthesis,
signaling or metabolism and impact regulatory pathways in humans and wildlife. Androgens, such as
testosterone, are widely recognized for their importance in sexual development and differentiation but also
play roles in metabolism, growth, development, and behavior and act as an intercellular signal (Bhasin et al.,
2007; Monks & Holmes, 2018; Sumpter, 2005). Agonism of the androgen receptor is listed as a molecular
initiating event in AOP #23, leading to reproductive dysfunction in fish (Villeneuve, 2021).

2.3	Experimental System: adherent 22Rvl cell line used. 22Rvl is a human prostate carcinoma epithelial cell line
derived from a xenograft that was serially propagated in mice.

2.4	Metabolic Competence: The 22Rvl cell line expresses androgen receptor (AR) and prostate-specific antigen
(PSA), both of which are markers of prostate cancer. The presence of these markers in 22Rvl cells confirms their
origin from prostate cancer tissue and highlights their relevance in studying the disease. Importantly, the 22Rvl
cell line is unique in that it expresses both full-length and truncated forms of ARs. This mixed expression pattern
is commonly observed in androgen deprivation resistant prostate cancers, making the 22Rvl cell line a valuable
model for studying the mechanisms underlying resistance to hormonal therapies. Morphologically, 22Rvl cells
exhibit epithelial characteristics and are cultured as adherent monolayers, providing a convenient system for in
vitro experimentation.

2.5	Exposure Regime: The xCELLigence system Multi-E-Plate stations were used to measure the time-dependent
response to chemicals. Each compound was tested in an eight-point, 1:4 serial dilution series starting at a
maximum final concentration of 100 uM. A maximum starting concentration of 0.5% DMSO was present in the
100 uM chemical samples and was diluted along with the test article dilution series. The screen was performed
in biological duplicate using two separate, 96-well, E-Plates 96 for each dilution series (n = 2). Positive controls
(MG132 and testosterone) and a negative control (assay media) were tested in quadruplicate on each testing
plate. Then, 0.5% and 0.125% DMSO were tested in duplicates in each plate to serve as solvent controls for the
2 highest concentrations of testing compounds: 100 uM and 25 uM. Reference compounds were tested with 8
concentrations with 1:5 serial dilutions. All screening was carried out by ACEA Biosciences, Inc. (San Diego, CA).
22Rvl cells purchased from ATCC were maintained in media supplemented with 10% fetal bovine serum (FBS).
Before screening, 22Rvl cells were preconditioned in assay medium. Cells were then detached and seeded in
E-Plates 96 in assay medium. After overnight monitoring of growth once every hour, compounds were added to
T-47D cells and remained in the medium until the end of the experiment. Cellular responses were then recorded
once every 5 min for the first 5 h, and once every hour for an additional 100 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.144 uM
Key positive control:
testosterone

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

105 uM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 7.544
Response cutoff threshold used to determine hit calls: 22.633

-------
Detection technology used: RT-CES (Label Free Technology)

2.6	Response: Increased cell proliferation in response to xenoestrogenic interference with AR-mediated pathways
as measured by monitoring electrical impedance at the cell-plate interface. One possible effect of endocrine
disrupting chemicals is increased cell growth through perturbation of pathways linked to cell cycle regulation.
Activation of the androgen receptor (AR) signaling pathway, for example, is one possible mechanism that
underlies cell proliferation in hormonally sensitive tissues such as mammary and endometrial tissue. This assay
was designed to identify those chemicals with the potential to affect cell growth by activating the androgen
receptor-mediated cell proliferation pathway. The assay uses electronic microsensors located at the bottom of
the cell culture well to detect changes in cell number, morphology, and adhesion through electrical impedance
measurement at the electrode-solution interface following 80-hour incubation with test chemicals.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data were collected from the xCELLigence system which converts raw impedance values into the
Cell Index (CI) value; this is a measure of adhesion where CI = (impedance at time point n - impedance in the
absence of cells)/nominal impedance value. These data were then converted to a Normalized Cell Index
according to the equation NCI(Ti) = CI(Ti)/CI(Tk), where i = 1,2,3,....N where CI(Tk) is the last time point before
chemical addition, CI(Ti) is the cell index at the i-th measured time point, and N is the total number of time
points. Data were grouped by chemical and smoothed to combine replicates using a simple moving average (as
the replicates were assessed in duplicate on separate plates so the time points were not identical). DMSO
controls were considered as baseline for activity, and testosterone was used as a positive control and 100
percent activity for all the test chemicals on that plate. For cell loss, the NCI value at the time of compound
administration was considered to represent complete (100%) viability. MG132 (2 uM), a proteasome inhibitor
and known cytotoxic agent, was used as the positive control for cell loss and was tested in quadruplicate on
each plate. The minimum average response on each plate was used as a positive control for cell loss for all the
test chemicals on the corresponding plate. If a chemical sample was run on two different plates, then the
minimum NCI values for MG132 were averaged. If an NCI value for MG132 fell below zero, the response was
considered to be below the limit of detection and was replaced with the minimum value greater than zero across
all plates. All smoothened NCI values were then converted to a percentage of positive control, which was
considered to represent no (0%) viability. Concentration response curves were generated using smoothed NCI

-------
values and all statistical analyses were conducted using R programming language, employing tcpl package to
generate model parameters and confidence intervals.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise maximum,
by assay plate ID (apid), of the medians of the corrected values (cval) forgain-of-signal single- or multiple-
concentration negative control wells (wilt = m or o) by apid, well type, and concentration.), 5: resp.pc
(Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference between
the corrected (cval) and baseline (bval) values divided the difference between the positive control (pval)
and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 18: resp.shiftneg.3bmad (Shift all the normalized response
values (resp) less than -3 multiplied by the baseline median absolute deviation (bmad) to 0; if resp < -
3*bmad, resp = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50

-------
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1845	Number of chemicals tested: 1830

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	4.02

Neutral control median absolute deviation, by plate: nmad	0.204

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.14%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	6.915

Positive control well median absolute deviation, by plate: pmad	0.853

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	2.698

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	1

-------
Negative control well median absolute deviation value, by plate: mmad	0

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-14.766

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 106.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.

Bibliography: Judson R, Houck K, Paul Friedman K, Brown J, Browne P, Johnston PA, Close DA, Mansouri K,
Kleinstreuer N. Selecting a minimal set of androgen receptor assays for screening chemicals. Regul Toxicol

-------
Pharmacol. 2020 Nov; 117:104764. doi: 10.1016/j.yrtph.2020.104764. Epub 2020 Aug 14. PMID: 32798611;
PMCID: PMC8356084.

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1856

ACEA_AR_antagonist_80hr

1.	General Information

1.1	Assay Title: ACEA Biosciences xCELLigence Real-Time Cell Analysis on Androgen Receptor Antagonism for
Proliferation

1.2	Assay Summary: ACEA_AR_antagonist is a cell-based, single-readout assay that uses 22Rvl, a human prostate
cancer cell line, with measurements taken at 80 hours after chemical dosing in a 384-well plate, although T05
and T06 (mcO.srcf) used a 96-well plate. Differences in plate size can be ignored given data normalization.
ACEA_AR_antagonist_80hr is one of two assay component(s) measured or calculated from the ACEA_AR assay.
It is designed to make measurements of real-time cell-growth kinetics, a form of growth reporter, as detected
with electrical impedance signals by Real-Time Cell Electrode Sensor (RT-CES) technology. Data from the assay
component ACEA_AR_antagonist_80hr was analyzed in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of growth reporter, measures of the cells for loss-
of-signal activity can be used to understand the signaling at the pathway-level as they relate to the gene AR.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the
subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: ACEA Biosciences, Inc. (ACEA) is a privately owned biotechnology company that developed a
realtime, label free, cell growth assay system called xCELLigence based on a microelectronic impedance readout.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; xCELLigence RTCA software and biosensor technology are
available from ACEA Biosciences, Inc. and 22Rvl cells are commercially available from American Type Culture
Collection (ATCC HTB-133) with signed Material Transfer Agreement (MTA).

1.9	Assay Throughput: 384-well plate. The assay is conducted on 96-well plates with each plate containing positive
controls for proliferation (testosterone) and cytotoxicity (MG132), negative controls (assay media, RPMI1640),
and two concentrations (0.5 percent and 0.125 percent) of DMSO solvent controls. Following a 24-hour
incubation period, the cells are exposed to test chemicals for 80 hours and response is monitored no less than
once per hour.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Electrical impedance is used to quantify changes to the growth of the cells where increase
impedance is positively correlated with increased cell growth

The ACEA_AR assay exposed human prostate cell (22Rvl) cultures to the ToxCast library of diverse
environmental chemicals using an eight-point, 1:4 dilution series concentration-response format (starting at a
maximum final concentration of lOOuM), using MG132 (cytotoxicity) and testosterone (proliferation) as positive

-------
controls and assay media and DMSO as a negative control and solvent control, respectively. All control chemicals
were tested in quadruplicate on each plate. The ACEA_AR assay analyzed changes in cell adhesion and
morphology at the electrode: solution interface (located on the bottom of culture wells) using electronic
microsensors. Changes in electrical impedance were monitored in real-time at the plate surface to investigate
the potential activation of the androgen signaling pathway and subsequent increases in growth or changes in
cell structure following 80-hour incubation with the test chemicals. The electrical signal produced by the
experimental system can be used to detect changes in cell number, morphology and adhesion which occur in
response to xenoestrogenic activation of AR-mediated pathways, and concentration-response curves were
modeled for each chemical to determine half-maximal activity levels.

2.2	Scientific Principles: Endocrine disrupting chemicals (EDCs) interfere with normal hormone biosynthesis,
signaling or metabolism and impact regulatory pathways in humans and wildlife. Androgens, such as
testosterone, are widely recognized for their importance in sexual development and differentiation but also
play roles in metabolism, growth, development, and behavior and act as an intercellular signal (Bhasin et al.,
2007; Monks & Holmes, 2018; Sumpter, 2005). Agonism of the androgen receptor is listed as a molecular
initiating event in AOP #23, leading to reproductive dysfunction in fish (Villeneuve, 2021).

2.3	Experimental System: adherent 22Rvl cell line used. 22Rvl is a human prostate carcinoma epithelial cell line
derived from a xenograft that was serially propagated in mice.

2.4	Metabolic Competence: The 22Rvl cell line expresses androgen receptor (AR) and prostate-specific antigen
(PSA), both of which are markers of prostate cancer. The presence of these markers in 22Rvl cells confirms their
origin from prostate cancer tissue and highlights their relevance in studying the disease. Importantly, the 22Rvl
cell line is unique in that it expresses both full-length and truncated forms of ARs. This mixed expression pattern
is commonly observed in androgen deprivation resistant prostate cancers, making the 22Rvl cell line a valuable
model for studying the mechanisms underlying resistance to hormonal therapies. Morphologically, 22Rvl cells
exhibit epithelial characteristics and are cultured as adherent monolayers, providing a convenient system for in
vitro experimentation.

2.5	Exposure Regime: The xCELLigence system Multi-E-Plate stations were used to measure the time-dependent
response to chemicals. Each compound was tested in an eight-point, 1:4 serial dilution series starting at a
maximum final concentration of 100 uM. A maximum starting concentration of 0.5% DMSO was present in the
100 uM chemical samples and was diluted along with the test article dilution series. The screen was performed
in biological duplicate using two separate, 96-well, E-Plates 96 for each dilution series (n = 2). Positive controls
(MG132 and testosterone) and a negative control (assay media) were tested in quadruplicate on each testing
plate. Then, 0.5% and 0.125% DMSO were tested in duplicates in each plate to serve as solvent controls for the
2 highest concentrations of testing compounds: 100 uM and 25 uM. Reference compounds were tested with 8
concentrations with 1:5 serial dilutions. All screening was carried out by ACEA Biosciences, Inc. (San Diego, CA).
22Rvl cells purchased from ATCC were maintained in media supplemented with 10% fetal bovine serum (FBS).
Before screening, 22Rvl cells were preconditioned in assay medium. Cells were then detached and seeded in
E-Plates 96 in assay medium. After overnight monitoring of growth once every hour, compounds were added to
T-47D cells and remained in the medium until the end of the experiment. Cellular responses were then recorded
once every 5 min for the first 5 h, and once every hour for an additional 100 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.144 uM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

105 uM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.122
Response cutoff threshold used to determine hit calls: 0.366

-------
Detection technology used: RT-CES (Label Free Technology)

2.6	Response: Increased cell proliferation in response to xenoestrogenic interference with AR-mediated pathways
as measured by monitoring electrical impedance at the cell-plate interface. One possible effect of endocrine
disrupting chemicals is increased cell growth through perturbation of pathways linked to cell cycle regulation.
Activation of the androgen receptor (AR) signaling pathway, for example, is one possible mechanism that
underlies cell proliferation in hormonally sensitive tissues such as mammary and endometrial tissue. This assay
was designed to identify those chemicals with the potential to affect cell growth by activating the androgen
receptor-mediated cell proliferation pathway. The assay uses electronic microsensors located at the bottom of
the cell culture well to detect changes in cell number, morphology, and adhesion through electrical impedance
measurement at the electrode-solution interface following 80-hour incubation with test chemicals.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data were collected from the xCELLigence system which converts raw impedance values into the
Cell Index (CI) value; this is a measure of adhesion where CI = (impedance at time point n - impedance in the
absence of cells)/nominal impedance value. These data were then converted to a Normalized Cell Index
according to the equation NCI(Ti) = CI(Ti)/CI(Tk), where i = 1,2,3,....N where CI(Tk) is the last time point before
chemical addition, CI(Ti) is the cell index at the i-th measured time point, and N is the total number of time
points. Data were grouped by chemical and smoothed to combine replicates using a simple moving average (as
the replicates were assessed in duplicate on separate plates so the time points were not identical). DMSO
controls were considered as baseline for activity, and testosterone was used as a positive control and 100
percent activity for all the test chemicals on that plate. For cell loss, the NCI value at the time of compound
administration was considered to represent complete (100%) viability. MG132 (2 uM), a proteasome inhibitor
and known cytotoxic agent, was used as the positive control for cell loss and was tested in quadruplicate on
each plate. The minimum average response on each plate was used as a positive control for cell loss for all the
test chemicals on the corresponding plate. If a chemical sample was run on two different plates, then the
minimum NCI values for MG132 were averaged. If an NCI value for MG132 fell below zero, the response was
considered to be below the limit of detection and was replaced with the minimum value greater than zero across
all plates. All smoothened NCI values were then converted to a percentage of positive control, which was
considered to represent no (0%) viability. Concentration response curves were generated using smoothed NCI

-------
values and all statistical analyses were conducted using R programming language, employing tcpl package to
generate model parameters and confidence intervals.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were

-------
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1850	Number of chemicals tested: 1835

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	6.926

Neutral control median absolute deviation, by plate: nmad	0.984

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	14.38%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	4.202

Positive control well median absolute deviation, by plate: pmad	0.198

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-2.524

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	1

Negative control well median absolute deviation value, by plate: mmad	0

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-5.936

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

-------
((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 205.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Judson R, Houck K, Paul Friedman K, Brown J, Browne P, Johnston PA, Close DA, Mansouri K,
Kleinstreuer N. Selecting a minimal set of androgen receptor assays for screening chemicals. Regul Toxicol
Pharmacol. 2020 Nov; 117:104764. doi: 10.1016/j.yrtph.2020.104764. Epub 2020 Aug 14. PMID: 32798611;
PMCID: PMC8356084.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is

-------
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1857

AC E A_AR_a ntago n i st_AU C_vi a b i I ity

1.	General Information

1.1	Assay Title: ACEA Biosciences xCELLigence Real-Time Cell Analysis on Androgen Receptor Antagonism for
Viability

1.2	Assay Summary: ACEA_AR_antagonist is a cell-based, single-readout assay that uses 22Rvl, a human prostate
cancer cell line, with measurements taken at 80 hours after chemical dosing in a 384-well plate, although T05
and T06 (mcO.srcf) used a 96-well plate. Differences in plate size can be ignored given data normalization.
ACEA_AR_80hr is one of two assay component(s) measured or calculated from the ACEA_ER assay. It is designed
to make measurements of real-time cell-growth kinetics, a form of growth reporter, as detected with electrical
impedance signals by Real-Time Cell Electrode Sensor (RT-CES) technology. Data from the assay component
ACEA_AR_antagonist_AUC_viability was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of growth reporter, loss-of-signal activity can be used
to understand changes in the viability. Furthermore, this assay endpoint can be referred to as a secondary
readout, because this assay has produced multiple assay endpoints where this one serves a viability function.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell cycle
intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: ACEA Biosciences, Inc. (ACEA) is a privately owned biotechnology company that developed a
realtime, label free, cell growth assay system called xCELLigence based on a microelectronic impedance readout.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; xCELLigence RTCA software and biosensor technology are
available from ACEA Biosciences, Inc. and 22Rvl cells are commercially available from American Type Culture
Collection (ATCC HTB-133) with signed Material Transfer Agreement (MTA).

1.9	Assay Throughput: 384-well plate. The assay is conducted on 96-well plates with each plate containing positive
controls for proliferation (testosterone) and cytotoxicity (MG132), negative controls (assay media, RPMI1640),
and two concentrations (0.5 percent and 0.125 percent) of DMSO solvent controls. Following a 24-hour
incubation period, the cells are exposed to test chemicals for 80 hours and response is monitored no less than
once per hour.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Electrical impedance is used to quantify changes to the growth of the cells where increase
impedance is positively correlated with increased cell growth

The ACEA_AR assay exposed human prostate cell (22Rvl) cultures to the ToxCast library of diverse
environmental chemicals using an eight-point, 1:4 dilution series concentration-response format (starting at a
maximum final concentration of lOOuM), using MG132 (cytotoxicity) and testosterone (proliferation) as positive
controls and assay media and DMSO as a negative control and solvent control, respectively. All control chemicals

-------
were tested in quadruplicate on each plate. The ACEA_AR assay analyzed changes in cell adhesion and
morphology at the electrode: solution interface (located on the bottom of culture wells) using electronic
microsensors. Changes in electrical impedance were monitored in real-time at the plate surface to investigate
the potential activation of the androgen signaling pathway and subsequent increases in growth or changes in
cell structure following 80-hour incubation with the test chemicals. The electrical signal produced by the
experimental system can be used to detect changes in cell number, morphology and adhesion which occur in
response to xenoestrogenic activation of AR-mediated pathways, and concentration-response curves were
modeled for each chemical to determine half-maximal activity levels.

2.2	Scientific Principles: Endocrine disrupting chemicals (EDCs) interfere with normal hormone biosynthesis,
signaling or metabolism and impact regulatory pathways in humans and wildlife. Androgens, such as
testosterone, are widely recognized for their importance in sexual development and differentiation but also
play roles in metabolism, growth, development, and behavior and act as an intercellular signal (Bhasin et al.,
2007; Monks & Holmes, 2018; Sumpter, 2005). Agonism of the androgen receptor is listed as a molecular
initiating event in AOP #23, leading to reproductive dysfunction in fish (Villeneuve, 2021).

2.3	Experimental System: adherent 22Rvl cell line used. 22Rvl is a human prostate carcinoma epithelial cell line
derived from a xenograft that was serially propagated in mice.

2.4	Metabolic Competence: The 22Rvl cell line expresses androgen receptor (AR) and prostate-specific antigen
(PSA), both of which are markers of prostate cancer. The presence of these markers in 22Rvl cells confirms their
origin from prostate cancer tissue and highlights their relevance in studying the disease. Importantly, the 22Rvl
cell line is unique in that it expresses both full-length and truncated forms of ARs. This mixed expression pattern
is commonly observed in androgen deprivation resistant prostate cancers, making the 22Rvl cell line a valuable
model for studying the mechanisms underlying resistance to hormonal therapies. Morphologically, 22Rvl cells
exhibit epithelial characteristics and are cultured as adherent monolayers, providing a convenient system for in
vitro experimentation.

2.5	Exposure Regime: The xCELLigence system Multi-E-Plate stations were used to measure the time-dependent
response to chemicals. Each compound was tested in an eight-point, 1:4 serial dilution series starting at a
maximum final concentration of 100 uM. A maximum starting concentration of 0.5% DMSO was present in the
100 uM chemical samples and was diluted along with the test article dilution series. The screen was performed
in biological duplicate using two separate, 96-well, E-Plates 96 for each dilution series (n = 2). Positive controls
(MG132 and testosterone) and a negative control (assay media) were tested in quadruplicate on each testing
plate. Then, 0.5% and 0.125% DMSO were tested in duplicates in each plate to serve as solvent controls for the
2 highest concentrations of testing compounds: 100 uM and 25 uM. Reference compounds were tested with 8
concentrations with 1:5 serial dilutions. All screening was carried out by ACEA Biosciences, Inc. (San Diego, CA).
22Rvl cells purchased from ATCC were maintained in media supplemented with 10% fetal bovine serum (FBS).
Before screening, 22Rvl cells were preconditioned in assay medium. Cells were then detached and seeded in
E-Plates 96 in assay medium. After overnight monitoring of growth once every hour, compounds were added to
T-47D cells and remained in the medium until the end of the experiment. Cellular responses were then recorded
once every 5 min for the first 5 h, and once every hour for an additional 100 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.144 uM
Key positive control:

MG 132

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

105 uM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 9.65
Response cutoff threshold used to determine hit calls: 28.951
Detection technology used: RT-CES (Label Free Technology)

-------
2.6	Response: Increased cell proliferation in response to xenoestrogenic interference with AR-mediated pathways
as measured by monitoring electrical impedance at the cell-plate interface. One possible effect of endocrine
disrupting chemicals is increased cell growth through perturbation of pathways linked to cell cycle regulation.
Activation of the androgen receptor (AR) signaling pathway, for example, is one possible mechanism that
underlies cell proliferation in hormonally sensitive tissues such as mammary and endometrial tissue. This assay
was designed to identify those chemicals with the potential to affect cell growth by activating the androgen
receptor-mediated cell proliferation pathway. The assay uses electronic microsensors located at the bottom of
the cell culture well to detect changes in cell number, morphology, and adhesion through electrical impedance
measurement at the electrode-solution interface following 80-hour incubation with test chemicals.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data were collected from the xCELLigence system which converts raw impedance values into the
Cell Index (CI) value; this is a measure of adhesion where CI = (impedance at time point n - impedance in the
absence of cells)/nominal impedance value. These data were then converted to a Normalized Cell Index
according to the equation NCI(Ti) = CI(Ti)/CI(Tk), where i = 1,2,3,....N where CI(Tk) is the last time point before
chemical addition, CI(Ti) is the cell index at the i-th measured time point, and N is the total number of time
points. Data were grouped by chemical and smoothed to combine replicates using a simple moving average (as
the replicates were assessed in duplicate on separate plates so the time points were not identical). DMSO
controls were considered as baseline for activity, and testosterone was used as a positive control and 100
percent activity for all the test chemicals on that plate. For cell loss, the NCI value at the time of compound
administration was considered to represent complete (100%) viability. MG132 (2 uM), a proteasome inhibitor
and known cytotoxic agent, was used as the positive control for cell loss and was tested in quadruplicate on
each plate. The minimum average response on each plate was used as a positive control for cell loss for all the
test chemicals on the corresponding plate. If a chemical sample was run on two different plates, then the
minimum NCI values for MG132 were averaged. If an NCI value for MG132 fell below zero, the response was
considered to be below the limit of detection and was replaced with the minimum value greater than zero across
all plates. All smoothened NCI values were then converted to a percentage of positive control, which was
considered to represent no (0%) viability. Concentration response curves were generated using smoothed NCI

-------
values and all statistical analyses were conducted using R programming language, employing tcpl package to
generate model parameters and confidence intervals.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 15:
pval.apid.medncbyconc.min (Calculate the positive control value (pval) as the plate-wise minimum, by
assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single- or multiple-
concentration negative control wells (wilt = m or o) by apid, well type, and concentration.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 27:
ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis
direction. Typically used for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than

-------
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1850	Number of chemicals tested: 1835

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	216.845

Neutral control median absolute deviation, by plate: nmad	21.373
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100 8.91%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	170.663
Positive control well median absolute deviation, by plate: pmad 6.345

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-2.199

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed

72.977

-------
Negative control well median absolute deviation value, by plate: mmad	2.556

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-7.083

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 199.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.

Bibliography: Judson R, Houck K, Paul Friedman K, Brown J, Browne P, Johnston PA, Close DA, Mansouri K,
Kleinstreuer N. Selecting a minimal set of androgen receptor assays for screening chemicals. Regul Toxicol

-------
Pharmacol. 2020 Nov; 117:104764. doi: 10.1016/j.yrtph.2020.104764. Epub 2020 Aug 14. PMID: 32798611;
PMCID: PMC8356084.

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointlD:4

APR_HepG2_Cel ICycleArrest_lh r

1.	General Information

1.1	Assay Title: Aprendica 1-hour HepG2 Human Liver Cell Assay for Cell Cyle Arrest

1.2	Assay Summary: APR_HepG2_lhr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 1 hour after chemical dosing in a 384-well plate.
APR_HepG2_CellCycleArrest_lhr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_lhr assay. It is designed to make measurements of cell phenotype, a form of morphology reporter,
as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the assay
component APR_HepG2_CellCycleArrest_lhr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_CellCycleArrest_lhr, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of morphology reporter, measures of all nuclear dna for gain or
loss-of-signal activity can be used to understand the signaling at the pathway-level as they relate to the gene .
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily
is arrest.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Hoechst-33342 dye is used as a stain for DNA to understand morphology in the system.

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

-------
2.2	Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano, 2006), which applies automated
image analysis techniques to capture multiple cytological features using fluorescent labels, to measure the
concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully metabolically
capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated capacity to
predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien et al. 2006;
Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-state
trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test period.
The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle

-------
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

Baseline median absolute deviation for the assay (bmad): 0.072
Response cutoff threshold used to determine hit calls: 0.725
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity level of Hoechst-33342 stained DNA indicates cell phenotypes which can be
used to identify cell cycle arrest.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.
3.1 Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.391 nM
Key positive control:

Camptothecin;Anisomycin

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

NA

-------
3.2 Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than

-------
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 320	Number of chemicals tested: 310

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

-------
{(mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 35.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointlD:6

APR_HepG2_CellLoss_lhr

1.	General Information

1.1	Assay Title: Aprendica 1-hour HepG2 Human Livery Cell Assay for Cell Loss

1.2	Assay Summary: APR_HepG2_lhr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 1 hour after chemical dosing in a 384-well plate. APR_HepG2_CellLoss_lhr is
one of 10 assay component(s) measured or calculated from the APR_HepG2_lhr assay. It is designed to make
measurements of cell number, a form of viability reporter, as detected with fluorescence intensity signals by
HCS Fluorescent Imaging technology. Data from the assay component APR_HepG2_CellLoss_lhr was analyzed
into 1 assay endpoint. This assay endpoint, APR_HepG2_CellLoss_lhr, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of viability reporter, measures of
all nuclear dna for gain or loss-of-signal activity can be used to understand the viability at the cellular-level.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a viability function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily
is proliferation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Hoechst-33342 dye is used as a stain for DNA to understand viability in the system.

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2

Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to

-------
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of

-------
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

10	2

Standard minimum concentration tested:	Standard maximum concentration tested:

0.391 nM	200 nM

Key positive control:	Neutral vehicle control:

Camptothecin;Anisomycin;Paclitaxel;CCCP	DMSO

Baseline median absolute deviation for the assay (bmad): 0.056
Response cutoff threshold used to determine hit calls: 0.557
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity level of Hoechst-33342 stained DNA indicates cell phenotypes which can be
used to identify cell cycle arrest.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate

-------
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);

-------
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 320	Number of chemicals tested: 310

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inaclive hit count: 0
-------
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

-------
Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 16.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointlD:8

APR_HepG2_MicrotubuleCSK_lhr

1.	General Information

1.1	Assay Title: Aprendica 1-hour HepG2 Human Liver Cell Assay for Microtubule CSK Stabilty

1.2	Assay Summary: APR_HepG2_lhr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 1 hour after chemical dosing in a 384-well plate.
APR_HepG2_MicrotubuleCSK_lhr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_lhr assay. It is designed to make measurements of protein conformation, a form of conformation
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the
assay component APR_HepG2_MicrotubuleCSK_lhr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_MicrotubuleCSK_lhr, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of conformation reporter, measures of protein for gain or loss-of-
signal activity can be used to understand the signaling at the cellular-level. Furthermore, this assay endpoint
can be referred to as a primary readout, because this assay has produced multiple assay endpoints where this
one serves a signaling function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the cell morphology intended target family, where the subfamily is cell conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: anti-a-tubulin antibody is used to tag and quantify the level of tubulin, alpha la protein. Changes
in the signals are indicative of protein expression changes as a cellular response to stress in the system
[GeneSymbokTUBAlA | GenelD:7846 | Uniprot_SwissProt_Accession:Q71U36],

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

-------
2.2	Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle

-------
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

10	1

Standard minimum concentration tested:	Standard maximum concentration tested:

0.391 nM	200 nM

Key positive control:	Neutral vehicle control:

Camptothecin;Anisomycin;Paclitaxel;CCCP	DMSO

Baseline median absolute deviation for the assay (bmad): 0.124
Response cutoff threshold used to determine hit calls: 1.236
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity level of Hoechst-33342 stained DNA indicates cell phenotypes which can be
used to identify cell cycle arrest.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1 Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

-------
3.2 Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than

-------
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 320	Number of chemicals tested: 310

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

-------
{(mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 19.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 10

APR_HepG2_MitoMass_lhr

1.	General Information

1.1	Assay Title: Aprendica 1-hour HepG2 Human Liver Cell Assay for Mitochondrial Mass

1.2	Assay Summary: APR_HepG2_lhr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 1 hour after chemical dosing in a 384-well plate. APR_HepG2_MitoMass_lhr
is one of 10 assay component(s) measured or calculated from the APR_HepG2_lhr assay. It is designed to make
measurements of cell phenotype, a form of morphology reporter, as detected with fluorescence intensity signals
by HCS Fluorescent Imaging technology. Data from the assay component APR_HepG2_MitoMass_lhr was
analyzed into 1 assay endpoint. This assay endpoint, APR_HepG2_MitoMass_lhr, was analyzed with
bidirectional fitting relative to DMSOas the negative control and baseline of activity. Using a type of morphology
reporter, gain or loss-of-signal activity can be used to understand changes in the signaling. Furthermore, this
assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a signaling function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the cell morphology intended target family, where the subfamily is
organelle conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: MitoTracker Red is used as a stain for the morphology of the mitochondria.

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2 Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to

-------
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of

-------
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

10	1

Standard minimum concentration tested:	Standard maximum concentration tested:

0.391 nM	200 nM

Key positive control:	Neutral vehicle control:

Camptothecin;Anisomycin;Paclitaxel;CCCP	DMSO

Baseline median absolute deviation for the assay (bmad): 0.036
Response cutoff threshold used to determine hit calls: 0.356
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity level of Hoechst-33342 stained DNA is used to quantify cell number to report
viability as a marker for cell death.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate

-------
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);

-------
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 320	Number of chemicals tested: 310

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inaclive hit count: 0
-------
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

-------
Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 24.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 12

APR_HepG2_MitoMembPot_lhr

1.	General Information

1.1	Assay Title: Aprendica 1-hour HepG2 Human Liver Cell Assay for Mitochondrial Membrane Potential

1.2	Assay Summary: APR_HepG2_lhr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 1 hour after chemical dosing in a 384-well plate.
APR_HepG2_MitoMembPot_lhr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_lhr assay. It is designed to make measurements of dye binding, a form of membrane potential
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the
assay component APR_HepG2_MitoMembPot_lhr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_MitoMembPot_lhr, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of membrane potential reporter, gain or loss-of-signal activity can
be used to understand changes in the signaling. Furthermore, this assay endpoint can be referred to as a primary
readout, because this assay has produced multiple assay endpoints where this one serves a signaling function.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell
morphology intended target family, where the subfamily is organelle conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: MitoTracker Red is used as a stain for the membrane potential of the mitochondria.

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2 Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to

-------
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of

-------
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

10	1

Standard minimum concentration tested:	Standard maximum concentration tested:

0.391 nM	200 nM

Key positive control:	Neutral vehicle control:

Camptothecin;Anisomycin;Paclitaxel;CCCP	DMSO

Baseline median absolute deviation for the assay (bmad): 0.179
Response cutoff threshold used to determine hit calls: 1.787
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity level of Hoechst-33342 stained DNA is used to quantify cell number to report
viability as a marker for cell death.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate

-------
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);

-------
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 320	Number of chemicals tested: 310

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inaclive hit count: 0
-------
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

-------
Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 35.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 14

APR_HepG2_MitoticArrest_lhr

1.	General Information

1.1	Assay Title: Aprendica 1-hour HepG2 Human Liver Cell Assay for Mitotic Arrest

1.2	Assay Summary: APR_HepG2_lhr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 1 hour after chemical dosing in a 384-well plate.
APR_HepG2_MitoticArrest_lhr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_lhr assay. It is designed to make measurements of cell phenotype, a form of morphology reporter,
as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the assay
component APR_HepG2_MitoticArrest_lhr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_MitoticArrest_lhr, was analyzed with bidirectional fitting relative to DMSO as the negative control
and baseline of activity. Using a type of morphology reporter, measures of protein for gain or loss-of-signal
activity can be used to understand the signaling at the pathway-level as they relate to the gene . Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a signaling function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily is arrest.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: anti-phospho-histone-H3 antibody is used to tag and quantify the level of phosphorylated H3
histone, family 3A protein. Changes in the signals are indicative of protein expression changes as a cellular
response to stress in the system [GeneSymbol:H3F3A | GenelD:3020 | Uniprot_SwissProt_Accession:P84243],

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

-------
2.2	Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle

-------
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

10	1

Standard minimum concentration tested:	Standard maximum concentration tested:

0.391 nM	200 nM

Key positive control:	Neutral vehicle control:

Camptothecin;Anisomycin;Paclitaxel;CCCP	DMSO

Baseline median absolute deviation for the assay (bmad): 0.118
Response cutoff threshold used to determine hit calls: 1.178
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity level of Hoechst-33342 stained DNA is used to quantify cell number to report
viability as a marker for cell death.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1 Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

-------
3.2 Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than

-------
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 320	Number of chemicals tested: 310

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

-------
{(mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 31.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 16

APR_HepG2_NuclearSize_lhr

1.	General Information

1.1	Assay Title: Aprendica 1-hour HepG2 Human Liver Cell Assay for Nuclear Size

1.2	Assay Summary: APR_HepG2_lhr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 1 hour after chemical dosing in a 384-well plate. APR_HepG2_NuclearSize_lhr
is one of 10 assay component(s) measured or calculated from the APR_HepG2_lhr assay. It is designed to make
measurements of cell phenotype, a form of morphology reporter, as detected with fluorescence intensity signals
by HCS Fluorescent Imaging technology. Data from the assay component APR_HepG2_NuclearSize_lhr was
analyzed into 1 assay endpoint. This assay endpoint, APR_HepG2_NuclearSize_lhr, was analyzed with
bidirectional fitting relative to DMSOas the negative control and baseline of activity. Using a type of morphology
reporter, measures of all nuclear dna for gain or loss-of-signal activity can be used to understand the signaling
at the nuclear-level. Furthermore, this assay endpoint can be referred to as a primary readout, because this
assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the cell morphology intended
target family, where the subfamily is organelle conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Hoechst-33342 dye is used as a stain for DNA to understand morphology in the system.

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2 Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to

-------
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of

-------
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

10	1

Standard minimum concentration tested:	Standard maximum concentration tested:

0.391 nM	200 nM

Key positive control:	Neutral vehicle control:

Camptothecin;Anisomycin;Paclitaxel;CCCP	DMSO

Baseline median absolute deviation for the assay (bmad): 0.008
Response cutoff threshold used to determine hit calls: 0.263
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: Protein stabilization of microtubules is identified through fluorescent intensity of anti-a-tubulin
antibody tagged tubulin, alpha la protein, and is a sign of cellular response to stress.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate

-------
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);

-------
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 320	Number of chemicals tested: 310

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inaclive hit count: 0
-------
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

-------
Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 38.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 18

APR_HepG2_P-H2AX_lhr

1.	General Information

1.1	Assay Title: Aprendica 1-hour HepG2 Human Liver Cell Assay for H2AX Phosphorylation

1.2	Assay Summary: APR_HepG2_lhr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 1 hour after chemical dosing in a 384-well plate. APR_HepG2_P-H2AX_lhr is
one of 10 assay component(s) measured or calculated from the APR_HepG2_lhr assay. It is designed to make
measurements of dna content, a form of viability reporter, as detected with fluorescence intensity signals by
HCS Fluorescent Imaging technology. Data from the assay component APR_HepG2_P-H2AX_lhr was analyzed
into 1 assay endpoint. This assay endpoint, APR_HepG2_P-H2AX_lhr, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of viability reporter, measures of
protein for gain or loss-of-signal activity can be used to understand the signaling at the pathway-level as they
relate to the gene . Furthermore, this assay endpoint can be referred to as a primary readout, because this
assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the dna binding intended target
family, where the subfamily is cellular response to DNA damage stimulus.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Anti-phospho-histone-H2AX antibody is used to tag and quantify the level of phosphorylated H2A
histone family, member X protein. Changes in the signals are indicative of protein expression changes as a
cellular response to stress in the system [GeneSymbol:H2AFX | GenelD:3014 |
Uniprot_SwissProt_Accession:P16104],

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the

-------
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2	Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for

-------
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

10	1

Standard minimum concentration tested:	Standard maximum concentration tested:

0.391 nM	200 nM

Key positive control:	Neutral vehicle control:

Camptothecin;Anisomycin	DMSO

Baseline median absolute deviation for the assay (bmad): 0.077
Response cutoff threshold used to determine hit calls: 0.772
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: Protein stabilization of microtubules is identified through fluorescent intensity of anti-a-tubulin
antibody tagged tubulin, alpha la protein, and is a sign of cellular response to stress.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a

-------
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where

-------
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 320	Number of chemicals tested: 310

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

-------
Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

NA

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

NA

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

NA

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 28.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,

-------
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 20

APR_HepG2_p53Act_lhr

1.	General Information

1.1	Assay Title: Aprendica 1-hour HepG2 Human Liver Cell Assay for p53 Activation

1.2	Assay Summary: APR_HepG2_lhr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 1 hour after chemical dosing in a 384-well plate. APR_HepG2_p53Act_lhr is
one of 10 assay component(s) measured or calculated from the APR_HepG2_lhr assay. It is designed to make
measurements of dna content, a form of viability reporter, as detected with fluorescence intensity signals by
HCS Fluorescent Imaging technology. Data from the assay component APR_HepG2_p53Act_lhr was analyzed
into 1 assay endpoint. This assay endpoint, APR_HepG2_p53Act_lhr, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of viability reporter, measures of
protein for gain or loss-of-signal activity can be used to understand the signaling at the pathway-level as they
relate to the gene TP53. Furthermore, this assay endpoint can be referred to as a primary readout, because this
assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the dna binding intended target
family, where the subfamily is tumor suppressor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: anti-p53 antibody is used to tag and quantify the level of tumor protein p53 protein. Changes in
the signals are indicative of protein expression changes as a cellular response to stress in the system
[GeneSymbol:TP53 | GenelD:7157 | Uniprot_SwissProt_Accession:P04637],

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

-------
2.2	Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle

-------
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

Baseline median absolute deviation for the assay (bmad): 0.084
Response cutoff threshold used to determine hit calls: 0.843
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Protein stabilization of microtubules is identified through fluorescent intensity of anti-a-tubulin
antibody tagged tubulin, alpha la protein, and is a sign of cellular response to stress.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.
3.1 Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.391 nM
Key positive control:

Camptothecin;Anisomycin

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

NA

-------
3.2 Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than

-------
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 320	Number of chemicals tested: 310

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

-------
{(mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 21.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 22

APR_HepG2_StressKinase_lhr

1.	General Information

1.1	Assay Title: Aprendica 1-hour HepG2 Human Liver Cell Assay for Stress Kinase

1.2	Assay Summary: APR_HepG2_lhr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 1 hour after chemical dosing in a 384-well plate.
APR_HepG2_StressKinase_lhr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_lhr assay. It is designed to make measurements of enzyme activity, a form of enzyme reporter, as
detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the assay
component APR_HepG2_StressKinase_lhr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_StressKinase_lhr, was analyzed with bidirectional fitting relative to DMSO as the negative control
and baseline of activity. Using a type of enzyme reporter, measures of protein for gain or loss-of-signal activity
can be used to understand the signaling at the pathway-level as they relate to the gene . Furthermore, this
assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a signaling function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily is stress
response.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: anti-phospho-c-jun antibody is used to tag and quantify the level of phosphorylated jun proto-
oncogene protein. Changes in the signals are indicative of protein expression changes as a cellular response to
stress in the system [GeneSymbokJUN | GenelD:3725 | Uniprot_SwissProt_Accession:P05412],

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the

-------
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2	Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for

-------
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

10	1

Standard minimum concentration tested:	Standard maximum concentration tested:

0.391 nM	200 nM

Key positive control:	Neutral vehicle control:

Camptothecin;Anisomycin	DMSO

Baseline median absolute deviation for the assay (bmad): 0.079
Response cutoff threshold used to determine hit calls: 0.791
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: Fluorescent intensity of MitoTracker Red stained mitochondria is an indicator of mitochondrial
morphology and cell cycle staging.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a

-------
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where

-------
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 320	Number of chemicals tested: 310

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

-------
Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

NA

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

NA

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

NA

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 30.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,

-------
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointlD:24

APR_HepG2_CellCycleArrest_24hr

1.	General Information

1.1	Assay Title: Aprendica 24-hour HepG2 Human Liver Cell Assay for Cell Cyle Arrest

1.2	Assay Summary: APR HepG2 24hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 384-well plate.
APR_HepG2_CellCycleArrest_24hr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_24hr assay. It is designed to make measurements of cell phenotype, a form of morphology
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the
assay component APR_HepG2_CellCycleArrest_24hr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_CellCycleArrest_24hr, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of morphology reporter, measures of all nuclear dna for gain or
loss-of-signal activity can be used to understand the signaling at the pathway-level as they relate to the gene .
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily
is arrest.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Hoechst-33342 dye is used as a stain for DNA to understand morphology in the system.

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

-------
2.2	Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle

-------
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

Baseline median absolute deviation for the assay (bmad): 0.07
Response cutoff threshold used to determine hit calls: 0.705
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity of MitoTracker Red stained mitochondria is an indicator of mitochondrial
morphology and cell cycle staging.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.
3.1 Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.58 nM
Key positive control:

Camptothecin;Anisomycin

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

297 nM
Neutral vehicle control:

DMSO

NA

-------
3.2 Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than

-------
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1108	Number of chemicals tested: 1066

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

-------
{(mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 158.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 26

APR_HepG2_Cel I Loss_24hr

1.	General Information

1.1	Assay Title: Aprendica 24-hour HepG2 Human Livery Cell Assay for Cell Loss

1.2	Assay Summary: APR HepG2 24hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 384-well plate.
APR_HepG2_CellLoss_24hr is one of 10 assay component(s) measured or calculated from the APR_HepG2_24hr
assay. It is designed to make measurements of cell number, a form of viability reporter, as detected with
fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the assay component
APR_HepG2_CellLoss_24hr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_CellLoss_24hr, was analyzed with bidirectional fitting relative to DMSO as the negative control and
baseline of activity. Using a type of viability reporter, measures of all nuclear dna for gain or loss-of-signal
activity can be used to understand the viability at the cellular-level. Furthermore, this assay endpoint can be
referred to as a secondary readout, because this assay has produced multiple assay endpoints where this one
serves a viability function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cell cycle intended target family, where the subfamily is proliferation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Hoechst-33342 dye is used as a stain for DNA to understand viability in the system.

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2

Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to

-------
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of

-------
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

10	2

Standard minimum concentration tested:	Standard maximum concentration tested:

0.58 nM	297 nM

Key positive control:	Neutral vehicle control:

Camptothecin;Anisomycin;Paclitaxel;CCCP	DMSO

Baseline median absolute deviation for the assay (bmad): 0.066
Response cutoff threshold used to determine hit calls: 0.662
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity of MitoTracker Red stained mitochondria is an indicator of mitochondrial
morphology and cell cycle staging.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate

-------
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);

-------
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1108	Number of chemicals tested: 1066

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inaclive hit count: 0
-------
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

-------
Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 121.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 28

APR_HepG2_MicrotubuleCSK_24hr

1.	General Information

1.1	Assay Title: Aprendica 24-hour HepG2 Human Liver Cell Assay for Microtubule CSK Stabilty

1.2	Assay Summary: APR HepG2 24hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 384-well plate.
APR_HepG2_MicrotubuleCSK_24hr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_24hr assay. It is designed to make measurements of protein conformation, a form of conformation
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the
assay component APR_HepG2_MicrotubuleCSK_24hr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_MicrotubuleCSK_24hr, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of conformation reporter, measures of protein for gain or loss-of-
signal activity can be used to understand the signaling at the cellular-level. Furthermore, this assay endpoint
can be referred to as a primary readout, because this assay has produced multiple assay endpoints where this
one serves a signaling function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the cell morphology intended target family, where the subfamily is cell conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: anti-a-tubulin antibody is used to tag and quantify the level of tubulin, alpha la protein. Changes
in the signals are indicative of protein expression changes as a cellular response to stress in the system
[GeneSymbokTUBAlA | GenelD:7846 | Uniprot_SwissProt_Accession:Q71U36],

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

-------
2.2	Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle

-------
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

Baseline median absolute deviation for the assay (bmad): 0.082
Response cutoff threshold used to determine hit calls: 0.818
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity of MitoTracker Red stained mitochondria is used to as a membrane potential
reporter for mitochondrial depolarization as indicated by the level of dye binding.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.
3.1 Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.58 nM
Key positive control:

Camptothecin;Anisomycin

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

297 nM
Neutral vehicle control:

DMSO

NA

-------
3.2 Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than

-------
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1108	Number of chemicals tested: 1066

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

-------
{(mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 140.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 30

APR_HepG2_MitoMass_24hr

1.	General Information

1.1	Assay Title: Aprendica 24-hour HepG2 Human Liver Cell Assay for Mitochondrial Mass

1.2	Assay Summary: APR HepG2 24hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 384-well plate.
APR_HepG2_MitoMass_24hr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_24hr assay. It is designed to make measurements of cell phenotype, a form of morphology
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the
assay component APR_HepG2_MitoMass_24hr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_MitoMass_24hr, was analyzed with bidirectional fitting relative to DMSO as the negative control
and baseline of activity. Using a type of morphology reporter, gain or loss-of-signal activity can be used to
understand changes in the signaling. Furthermore, this assay endpoint can be referred to as a primary readout,
because this assay has produced multiple assay endpoints where this one serves a signaling function. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell morphology
intended target family, where the subfamily is organelle conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: MitoTracker Red is used as a stain for the morphology of the mitochondria.

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2 Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to

-------
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of

-------
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

Baseline median absolute deviation for the assay (bmad): 0.05
Response cutoff threshold used to determine hit calls: 0.498
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity of MitoTracker Red stained mitochondria is used to as a membrane potential
reporter for mitochondrial depolarization as indicated by the level of dye binding.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.58 nM
Key positive control:

Paclitaxel;CCCP

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

297 nM
Neutral vehicle control:

DMSO

NA

-------
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);

-------
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1108	Number of chemicals tested: 1066

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inaclive hit count: 0
-------
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

-------
Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 138.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointlD:32

APR_HepG2_MitoMembPot_24hr

1.	General Information

1.1	Assay Title: Aprendica 24-hour HepG2 Human Liver Cell Assay for Mitochondrial Membrane Potential

1.2	Assay Summary: APR HepG2 24hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 384-well plate.
APR_HepG2_MitoMembPot_24hr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_24hr assay. It is designed to make measurements of dye binding, a form of membrane potential
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the
assay component APR_HepG2_MitoMembPot_24hr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_MitoMembPot_24hr, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of membrane potential reporter, gain or loss-of-signal activity can
be used to understand changes in the signaling. Furthermore, this assay endpoint can be referred to as a primary
readout, because this assay has produced multiple assay endpoints where this one serves a signaling function.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell
morphology intended target family, where the subfamily is organelle conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: MitoTracker Red is used as a stain for the membrane potential of the mitochondria.

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2 Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to

-------
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of

-------
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

Baseline median absolute deviation for the assay (bmad): 0.083
Response cutoff threshold used to determine hit calls: 0.831
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity of MitoTracker Red stained mitochondria is used to as a membrane potential
reporter for mitochondrial depolarization as indicated by the level of dye binding.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.58 nM
Key positive control:

Paclitaxel;CCCP

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

297 nM
Neutral vehicle control:

DMSO

NA

-------
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);

-------
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1108	Number of chemicals tested: 1066

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inaclive hit count: 0
-------
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

-------
Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 148.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 34

APR_HepG2_MitoticArrest_24hr

1.	General Information

1.1	Assay Title: Aprendica 24-hour HepG2 Human Liver Cell Assay for Mitotic Arrest

1.2	Assay Summary: APR HepG2 24hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 384-well plate.
APR_HepG2_MitoticArrest_24hr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_24hr assay. It is designed to make measurements of cell phenotype, a form of morphology
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the
assay component APR_HepG2_MitoticArrest_24hr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_MitoticArrest_24hr, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of morphology reporter, measures of protein for gain or loss-of-
signal activity can be used to understand the signaling at the pathway-level as they relate to the gene .
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily
is arrest.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: anti-phospho-histone-H3 antibody is used to tag and quantify the level of phosphorylated H3
histone, family 3A protein. Changes in the signals are indicative of protein expression changes as a cellular
response to stress in the system [GeneSymbol:H3F3A | GenelD:3020 | Uniprot_SwissProt_Accession:P84243],

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the

-------
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2	Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for

-------
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

Baseline median absolute deviation for the assay (bmad): 0.11
Response cutoff threshold used to determine hit calls: 1.102
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity of anti-phospho-histone-H3 antibody tagged phosphorylated H3 histone is
indicative mitotic arrest due to protein expression changes as a cellular response to stress in the system.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.58 nM
Key positive control:

Paclitaxel;CCCP

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

297 nM
Neutral vehicle control:

DMSO

NA

-------
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where

-------
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1108	Number of chemicals tested: 1066

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

-------
Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

NA

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

NA

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

NA

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 128.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,

-------
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 36

APR_HepG2_NuclearSize_24hr

1.	General Information

1.1	Assay Title: Aprendica 24-hour HepG2 Human Liver Cell Assay for Nuclear Size

1.2	Assay Summary: APR HepG2 24hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 384-well plate.
APR_HepG2_NuclearSize_24hr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_24hr assay. It is designed to make measurements of cell phenotype, a form of morphology
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the
assay component APR_HepG2_NuclearSize_24hr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_NuclearSize_24hr, was analyzed with bidirectional fitting relative to DMSO as the negative control
and baseline of activity. Using a type of morphology reporter, measures of all nuclear dna for gain or loss-of-
signal activity can be used to understand the signaling at the nuclear-level. Furthermore, this assay endpoint
can be referred to as a primary readout, because this assay has produced multiple assay endpoints where this
one serves a signaling function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the cell morphology intended target family, where the subfamily is organelle conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Hoechst-33342 dye is used as a stain for DNA to understand morphology in the system.

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2 Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to

-------
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of

-------
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

Baseline median absolute deviation for the assay (bmad): 0.01
Response cutoff threshold used to determine hit calls: 0.263
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity of anti-phospho-histone-H3 antibody tagged phosphorylated H3 histone is
indicative mitotic arrest due to protein expression changes as a cellular response to stress in the system.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.58 nM
Key positive control:

Paclitaxel;CCCP

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

297 nM
Neutral vehicle control:

DMSO

NA

-------
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);

-------
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1108	Number of chemicals tested: 1066

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inaclive hit count: 0
-------
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

-------
Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 156.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 38

APR_HepG2_P-H2AX_24h r

1.	General Information

1.1	Assay Title: Aprendica 24-hour HepG2 Human Liver Cell Assay for H2AX Phosphorylation

1.2	Assay Summary: APR HepG2 24hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 384-well plate. APR_HepG2_P-
H2AX_24hr is one of 10 assay component(s) measured or calculated from the APR_HepG2_24hr assay. It is
designed to make measurements of dna content, a form of viability reporter, as detected with fluorescence
intensity signals by HCS Fluorescent Imaging technology. Data from the assay component APR_HepG2_P-
H2AX_24hr was analyzed into 1 assay endpoint. This assay endpoint, APR_HepG2_P-H2AX_24hr, was analyzed
with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of
viability reporter, measures of protein for gain or loss-of-signal activity can be used to understand the signaling
at the pathway-level as they relate to the gene . Furthermore, this assay endpoint can be referred to as a
primary readout, because this assay has produced multiple assay endpoints where this one serves a signaling
function. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
dna binding intended target family, where the subfamily is cellular response to DNA damage stimulus.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Anti-phospho-histone-H2AX antibody is used to tag and quantify the level of phosphorylated H2A
histone family, member X protein. Changes in the signals are indicative of protein expression changes as a
cellular response to stress in the system [GeneSymbol:H2AFX | GenelD:3014 |
Uniprot_SwissProt_Accession:P16104],

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the

-------
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2	Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for

-------
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

10	1

Standard minimum concentration tested:	Standard maximum concentration tested:

0.58 nM	297 nM

Key positive control:	Neutral vehicle control:

Camptothecin;Anisomycin	DMSO

Baseline median absolute deviation for the assay (bmad): 0.082
Response cutoff threshold used to determine hit calls: 0.821
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: Fluorescent intensity of anti-phospho-histone-H3 antibody tagged phosphorylated H3 histone is
indicative mitotic arrest due to protein expression changes as a cellular response to stress in the system.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a

-------
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where

-------
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1108	Number of chemicals tested: 1066

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

-------
Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

NA

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

NA

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

NA

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 140.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,

-------
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointlD:40

APR_HepG2_p53Act_24h r

1.	General Information

1.1	Assay Title: Aprendica 24-hour HepG2 Human Liver Cell Assay for p53 Activation

1.2	Assay Summary: APR HepG2 24hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 384-well plate.
APR_HepG2_p53Act_24hr is one of 10 assay component(s) measured or calculated from the APR_HepG2_24hr
assay. It is designed to make measurements of dna content, a form of viability reporter, as detected with
fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the assay component
APR_HepG2_p53Act_24hr was analyzed into 1 assay endpoint. This assay endpoint, APR_HepG2_p53Act_24hr,
was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using
a type of viability reporter, measures of protein for gain or loss-of-signal activity can be used to understand the
signaling at the pathway-level as they relate to the geneTP53. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
signaling function. To generalize the intended target to other relatable targets, this assay endpoint is annotated
to the dna binding intended target family, where the subfamily is tumor suppressor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: anti-p53 antibody is used to tag and quantify the level of tumor protein p53 protein. Changes in
the signals are indicative of protein expression changes as a cellular response to stress in the system
[GeneSymbol:TP53 | GenelD:7157 | Uniprot_SwissProt_Accession:P04637],

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

-------
2.2	Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle

-------
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

Baseline median absolute deviation for the assay (bmad): 0.097
Response cutoff threshold used to determine hit calls: 0.972
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity of Hoechst-33342 stained DNA is used to measure nuclear size as an identifier
of cell phenotypes to understand the morphology in the system.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.
3.1 Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.58 nM
Key positive control:

Camptothecin;Anisomycin

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

297 nM
Neutral vehicle control:

DMSO

NA

-------
3.2 Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than

-------
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1108	Number of chemicals tested: 1066

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

-------
{(mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 176.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 42

APR_HepG2_StressKinase_24hr

1.	General Information

1.1	Assay Title: Aprendica 24-hour HepG2 Human Liver Cell Assay for Stress Kinase

1.2	Assay Summary: APR HepG2 24hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 384-well plate.
APR_HepG2_StressKinase_24hr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_24hr assay. It is designed to make measurements of enzyme activity, a form of enzyme reporter,
as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the assay
component APR_HepG2_StressKinase_24hr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_StressKinase_24hr, was analyzed with bidirectional fitting relative to DMSO as the negative control
and baseline of activity. Using a type of enzyme reporter, measures of protein for gain or loss-of-signal activity
can be used to understand the signaling at the pathway-level as they relate to the gene . Furthermore, this
assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a signaling function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily is stress
response.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: anti-phospho-c-jun antibody is used to tag and quantify the level of phosphorylated jun proto-
oncogene protein. Changes in the signals are indicative of protein expression changes as a cellular response to
stress in the system [GeneSymbokJUN | GenelD:3725 | Uniprot_SwissProt_Accession:P05412],

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the

-------
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2	Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for

-------
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

Baseline median absolute deviation for the assay (bmad): 0.092
Response cutoff threshold used to determine hit calls: 0.924
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity of Hoechst-33342 stained DNA is used to measure nuclear size as an identifier
of cell phenotypes to understand the morphology in the system.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.58 nM
Key positive control:

Camptothecin;Anisomycin

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

297 nM
Neutral vehicle control:

DMSO

NA

-------
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where

-------
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1108	Number of chemicals tested: 1066

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

-------
Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

NA

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

NA

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

NA

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 127.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,

-------
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 44

APR_HepG2_CellCycleArrest_72hr

1.	General Information

1.1	Assay Title: Aprendica 72-hour HepG2 Human Liver Cell Assay for Cell Cyle Arrest

1.2	Assay Summary: APR HepG2 72hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 72 hours after chemical dosing in a 384-well plate.
APR_HepG2_CellCycleArrest_72hr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_72hr assay. It is designed to make measurements of cell phenotype, a form of morphology
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the
assay component APR_HepG2_CellCycleArrest_72hr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_CellCycleArrest_72hr, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of morphology reporter, measures of all nuclear dna for gain or
loss-of-signal activity can be used to understand the signaling at the pathway-level as they relate to the gene .
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily
is arrest.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Hoechst-33342 dye is used as a stain for DNA to understand morphology in the system.

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

-------
2.2	Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle

-------
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

Baseline median absolute deviation for the assay (bmad): 0.093
Response cutoff threshold used to determine hit calls: 0.927
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity of Hoechst-33342 stained DNA is used to measure nuclear size as an identifier
of cell phenotypes to understand the morphology in the system.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.
3.1 Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.58 nM
Key positive control:

Camptothecin;Anisomycin

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

297 nM
Neutral vehicle control:

DMSO

NA

-------
3.2 Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than

-------
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1108	Number of chemicals tested: 1066

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

-------
{(mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 168.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointlD:46

APR_HepG2_CellLoss_72hr

1.	General Information

1.1	Assay Title: Aprendica 72-hour HepG2 Human Livery Cell Assay for Cell Loss

1.2	Assay Summary: APR HepG2 72hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 72 hours after chemical dosing in a 384-well plate.
APR_HepG2_CellLoss_72hr is one of 10 assay component(s) measured or calculated from the APR_HepG2_72hr
assay. It is designed to make measurements of cell number, a form of viability reporter, as detected with
fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the assay component
APR_HepG2_CellLoss_72hr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_CellLoss_72hr, was analyzed with bidirectional fitting relative to DMSO as the negative control and
baseline of activity. Using a type of viability reporter, measures of all nuclear dna for gain or loss-of-signal
activity can be used to understand the viability at the cellular-level. Furthermore, this assay endpoint can be
referred to as a secondary readout, because this assay has produced multiple assay endpoints where this one
serves a viability function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cell cycle intended target family, where the subfamily is proliferation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Hoechst-33342 dye is used as a stain for DNA to understand viability in the system.

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2

Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to

-------
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of

-------
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

10	2

Standard minimum concentration tested:	Standard maximum concentration tested:

0.58 nM	297 nM

Key positive control:	Neutral vehicle control:

Camptothecin;Anisomycin;Paclitaxel;CCCP	DMSO

Baseline median absolute deviation for the assay (bmad): 0.089
Response cutoff threshold used to determine hit calls: 0.887
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity of anti-p53 antibody is used to tag and quantify the level of tumor protein p53
protein via fluorescent intensity. Changes in the signals are indicative of protein expression changes as a cellular
response to stress in the system.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window

-------
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag

-------
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1108	Number of chemicals tested: 1066

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

-------
((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 133.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointlD:48

APR_HepG2_MicrotubuleCSK_72hr

1.	General Information

1.1	Assay Title: Aprendica 72-hour HepG2 Human Liver Cell Assay for Microtubule CSK Stabilty

1.2	Assay Summary: APR HepG2 72hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 72 hours after chemical dosing in a 384-well plate.
APR_HepG2_MicrotubuleCSK_72hr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_72hr assay. It is designed to make measurements of protein conformation, a form of conformation
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the
assay component APR_HepG2_MicrotubuleCSK_72hr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_MicrotubuleCSK_72hr, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of conformation reporter, measures of protein for gain or loss-of-
signal activity can be used to understand the signaling at the cellular-level. Furthermore, this assay endpoint
can be referred to as a primary readout, because this assay has produced multiple assay endpoints where this
one serves a signaling function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the cell morphology intended target family, where the subfamily is cell conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: anti-a-tubulin antibody is used to tag and quantify the level of tubulin, alpha la protein. Changes
in the signals are indicative of protein expression changes as a cellular response to stress in the system
[GeneSymbokTUBAlA | GenelD:7846 | Uniprot_SwissProt_Accession:Q71U36],

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

-------
2.2	Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle

-------
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

Baseline median absolute deviation for the assay (bmad): 0.104
Response cutoff threshold used to determine hit calls: 1.038
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity of anti-p53 antibody is used to tag and quantify the level of tumor protein p53
protein via fluorescent intensity. Changes in the signals are indicative of protein expression changes as a cellular
response to stress in the system.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.
3.1 Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.58 nM
Key positive control:

Camptothecin;Anisomycin

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

297 nM
Neutral vehicle control:

DMSO

NA

-------
3.2 Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than

-------
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1092	Number of chemicals tested: 1051

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

-------
{(mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 144.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 50

APR_HepG2_MitoMass_72hr

1.	General Information

1.1	Assay Title: Aprendica 72-hour HepG2 Human Liver Cell Assay for Mitochondrial Mass

1.2	Assay Summary: APR HepG2 72hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 72 hours after chemical dosing in a 384-well plate.
APR_HepG2_MitoMass_72hr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_72hr assay. It is designed to make measurements of cell phenotype, a form of morphology
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the
assay component APR_HepG2_MitoMass_72hr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_MitoMass_72hr, was analyzed with bidirectional fitting relative to DMSO as the negative control
and baseline of activity. Using a type of morphology reporter, gain or loss-of-signal activity can be used to
understand changes in the signaling. Furthermore, this assay endpoint can be referred to as a primary readout,
because this assay has produced multiple assay endpoints where this one serves a signaling function. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell morphology
intended target family, where the subfamily is organelle conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: MitoTracker Red is used as a stain for the morphology of the mitochondria.

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2 Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to

-------
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of

-------
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

Baseline median absolute deviation for the assay (bmad): 0.068
Response cutoff threshold used to determine hit calls: 0.684
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity of anti-p53 antibody is used to tag and quantify the level of tumor protein p53
protein via fluorescent intensity. Changes in the signals are indicative of protein expression changes as a cellular
response to stress in the system.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.58 nM
Key positive control:

Paclitaxel;CCCP

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

297 nM
Neutral vehicle control:

DMSO

NA

-------
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag

-------
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1092	Number of chemicals tested: 1051

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

-------
((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 128.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointlD:52

APR_HepG2_MitoMembPot_72hr

1.	General Information

1.1	Assay Title: Aprendica 72-hour HepG2 Human Liver Cell Assay for Mitochondrial Membrane Potential

1.2	Assay Summary: APR HepG2 72hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 72 hours after chemical dosing in a 384-well plate.
APR_HepG2_MitoMembPot_72hr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_72hr assay. It is designed to make measurements of dye binding, a form of membrane potential
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the
assay component APR_HepG2_MitoMembPot_72hr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_MitoMembPot_72hr, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of membrane potential reporter, gain or loss-of-signal activity can
be used to understand changes in the signaling. Furthermore, this assay endpoint can be referred to as a primary
readout, because this assay has produced multiple assay endpoints where this one serves a signaling function.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell
morphology intended target family, where the subfamily is organelle conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: MitoTracker Red is used as a stain for the membrane potential of the mitochondria.

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2 Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to

-------
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of

-------
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

Baseline median absolute deviation for the assay (bmad): 0.073
Response cutoff threshold used to determine hit calls: 0.729
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity of anti-phospho-histone-H2AX antibody is used to tag and quantify the level of
phosphorylated H2A histone family, member X protein. Changes in the signals are indicative of protein
expression changes as a cellular response to stress in the system.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.58 nM
Key positive control:

Paclitaxel;CCCP

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

297 nM
Neutral vehicle control:

DMSO

NA

-------
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag

-------
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1092	Number of chemicals tested: 1051

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

-------
((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 147.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 54

APR_HepG2_MitoticArrest_72hr

1.	General Information

1.1	Assay Title: Aprendica 72-hour HepG2 Human Liver Cell Assay for Mitotic Arrest

1.2	Assay Summary: APR HepG2 72hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 72 hours after chemical dosing in a 384-well plate.
APR_HepG2_MitoticArrest_72hr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_72hr assay. It is designed to make measurements of cell phenotype, a form of morphology
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the
assay component APR_HepG2_MitoticArrest_72hr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_MitoticArrest_72hr, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of morphology reporter, measures of protein for gain or loss-of-
signal activity can be used to understand the signaling at the pathway-level as they relate to the gene .
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily
is arrest.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: anti-phospho-histone-H3 antibody is used to tag and quantify the level of phosphorylated H3
histone, family 3A protein. Changes in the signals are indicative of protein expression changes as a cellular
response to stress in the system [GeneSymbol:H3F3A | GenelD:3020 | Uniprot_SwissProt_Accession:P84243],

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the

-------
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2	Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for

-------
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

Baseline median absolute deviation for the assay (bmad): 0.142
Response cutoff threshold used to determine hit calls: 1.419
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity of anti-phospho-histone-H2AX antibody is used to tag and quantify the level of
phosphorylated H2A histone family, member X protein. Changes in the signals are indicative of protein
expression changes as a cellular response to stress in the system.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.58 nM
Key positive control:

Paclitaxel;CCCP

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

297 nM
Neutral vehicle control:

DMSO

NA

-------
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where

-------
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1092	Number of chemicals tested: 1051

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

-------
Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

NA

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

NA

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

NA

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 156.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,

-------
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 56

APR_HepG2_NuclearSize_72hr

1.	General Information

1.1	Assay Title: Aprendica 72-hour HepG2 Human Liver Cell Assay for Nuclear Size

1.2	Assay Summary: APR HepG2 72hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 72 hours after chemical dosing in a 384-well plate.
APR_HepG2_NuclearSize_72hr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_72hr assay. It is designed to make measurements of cell phenotype, a form of morphology
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the
assay component APR_HepG2_NuclearSize_72hr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_NuclearSize_72hr, was analyzed with bidirectional fitting relative to DMSO as the negative control
and baseline of activity. Using a type of morphology reporter, measures of all nuclear dna for gain or loss-of-
signal activity can be used to understand the signaling at the nuclear-level. Furthermore, this assay endpoint
can be referred to as a primary readout, because this assay has produced multiple assay endpoints where this
one serves a signaling function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the cell morphology intended target family, where the subfamily is organelle conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Hoechst-33342 dye is used as a stain for DNA to understand morphology in the system.

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2 Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to

-------
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of

-------
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

Baseline median absolute deviation for the assay (bmad): 0.015
Response cutoff threshold used to determine hit calls: 0.263
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity of anti-phospho-histone-H2AX antibody is used to tag and quantify the level of
phosphorylated H2A histone family, member X protein. Changes in the signals are indicative of protein
expression changes as a cellular response to stress in the system.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.58 nM
Key positive control:

Paclitaxel;CCCP

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

297 nM
Neutral vehicle control:

DMSO

NA

-------
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag

-------
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1108	Number of chemicals tested: 1066

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

-------
((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 140.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 58

APR_HepG2_P-H2AX_72hr

1.	General Information

1.1	Assay Title: Aprendica 72-hour HepG2 Human Liver Cell Assay for H2AX Phosphorylation

1.2	Assay Summary: APR HepG2 72hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 72 hours after chemical dosing in a 384-well plate. APR_HepG2_P-
H2AX_72hr is one of 10 assay component(s) measured or calculated from the APR_HepG2_72hr assay. It is
designed to make measurements of dna content, a form of viability reporter, as detected with fluorescence
intensity signals by HCS Fluorescent Imaging technology. Data from the assay component APR_HepG2_P-
H2AX_72hr was analyzed into 1 assay endpoint. This assay endpoint, APR_HepG2_P-H2AX_72hr, was analyzed
with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of
viability reporter, measures of protein for gain or loss-of-signal activity can be used to understand the signaling
at the pathway-level as they relate to the gene . Furthermore, this assay endpoint can be referred to as a
primary readout, because this assay has produced multiple assay endpoints where this one serves a signaling
function. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
dna binding intended target family, where the subfamily is cellular response to DNA damage stimulus.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Anti-phospho-histone-H2AX antibody is used to tag and quantify the level of phosphorylated H2A
histone family, member X protein. Changes in the signals are indicative of protein expression changes as a
cellular response to stress in the system [GeneSymbol:H2AFX | GenelD:3014 |
Uniprot_SwissProt_Accession:P16104],

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the

-------
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2	Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for

-------
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

10	1

Standard minimum concentration tested:	Standard maximum concentration tested:

0.58 nM	297 nM

Key positive control:	Neutral vehicle control:

Camptothecin;Anisomycin	DMSO

Baseline median absolute deviation for the assay (bmad): 0.11
Response cutoff threshold used to determine hit calls: 1.097
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: Fluorescent intensity of anti-phospho-c-jun antibody is used to tag and quantify the level of
phosphorylated jun proto-oncogene protein. Changes in the signals are indicative of protein expression changes
as a cellular response to stress in the system.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a

-------
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where

-------
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1108	Number of chemicals tested: 1066

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

-------
Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

NA

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

NA

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

NA

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 154.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,

-------
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 60

APR_HepG2_p53Act_72hr

1.	General Information

1.1	Assay Title: Aprendica 72-hour HepG2 Human Liver Cell Assay for p53 Activation

1.2	Assay Summary: APR HepG2 72hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 72 hours after chemical dosing in a 384-well plate.
APR_HepG2_p53Act_72hr is one of 10 assay component(s) measured or calculated from the APR_HepG2_72hr
assay. It is designed to make measurements of dna content, a form of viability reporter, as detected with
fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the assay component
APR_HepG2_p53Act_72hr was analyzed into 1 assay endpoint. This assay endpoint, APR_HepG2_p53Act_72hr,
was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using
a type of viability reporter, measures of protein for gain or loss-of-signal activity can be used to understand the
signaling at the pathway-level as they relate to the geneTP53. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
signaling function. To generalize the intended target to other relatable targets, this assay endpoint is annotated
to the dna binding intended target family, where the subfamily is tumor suppressor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: anti-p53 antibody is used to tag and quantify the level of tumor protein p53 protein. Changes in
the signals are indicative of protein expression changes as a cellular response to stress in the system
[GeneSymbol:TP53 | GenelD:7157 | Uniprot_SwissProt_Accession:P04637],

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

-------
2.2	Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle

-------
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

Baseline median absolute deviation for the assay (bmad): 0.118
Response cutoff threshold used to determine hit calls: 1.182
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Fluorescent intensity of anti-phospho-c-jun antibody is used to tag and quantify the level of
phosphorylated jun proto-oncogene protein. Changes in the signals are indicative of protein expression changes
as a cellular response to stress in the system.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.
3.1 Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.58 nM
Key positive control:

Camptothecin;Anisomycin

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

297 nM
Neutral vehicle control:

DMSO

NA

-------
3.2 Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than

-------
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1108	Number of chemicals tested: 1066

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

-------
{(mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 152.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 62

APR_HepG2_StressKinase_72hr

1.	General Information

1.1	Assay Title: Aprendica 72-hour HepG2 Human Liver Cell Assay for Stress Kinase

1.2	Assay Summary: APR HepG2 72hr is a cell-based, multiplexed-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 72 hours after chemical dosing in a 384-well plate.
APR_HepG2_StressKinase_72hr is one of 10 assay component(s) measured or calculated from the
APR_HepG2_72hr assay. It is designed to make measurements of enzyme activity, a form of enzyme reporter,
as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the assay
component APR_HepG2_StressKinase_72hr was analyzed into 1 assay endpoint. This assay endpoint,
APR_HepG2_StressKinase_72hr, was analyzed with bidirectional fitting relative to DMSO as the negative control
and baseline of activity. Using a type of enzyme reporter, measures of protein for gain or loss-of-signal activity
can be used to understand the signaling at the pathway-level as they relate to the gene . Furthermore, this
assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a signaling function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily is stress
response.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Apredica, a part of Cyprotex, is a preclinical Contract Research Organization (CRO) that provides
services including the CellCiphr High Content Imaging system.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; imaging and analysis software are available from Cellomics, Inc.

1.9	Assay Throughput: 384-well plate. Assay was conducted om human hepatocellular carcinoma cell line HepG2
(HB-8065) on 384-well plates. HCI was used to evaluate the effects of chemicals (in concentrations ranging from
0.4 to 200 uM) on HepG2 cells over a 72-hr exposure period.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: anti-phospho-c-jun antibody is used to tag and quantify the level of phosphorylated jun proto-
oncogene protein. Changes in the signals are indicative of protein expression changes as a cellular response to
stress in the system [GeneSymbokJUN | GenelD:3725 | Uniprot_SwissProt_Accession:P05412],

High-content screening methods in early safety assessments are critical to understand the complex biology
triggered by potentially harmful molecules in cells of target organs. High-content imaging (HCI) allows
simultaneous measurement of multiple cellular phenotypic changes, which can be an important tool for
evaluating the biological activity of chemicals. To analyze dynamic cellular changes, HCI was used to identify the
"tipping point" at which the cells did not show recover towards a normal phenotypic state. The goal of
integrating cellular toxicology models with HCS detection is to generate a platform that can both predict the

-------
safety risk liability of a compound with high specificity and sensitivity while also identifying mechanism(s) of
action of the toxic response.

2.2	Scientific Principles: Researchers used high-content imaging (HCI) (Giuliano et al. 2006), which applies
automated image analysis techniques to capture multiple cytological features using fluorescent labels, to
measure the concentration-dependent dynamic changes in the state of HepG2 cells. Although they are not fully
metabolically capable, HepG2 cells can undergo continuous proliferation in culture and have a demonstrated
capacity to predict hepatotoxicity of pharmaceutical compounds with good sensitivity and specificity (O'Brien
et al. 2006; Abraham et al. 2008). Researchers used computational tools to deconvolute HCI responses into cell-
state trajectories and to analyze them for their propensity to recover to normal (basal) conditions over the test
period. The critical concentrations associated with nonrecoverable cellular trajectories were determined, where
possible, and compiled into a novel chemical classification scheme.

2.3	Experimental System: adherent HepG2 cell line used. Human hepatocellular carcinoma cell line HepG2
(HB8065), used for the Brdll incorporation assay, was purchased from American Type Culture Collection (ATCC)
and used before passage 20. Cells were maintained and expanded in complete media [10% fetal bovine serum
(FBS) in Minimum Essential Medium with Earle's Balanced Salt Solution (MEM/EBSS) supplemented with
penicillin/streptomycin, L-glutamine, and non-essential amino acids]. Cell culture reagents were obtained from
VWR International. HepG2 cells were harvested by trypsinization and plated at different densities in 25 uL of
culture medium, depending on incubation time, in clear-bottom, 384-well microplates (Falcon 3962) that were
coated with rat tail collagen I. The cells were incubated overnight to allow attachment and spreading.

2.4	Metabolic Competence: HepG2 cells are an immortalized cell line with characteristics that differ from those of
normal hepatocytes. For example, these cells easily proliferate in culture but have limited metabolic activity
compared with primary hepatocytes. The HepG2 cell model used was a two-dimensional monoculture that does
not reflect the complex cell-to-cell interactions present in intact organs that have multiple cell types.

2.5	Exposure Regime: Cells were treated with dimethyl sulfoxide (DMSO) as a solvent control at a final concentration
of 0.5% v/v or with compounds in DMSO with a resulting final DMSO concentration of 0.5% v/v. Compound
treatment was done at concentrations of 0.39,0.78,1.56, 3.12, 6.24,12.5, 25, 50,100, and 200 uM in duplicate
on each plate. Cells were treated with compounds for 1, 24, or 72 hr. Carbonyl cyanide m-chloro-
phenylhydrazone (CCCP) and taxol were used as positive controls for mitochondrial function and cytoskeletal
stability, respectively; DMSO served as the negative control for this experiment. Cells were fixed by the direct
addition of 50 uL formaldehyde in Hank's Balanced Salt Solution (HBSS) to a final concentration of 3.7%. After
incubation in the fixation medium for 30 min at room temperature (293-298 K), cells were rinsed twice with
HBSS and treated with cell permeabilization buffer (16 uLof 0.5% Triton X-100) for 10 min at room temperature
(293-298 K) before labeling. For mitochondrial membrane potential and mitochondrial measurements, pre-
fixed cells were incubated with 50 uL of MitoTracker Red CMXRos (Invitrogen) at a concentration of 250 nM for
30 min before fixation. In the remaining cases, post-fixed cells were labeled by incubation with a multiplexed
mixture of primary antibodies in HBSS for 60 min at room temperature (293-298 K) to detect immunoreactivity
of c-Jun (1:500), phospho-histone H3 (1:100), phospho-histone H2A.X (1:200), p53 (1:400), alpha-tubulin (1:200)
and Hoechst 33342 (2 ug/mL). Cells were labeled for multiplexed imaging on two separate plates: a) Hoechst
33342, MitoTracker Red, phospho-histone H3, and alpha-tubulin, and b) Hoechst 33342, phospho-histone
H2A.x, and c-Jun. A final rinse with HBSS (50 uL) was performed before analysis. The primary and secondary
antibodies for the proteins were phospho-histone H3 (rabbit anti-phospho-histone H3 and FITC-donkey anti-
rabbit IgG), phospho-histone H2A.X (mouse anti-phospho-histone H2A.X and FITC-donkey anti-mouse IgG), c-
Jun (rabbit anti-phospho-c-Jun and Cy3-donkey anti-rabbit IgG), p53 (sheep anti-p53 and Cy5-donkey anti-sheep
IgG), alpha-tubulin (mouse anti-alpha-tubulin and Cy5-donkey anti-mouse IgG). These antibodies are available
as the CellCiphr HepG2 assay kit (Millipore). Digital images of each well were captured using a Cellomics
ArrayScan VTI (Thermo Scientific Cellomics) (0.8 NA objective, 0.63x optical coupler, and XF-93 filter set) at 20x
magnification. The images were acquired using the autofocus feature of the ArrayScan instrument, which entails
the following steps. First, the camera focuses on channel 1 (Hoechst 33342), where nuclei are identified. Second,
a Z offset of 1 um is used for capturing mitochondria (MitoTracker Red). Third, a Z offset of -2 um is used for

-------
capturing the cytoskeleton (tubulin). Six digital images were captured in each well and analyzed using
BioApplication software, which was provided with the instrument. All images were analyzed using the
Compartmental Analysis and Cell Cycle Analysis BioApplication software from Cellomics. The Cell Cycle
BioApplication software used the nuclear stain to identify valid cells, to measure nuclear diameter, and to
quantify DNA content. These features were used to calculate the average nuclear size, cell cycle arrest (ratio of
2N/4N), and cell number. The Compartmental Analysis BioApplication software module was used to measure
the average cell intensities for c-Jun phosphorylation, p53 protein activation, phospho-histone H2A.X activation,
mitochondria, and alpha-tubulin. The average intensity of mitochondria was used to define mitochondrial
membrane potential, and the total intensity was used to define mitochondrial mass. Data from cellular features
measured in the nucleus were excluded for wells where there was a significant decrease in nuclear size and
brightness. Detailed documentation about the algorithms and parameters used by the BioApplication software
for this analysis are available upon request. Cellular features were aggregated at the well level to quantify the
following end points: p53 activation, c-Jun activation (stress kinase), phospho-histone H2A.X (DNA damage
produced by oxidative stress), phospho-histone H3 (mitotic arrest), alpha-tubulin (microtubules), mitochondrial
membrane potential, mitochondrial mass, cell cycle arrest, nuclear size, and cell number.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

10	1

Standard minimum concentration tested:	Standard maximum concentration tested:

0.58 nM	297 nM

Key positive control:	Neutral vehicle control:

Camptothecin;Anisomycin	DMSO

Baseline median absolute deviation for the assay (bmad): 0.109
Response cutoff threshold used to determine hit calls: 1.088
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: Fluorescent intensity of anti-phospho-c-jun antibody is used to tag and quantify the level of
phosphorylated jun proto-oncogene protein. Changes in the signals are indicative of protein expression changes
as a cellular response to stress in the system.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a

-------
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration response data from the HCI experiment were smoothed and normalized for every
chemical, end point, and time. The raw concentration responses were smoothed using a Hamming window
(Blackman and Tukey 1958) of length 7. Next, the smoothed data (r) for end points measured on each plate
were normalized to the median response (r*) to calculate perturbations as the logarithm (base 2) of fold change
values. The normalized changes (x = log2 r/r*) were also standardized (z = (x - x*)/sigma * x) to evaluate the
importance of perturbations (where sigma * x is the standard deviation of x). The lowest effect concentration
(LEC) for each chemical and end point was calculated as the concentration that produced a fold change
perturbation at least one standard deviation (i.e., sigma * x = 1) above or below the median value. An absolute
perturbation > one standard deviation was called a "hit" (i.e., |sigma * x| >1). The LEC was estimated by
numerically solving for: |z| = 1 (the minimum value was selected if there were multiple solutions). The efficacy
was measured as maximum positive or negative value of x.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 7: bmadlO (Add a cutoff
value of 10 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated
using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where

-------
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1108	Number of chemicals tested: 1066

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

-------
Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

NA

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

NA

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

NA

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 153.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Giuliano KA, Gough AH, Taylor DL, Vernetti LA, Johnston PA. Early safety assessment using cellular
systems biology yields insights into mechanisms of action. J Biomol Screen. 2010 Aug;15(7):783-97. doi:
10.1177/1087057110376413. Epub 2010 Jul 16. PubMed PMID: 20639501., Shah, I., Setzer, R. W., Jack, J.,
Houck, K. A., Judson, R. S., Knudsen, T. B., Liu, J., Martin, M. T., Reif, D. M., Richard, A. M., Thomas, R. S., Crofton,

-------
K. M., Dix, D. J., & Kavlock, R. J. (2016). Using ToxCast™ Data to Reconstruct Dynamic Cell State Trajectories and
Estimate Toxicological Points of Departure. Environmental health perspectives, 124(7), 910-919.
https://doi.org/10.1289/ehp.1409029

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1825

Aru n A_Cel ITiter_h N P

1.	General Information

1.1	Assay Title: Viability Assessment in the ArunA Biomedical's Oris Neural Crest (hNC) Cell Migration Assay

1.2	Assay Summary: ArunA_CellTiter_hNP is a cell-based, single-readout assay that uses human H9-derived
neuroprogenitor stem cells (hNPl).Measurements were taken 72 hours after chemical dosing in a 96-well plate.
ArunA_CellTiter_hNP is an assay component measured from the ArunA_CellTiter_hNP assay. It is designed to
make measurements of viability, a form of viability reporter, as detected with fluorescence intensity signals by
HCS Fluorescent Imaging technology. Data from the assay component ArunA_CellTiter_hNP was analyzed at
the endpoint, ArunA_CellTiter_hNP, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of viability reporter, loss-of-signal activity can be used to
understand the viability. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: ArunA Biomedical is a privately owned biotechnology company and Contract Research
Organization (CRO) formerly providing toxicology screening using neural stem cell-based assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This protocol describes the use of ArunA Biomedical's hNPl Neural Progenitor Cells in
conjunction with an Oris Cell Migration Assembly Kit- FLEX to measure the effect of neuroactive compounds and
biologies that modulate proliferation and migration of neural progenitor cells. Certain uses of these products
may be covered by U.S. Pat. No. 6,200,806; No. 7,531,354,B2 licensed to ARUNA and U.S. Pat. No. 7,842,499;
No. 7,018,838; No. 10/597,118; No. 11/342,413; No. 11/890,740; and No. 12/195,007 licensed to PLATYPUS.

1.9	Assay Throughput: 96-well plate. ArunA systems offer high throughput chemical screening in a 96-well format
for the human neuroprogenitor (hNP) and human neural crest (hNC) migration and cell titer endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in fluorescence intensity related to the number of [3H]-thymidine labelled nuceli is
indicative of the viability of the system.

Chemical-induced perturbations to cellular key events across neurogenic outcomes, including migration
(neuroprogenitor and neural crest cells) and neural network formation (neurite length, neurite length, and
branch points for neurites), in vitro can inform on cell-based prioritization of neurodevelopmental hazard
potential.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is migration of neuroprogenitors and neural crest cells (NCCs). Impaired neuroprogenitor and NCC

-------
migration can lead to cerebral malformations and neurodevelopmental disorders, such as diencephalic-
mesencephalic dysplasia syndrome, cerebral palsy, cerebellar ataxia, and microcephaly.

2.3	Experimental System: adherent hNPl cell line used. The hN2 cell line is derived from neuroepithelial cells of
WA09 hESC (Thomson et al., 1998) origin according to a previously described protocol (Shin et al., 2005, Shin et
al., 2006). Importantly, as opposed to other methods of deriving neural progenitors through three-dimensional
neurosphere and embryoid body formations (Reubinoff et al., 2001, Zhang et al., 2001), these adherent
monolayer cultures are uniformly exposed to growth factors and/or morphogens throughout their propagation.
Neurogenic lineages from human embryonic stem cell line WA09 were locked into three neural differentiation
states: neuroprogenitor (hNPl - Cat no. 7009), neural crest (hNC - Cat no. 7029), and neural network (hNN -
Cat no. hNJL7014). Prior to differentiation into hN2 cells the population was confirmed karyotypically normal,
>95% nestin positive and <3% OCT-4 positive (Shin et al., 2006). The cells were produced in bulk by propagation
for an additional 2 weeks beyond the neuroepithelial stage by removal of bFGF from the media and
cryopreserved (ArunA Biomedical, Athens, GA) for end user applications. The hNP and hNC cell endpoints
consisted of cell titer and migratory measurements whereas hNN cell endpoints consisted of neuron count and
three neurite-specific metrics to assess network formation: neurite length, neurites per neuron, and branch
points for neurites. For this study, ArunA Bio extended the differentiation period of the hNN cells by
approximately two weeks more than in the original hN2 protocol. This allowed for increased neural network cell
axonation leading to better quantitation of network-specific endpoints. The utility of dissociated hN2 cultures
as an in vitro model for neurite outgrowth was assessed using automated high-content image analysis (HCA). In
addition, the molecular phenotype of these cells was examined using immunocytochemical staining.

2.4	Metabolic Competence: H9-derived cells are locked at different neuronal developmental states of interest to
DNT investigations of chemical exposures. Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: To assess the hNP and hNC migration and cell titer endpoints, 60,000 cells per well were
plated onto Matrigel in basal growth medium with LIF and bFGF in a 96-well plate format. Plates were incubated
for 16 h at 37C followed by a 72 h exposure to chemical in the test medium. For the migration endpoints, cells
were seeded and incubated in presence of 'seeding stoppers' to prevent cell migration and growth into the
detection zone. At the beginning of chemical exposure, stoppers were removed, and growth medium was
replaced with test medium. In the case of the stopper control wells, stoppers remained in place following
replacement of growth medium with test medium. Following 72 -h exposure to the test medium, cells were
stained at 37C for 30-60 min with calcein-AM. Cell viability in the detection zone was quantitated using a
Flexstation3 microplate reader (ex494 nm/em 517 nm). Corresponding cell titer endpoints were assessed for
the hNP and hNC cells using the Promega CellTiter Aqueous One Solution Cell Proliferation Assay (Cat no. G3581;
CellTiter 96). Finally, to gain insight into the mechanisms by which cells migrate into the detection zone, Ki-67
expression was quantified for 10 additional chemicals in the hNP and hNC systems. Additionally, cytochalasin D
was used as a positive control to inhibit cell migration. Supplementing the AB2 Basal Medium: 1.
Decontaminate the external surfaces of all supplement vials and the medium bottle with ethanol or isopropanol.
2. Aseptically open each supplement vial and add the amount indicated below to the basal medium with a
pipette. To make 100 ml of complete medium: AB2 Neural Medium 96 mL, ANS Supplement 2 mL, bFGF (50
ug/mL) 40 uL, LIF (10 ug/mL) 100 uL, L-Glutamine (200 mM) 1 mL, Penicillin (5,000 U/mL)/Streptomycin (5,000
Ug/mL) 1 mL. 3. Supplemented medium should be stored at 2-8C, protected from light. The complete medium
should be given a 2 week expiration date. Dispense the complete medium into aliquots to avoid repeated
heating prior to each use. Plate Coating Protocol for hNPl Neural Progenitor Expansion: To coat dishes perform
the following steps: 1. Thaw BD Matrigel at 2-8C overnight. Matrix will gel rapidly at 22C to 35C. Keep Matrigel
on ice and use pre-cooled pipettes, plates and tubes when preparing. Gelled Matrigel may be re-liquified if
placed at 2-8C on ice for 24 to 48 hours. 2. Handle using aseptic technique in a laminar flow hood. 3. Once BD
Matrigel Matrix is thawed, swirl vial to be sure that material is evenly dispersed. 4. Place thawed vial of BD
Matrigel Matrix in sterile area, decontaminate the external surfaces with ethanol or isopropanol and air dry. BD
Matrigel Matrix may be gently pipetted using a pre-cooled pipette to ensure homogeneity. 5. Dilute Matrigel
1:200 with cooled Dulbecco's Modified Eagle's Medium. Keep on ice. 6. Add 2 mL diluted Matrigel to a 35-mm
dish. Swirl to ensure the entire surface of the 35-mm dish is covered with the Matrigel solution. 7. Place dishes

-------
at 2-8Cfor 1-3 hours. 8. Rinse thoroughly with PBS. 9. Remove PBS and use immediately. Cell Thawing Protocol
for hNPl Neural Progenitor Expansion: To plate the cells perform the following steps: 1. Do not thaw the cells
until the recommended medium and appropriately coated plasticware and/or glassware are on hand. 2. Remove
the vial from liquid nitrogen and incubate in a 37C water bath. Closely monitor until the cells are completely
thawed. Maximum cell viability is dependent on the rapid and complete thawing of frozen cells. IMPORTANT:
Do not vortex the cells. Breaking cells down to single cell suspensions will significantly increase cell death. 3. As
soon as the cells are completely thawed, disinfect the outside of the vial with 70% ethanol or isopropanol.
Proceed immediately to the next step. 4. In a laminar flow hood, use a 1 or 2 mL pipette to transfer the cells to
a sterile 15 mL conical tube. Be careful to not introduce any bubbles during the transfer process. 5. Using a 10
mL pipette, slowly add dropwise 9 mL of fully supplemented AB2 Neural Medium (pre-warmed to 37C) to the
15 mL conical tube. IMPORTANT: Do not add the whole volume of medium at once to the cells. This may result
in decreased cell viability due to osmotic shock. 6. Gently mix the cell suspension by slow pipetting up and down
twice. Be careful to not introduce any bubbles. IMPORTANT: Do not vortex the cells. Breaking cells down to
single cell suspensions will significantly increase cell death. 7. Centrifuge the tube at room temperature at 200
x g for 4 minutes to pellet the cells. 8. Aspirate as much of the supernatant as possible. Steps 4-8 are necessary
to remove residual cryopreservative (DMSO). 9. Resuspend the cells in a total volume of 2 mL of fully
supplemented AB2 Neural Medium (pre-warmed to 37C). 10. Plate the 2 mL cell suspension of hNPl cells onto
a Matrigel-coated 35 mm dish. 11. Incubate the cells at 37C in a 5% C02 humidified incubator. 12. Exchange the
medium with fresh fully supplemented AB2 Neural Medium 24 hours post plating. Exchange with fresh medium
every other day thereafter. Use caution not to dislodge the cells; do not pipette media directly onto the cells
but rather onto the side of the culture dish. 13. Once the hNPl cells reach 100% confluence, they can be
dissociated manually for passaging (e.g., by cell scraping or by gentle and slow pipetting up and down to detach
the cells). The cells should be maintained at a high density at all times - the recommended passaging ratio is
1:2. Subculture of hNPl Cells: 1. Once the hNPl cells reach 100% confluence, carefully remove the medium
from the 35 mm dish. 2. Apply 2 mL fully supplemented AB2 Neural Medium (pre-warmed to 37C) to the cells
so that the cells can be harvested in fresh medium. 3. Using a pipette, manually detach the cells from the dish
by slow pipetting up and down the dish. Be careful to avoid introducing any bubbles. We recommend using a
200 uL or 1000 uL manual pipette to dislodge the attached cells. Alternatively, cells can be dislodged with a
sterile cell scraper. IMPORTANT: We do NOT recommend enzymatic methods for passaging the hNPl cells.
Doing so reduces the long term viability of the cells and can cause karyotypic abnormalities. 4. Plates should be
observed to ensure that all cells have been removed. This is most easily accomplished by working under a
dissection microscope within a laminar flow hood, but can also be achieved by frequent observation under a
bright field or phase contrast microscope. 5. Transfer the dissociated cells to a 50 mL conical tube. Inspect the
plate to ensure that all the cells have been removed. 6. If necessary, count the cells and calculate the cell
concentration. Cells can be centrifuged at 200 x g for 4 minutes in order to concentrate the cell suspension for
higher plating densities. 7. Plate the cells at the desired density into the appropriately coated flasks, plates or
wells in fully supplemented AB2 Neural Medium. We recommend keeping the cells at a high cell density by
passaging 1:2. 8. Incubate the cells at 37C in a 5% C02 humidified incubator. 9. Exchange the medium with fresh
fully supplemented AB2 Neural Medium 24 hours post plating. Exchange with fresh medium every other day
thereafter. Use caution not to dislodge the cells; do not pipette media directly onto the cells but rather onto
the side of the culture dish. Plate Coating Protocol for Cell Migration Assay: 1. Thaw BD Matrigel at 2-8C
overnight. Since it will gel rapidly at 22C to 35C, keep Matrigel on ice and use pre-cooled pipettes, plates and
tubes when preparing. Gelled Matrigel may re-liquefy if placed at 2-8C on ice for 24 to 48 hours. 2. Handle using
aseptic technique in a laminar flow hood. 3. Once the Matrigel is thawed, swirl vial to be sure that material is
evenly dispersed. 4. Place thawed vial of Matrigel in sterile area, decontaminate the external surfaces with
ethanol or isopropanol and air dry. Matrigel may be gently pipetted using a pre-cooled pipette to ensure
homogeneity. 5. Dilute Matrigel 1:200 with cooled AB2 Neural Culture Medium. Prepare 1 mL diluted Matrigel
for each column (8 wells) to be used. Keep on ice. 6. Add 100 uL of diluted Matrigel to each well intended for
use in the 96 well plate. 7. Tap the plate gently to ensure the entire surface of the well is covered with diluted
Matrigel. 8. Place dishes at 2-8C for 1-3 hours. 9. Remove the residual coating solution and rinse each well twice
with 200 uL of PBS per well. 10. Remove PBS and insert the Oris Cell Seeding Stoppers into the coated wells of
the 96-well plate. 11. Visually inspect to ensure that the Oris Cell Seeding Stoppers are firmly sealed. Cell
Migration Assay Protocol: 1. Harvest cells as described in steps 1-5 of section Subculture of hNPl Neural

-------
Progenitor cells. 2. Count cells and adjust cell suspension volume to the following concentration: 600,000
cells/mL 3. Plate 100 uL of suspended cells into each stoppered well for a cell density of 60,000 cells per well. 4.
Incubate the cells at 37C in a 5% C02 humidified incubator overnight (16-24 hours) to permit cell attachment.
5. Using the Oris Stopper Tool, remove all stoppers, except for those in "no migration controls" which will remain
in place until time of staining. 6. Carefully remove the seeding media from the wells and add 200 uL medium
containing the test compound per well. 7. Briefly examine the wells by phase contrast microscopy to ensure
continued adherence of the cells. 8. Incubate the cells at 37C/5% C02 for 72 hours to permit cell migration. 9.
After 72 hours, mix 5 uL Calcein AM, 5 uL Hoechst 33342, and 10 mL phenol red-free Neurobasal medium with
0.1% BSA. 10. Carefully remove stoppers from the "no migration controls". 11. Carefully remove the test
medium from all wells and add 100 uL of diluted Calcein/Hoechst solution to each well. 12. Incubate plate at
37C/5% C02for 30- 60 minutes with the lid on and in the dark (the darkness of a standard incubator will suffice).
13. For use with a fluorescence microplate reader, attach the Oris Detection Mask and read promptly for Calcein
fluorescence (ex 494 nm/ em 517 nm). 14. For image analysis, photomicrograph wells using epifluorescence
illumination with or without the Oris Detection mask. Images can then be analyzed using either area closure
with the calcein stain or number of cells (nuclei) using the Hoechst stain. ImageJ freeware available from the
NIH (http://rsbweb.nih.gov/ij/) can be used for migration data analysis as percent area closure or cellular
enumeration

Baseline median absolute deviation for the assay (bmad): 0.115
Response cutoff threshold used to determine hit calls: 0.344
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: The ArunA migration assay measures growth and survival in human embryonic neuroprogenitor
(hNP) and human neural crest (hNC) cells by tracking the presence/absence of viable nuclei movement into a
defined circular area in each microplate well. These different measurements are assessed following 72 hour
incubations with test chemical to evaluate the potential to disrupt neural migration in developing human
embryos.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

1.2 nM
Key positive control:

NA

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

NA

3.

Additionally, this assay was annotated to the intended target family of cell cycle.
Data Interpretation

-------
The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The migration of neuroprogenitor and neural crest cells into the detection zone was assessed by
comparing the percent migration of proliferative Ki-67 cells to total migrating cells following 72 hours exposure.
This was accomplished by determining the percentage of total cells migrating into the detection zone, i.e. the
migration index (Ml), compared to the percentage of migrating cells that expressed the Ki-67 proliferative
marker within the detection zone, i.e. the proliferative index (PI). Normalized response values for each assay
endpoint were calculated as resp = 100 x (rval-bval) / (pval-bval) where rval, bval, and pval correspond to the
raw value, the plate level DMSO control median, and the plate level positive/negative control median,
respectively. In the parallel viability assessment, normalized response was calculated as resp = log2(rval/bval).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay
plate ID (apid), of the corrected values (cval) for neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.

-------
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 63	Number of chemicals tested: 58

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
20

Inactive hit count: 0
-------
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

1.332

Neutral control median absolute deviation, by plate: nmad

0.103

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

7.79%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

NA

NA

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

((pmed - nmed) /sqrtfpmad2 + nmad2)

NA

NA

-------
Positive control signal-to-noise: ((pmed-nmed)/nmad)

NA

Positive control signal-to-background: (pmed/nmed)

NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed

NA

Negative control well median absolute deviation value, by plate: mmad

NA

Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

NA

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrtfmmad2 + nmad2)

NA

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

NA

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 7.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

-------
•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1826

Aru n A_Cel ITiter_h NC

1.	General Information

1.1	Assay Title: Viability Assessment in the ArunA Biomedical's Oris Neural Crest (hNC) Cell Migration Assay

1.2	Assay Summary: ArunA_CellTiter_hNC is a cell-based, single-readout assay that uses human H9-derived
embryonic neural crest stem cells (hNC). Measurements were taken 72 hours after chemical dosing in a 96-well
plate. ArunA_CellTiter_hNC is an assay component measured from the ArunA_CellTiter_hNC assay. It is
designed to make measurements of viability, a form of viability reporter, as detected with fluorescence intensity
signals by HCS Fluorescent Imaging technology. Data from the assay component ArunA_CellTiter_hNC was
analyzed at the endpoint, ArunA_CellTiter_hNC, in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of viability reporter, loss-of-signal activity can be used to
understand viability. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: ArunA Biomedical is a privately owned biotechnology company and Contract Research
Organization (CRO) formerly providing toxicology screening using neural stem cell-based assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This protocol describes the use of ArunA Biomedical's hNPl Neural Progenitor Cells in
conjunction with an Oris Cell Migration Assembly Kit- FLEX to measure the effect of neuroactive compounds and
biologies that modulate proliferation and migration of neural progenitor cells. Certain uses of these products
may be covered by U.S. Pat. No. 6,200,806; No. 7,531,354,B2 licensed to ARUNA and U.S. Pat. No. 7,842,499;
No. 7,018,838; No. 10/597,118; No. 11/342,413; No. 11/890,740; and No. 12/195,007 licensed to PLATYPUS.

1.9	Assay Throughput: 96-well plate. ArunA systems offer high throughput chemical screening in a 96-well format
for the human neuroprogenitor (hNP) and human neural crest (hNC) migration and cell titer endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in fluorescence intensity related to the number of [3H]-thymidine labelled nuceli is
indicative of the viability of the system.

Chemical-induced perturbations to cellular key events across neurogenic outcomes, including migration
(neuroprogenitor and neural crest cells) and neural network formation (neurite length, neurite length, and
branch points for neurites), in vitro can inform on cell-based prioritization of neurodevelopmental hazard
potential.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is migration of neuroprogenitors and neural crest cells (NCCs). Impaired neuroprogenitor and NCC

-------
migration can lead to cerebral malformations and neurodevelopmental disorders, such as diencephalic-
mesencephalic dysplasia syndrome, cerebral palsy, cerebellar ataxia, and microcephaly.

2.3	Experimental System: adherent hNC cell line used. The hN2 cell line is derived from neuroepithelial cells of
WA09 hESC (Thomson et al., 1998) origin according to a previously described protocol (Shin et al., 2005, Shin et
al., 2006). Importantly, as opposed to other methods of deriving neural progenitors through three-dimensional
neurosphere and embryoid body formations (Reubinoff et al., 2001, Zhang et al., 2001), these adherent
monolayer cultures are uniformly exposed to growth factors and/or morphogens throughout their propagation.
Neurogenic lineages from human embryonic stem cell line WA09 were locked into three neural differentiation
states: neuroprogenitor (hNPl - Cat no. 7009), neural crest (hNC - Cat no. 7029), and neural network (hNN -
Cat no. hNJL7014). Prior to differentiation into hN2 cells the population was confirmed karyotypically normal,
>95% nestin positive and <3% OCT-4 positive (Shin et al., 2006). The cells were produced in bulk by propagation
for an additional 2 weeks beyond the neuroepithelial stage by removal of bFGF from the media and
cryopreserved (ArunA Biomedical, Athens, GA) for end user applications. The hNP and hNC cell endpoints
consisted of cell titer and migratory measurements whereas hNN cell endpoints consisted of neuron count and
three neurite-specific metrics to assess network formation: neurite length, neurites per neuron, and branch
points for neurites. For this study, ArunA Bio extended the differentiation period of the hNN cells by
approximately two weeks more than in the original hN2 protocol. This allowed for increased neural network cell
axonation leading to better quantitation of network-specific endpoints. The utility of dissociated hN2 cultures
as an in vitro model for neurite outgrowth was assessed using automated high-content image analysis (HCA). In
addition, the molecular phenotype of these cells was examined using immunocytochemical staining.

2.4	Metabolic Competence: H9-derived cells are locked at different neuronal developmental states of interest to
DNT investigations of chemical exposures. Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: To assess the hNP and hNC migration and cell titer endpoints, 60,000 cells per well were
plated onto Matrigel in basal growth medium with LIF and bFGF in a 96-well plate format. Plates were incubated
for 16 h at 37C followed by a 72 h exposure to chemical in the test medium. For the migration endpoints, cells
were seeded and incubated in presence of 'seeding stoppers' to prevent cell migration and growth into the
detection zone. At the beginning of chemical exposure, stoppers were removed, and growth medium was
replaced with test medium. In the case of the stopper control wells, stoppers remained in place following
replacement of growth medium with test medium. Following 72 -h exposure to the test medium, cells were
stained at 37C for 30-60 min with calcein-AM. Cell viability in the detection zone was quantitated using a
Flexstation3 microplate reader (ex494 nm/em 517 nm). Corresponding cell titer endpoints were assessed for
the hNP and hNC cells using the Promega CellTiter Aqueous One Solution Cell Proliferation Assay (Cat no. G3581;
CellTiter 96). Finally, to gain insight into the mechanisms by which cells migrate into the detection zone, Ki-67
expression was quantified for 10 additional chemicals in the hNP and hNC systems. Additionally, cytochalasin D
was used as a positive control to inhibit cell migration. Supplementing the AB2 Basal Medium: 1.
Decontaminate the external surfaces of all supplement vials and the medium bottle with ethanol or isopropanol.
2. Aseptically open each supplement vial and add the amount indicated below to the basal medium with a
pipette. To make 100 ml of complete medium: AB2 Neural Medium 96 mL, ANS Supplement 2 mL, bFGF (50
ug/mL) 40 uL, LIF (10 ug/mL) 100 uL, L-Glutamine (200 mM) 1 mL, Penicillin (5,000 U/mL)/Streptomycin (5,000
Ug/mL) 1 mL. 3. Supplemented medium should be stored at 2-8C, protected from light. The complete medium
should be given a 2 week expiration date. Dispense the complete medium into aliquots to avoid repeated
heating prior to each use. Plate Coating Protocol for hNPl Neural Progenitor Expansion: To coat dishes perform
the following steps: 1. Thaw BD Matrigel at 2-8C overnight. Matrix will gel rapidly at 22C to 35C. Keep Matrigel
on ice and use pre-cooled pipettes, plates and tubes when preparing. Gelled Matrigel may be re-liquified if
placed at 2-8C on ice for 24 to 48 hours. 2. Handle using aseptic technique in a laminar flow hood. 3. Once BD
Matrigel Matrix is thawed, swirl vial to be sure that material is evenly dispersed. 4. Place thawed vial of BD
Matrigel Matrix in sterile area, decontaminate the external surfaces with ethanol or isopropanol and air dry. BD
Matrigel Matrix may be gently pipetted using a pre-cooled pipette to ensure homogeneity. 5. Dilute Matrigel
1:200 with cooled Dulbecco's Modified Eagle's Medium. Keep on ice. 6. Add 2 mL diluted Matrigel to a 35-mm
dish. Swirl to ensure the entire surface of the 35-mm dish is covered with the Matrigel solution. 7. Place dishes

-------
at 2-8Cfor 1-3 hours. 8. Rinse thoroughly with PBS. 9. Remove PBS and use immediately. Cell Thawing Protocol
for hNPl Neural Progenitor Expansion: To plate the cells perform the following steps: 1. Do not thaw the cells
until the recommended medium and appropriately coated plasticware and/or glassware are on hand. 2. Remove
the vial from liquid nitrogen and incubate in a 37C water bath. Closely monitor until the cells are completely
thawed. Maximum cell viability is dependent on the rapid and complete thawing of frozen cells. IMPORTANT:
Do not vortex the cells. Breaking cells down to single cell suspensions will significantly increase cell death. 3. As
soon as the cells are completely thawed, disinfect the outside of the vial with 70% ethanol or isopropanol.
Proceed immediately to the next step. 4. In a laminar flow hood, use a 1 or 2 mL pipette to transfer the cells to
a sterile 15 mL conical tube. Be careful to not introduce any bubbles during the transfer process. 5. Using a 10
mL pipette, slowly add dropwise 9 mL of fully supplemented AB2 Neural Medium (pre-warmed to 37C) to the
15 mL conical tube. IMPORTANT: Do not add the whole volume of medium at once to the cells. This may result
in decreased cell viability due to osmotic shock. 6. Gently mix the cell suspension by slow pipetting up and down
twice. Be careful to not introduce any bubbles. IMPORTANT: Do not vortex the cells. Breaking cells down to
single cell suspensions will significantly increase cell death. 7. Centrifuge the tube at room temperature at 200
x g for 4 minutes to pellet the cells. 8. Aspirate as much of the supernatant as possible. Steps 4-8 are necessary
to remove residual cryopreservative (DMSO). 9. Resuspend the cells in a total volume of 2 mL of fully
supplemented AB2 Neural Medium (pre-warmed to 37C). 10. Plate the 2 mL cell suspension of hNPl cells onto
a Matrigel-coated 35 mm dish. 11. Incubate the cells at 37C in a 5% C02 humidified incubator. 12. Exchange the
medium with fresh fully supplemented AB2 Neural Medium 24 hours post plating. Exchange with fresh medium
every other day thereafter. Use caution not to dislodge the cells; do not pipette media directly onto the cells
but rather onto the side of the culture dish. 13. Once the hNPl cells reach 100% confluence, they can be
dissociated manually for passaging (e.g., by cell scraping or by gentle and slow pipetting up and down to detach
the cells). The cells should be maintained at a high density at all times - the recommended passaging ratio is
1:2. Subculture of hNPl Cells: 1. Once the hNPl cells reach 100% confluence, carefully remove the medium
from the 35 mm dish. 2. Apply 2 mL fully supplemented AB2 Neural Medium (pre-warmed to 37C) to the cells
so that the cells can be harvested in fresh medium. 3. Using a pipette, manually detach the cells from the dish
by slow pipetting up and down the dish. Be careful to avoid introducing any bubbles. We recommend using a
200 uL or 1000 uL manual pipette to dislodge the attached cells. Alternatively, cells can be dislodged with a
sterile cell scraper. IMPORTANT: We do NOT recommend enzymatic methods for passaging the hNPl cells.
Doing so reduces the long term viability of the cells and can cause karyotypic abnormalities. 4. Plates should be
observed to ensure that all cells have been removed. This is most easily accomplished by working under a
dissection microscope within a laminar flow hood, but can also be achieved by frequent observation under a
bright field or phase contrast microscope. 5. Transfer the dissociated cells to a 50 mL conical tube. Inspect the
plate to ensure that all the cells have been removed. 6. If necessary, count the cells and calculate the cell
concentration. Cells can be centrifuged at 200 x g for 4 minutes in order to concentrate the cell suspension for
higher plating densities. 7. Plate the cells at the desired density into the appropriately coated flasks, plates or
wells in fully supplemented AB2 Neural Medium. We recommend keeping the cells at a high cell density by
passaging 1:2. 8. Incubate the cells at 37C in a 5% C02 humidified incubator. 9. Exchange the medium with fresh
fully supplemented AB2 Neural Medium 24 hours post plating. Exchange with fresh medium every other day
thereafter. Use caution not to dislodge the cells; do not pipette media directly onto the cells but rather onto
the side of the culture dish. Plate Coating Protocol for Cell Migration Assay: 1. Thaw BD Matrigel at 2-8C
overnight. Since it will gel rapidly at 22C to 35C, keep Matrigel on ice and use pre-cooled pipettes, plates and
tubes when preparing. Gelled Matrigel may re-liquefy if placed at 2-8C on ice for 24 to 48 hours. 2. Handle using
aseptic technique in a laminar flow hood. 3. Once the Matrigel is thawed, swirl vial to be sure that material is
evenly dispersed. 4. Place thawed vial of Matrigel in sterile area, decontaminate the external surfaces with
ethanol or isopropanol and air dry. Matrigel may be gently pipetted using a pre-cooled pipette to ensure
homogeneity. 5. Dilute Matrigel 1:200 with cooled AB2 Neural Culture Medium. Prepare 1 mL diluted Matrigel
for each column (8 wells) to be used. Keep on ice. 6. Add 100 uL of diluted Matrigel to each well intended for
use in the 96 well plate. 7. Tap the plate gently to ensure the entire surface of the well is covered with diluted
Matrigel. 8. Place dishes at 2-8C for 1-3 hours. 9. Remove the residual coating solution and rinse each well twice
with 200 uL of PBS per well. 10. Remove PBS and insert the Oris Cell Seeding Stoppers into the coated wells of
the 96-well plate. 11. Visually inspect to ensure that the Oris Cell Seeding Stoppers are firmly sealed. Cell
Migration Assay Protocol: 1. Harvest cells as described in steps 1-5 of section Subculture of hNPl Neural

-------
Progenitor cells. 2. Count cells and adjust cell suspension volume to the following concentration: 600,000
cells/mL 3. Plate 100 uL of suspended cells into each stoppered well for a cell density of 60,000 cells per well. 4.
Incubate the cells at 37C in a 5% C02 humidified incubator overnight (16-24 hours) to permit cell attachment.
5. Using the Oris Stopper Tool, remove all stoppers, except for those in "no migration controls" which will remain
in place until time of staining. 6. Carefully remove the seeding media from the wells and add 200 uL medium
containing the test compound per well. 7. Briefly examine the wells by phase contrast microscopy to ensure
continued adherence of the cells. 8. Incubate the cells at 37C/5% C02 for 72 hours to permit cell migration. 9.
After 72 hours, mix 5 uL Calcein AM, 5 uL Hoechst 33342, and 10 mL phenol red-free Neurobasal medium with
0.1% BSA. 10. Carefully remove stoppers from the "no migration controls". 11. Carefully remove the test
medium from all wells and add 100 uL of diluted Calcein/Hoechst solution to each well. 12. Incubate plate at
37C/5% C02for 30- 60 minutes with the lid on and in the dark (the darkness of a standard incubator will suffice).
13. For use with a fluorescence microplate reader, attach the Oris Detection Mask and read promptly for Calcein
fluorescence (ex 494 nm/ em 517 nm). 14. For image analysis, photomicrograph wells using epifluorescence
illumination with or without the Oris Detection mask. Images can then be analyzed using either area closure
with the calcein stain or number of cells (nuclei) using the Hoechst stain. ImageJ freeware available from the
NIH (http://rsbweb.nih.gov/ij/) can be used for migration data analysis as percent area closure or cellular
enumeration

Baseline median absolute deviation for the assay (bmad): 0.127
Response cutoff threshold used to determine hit calls: 0.381
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: The ArunA migration assay measures growth and survival in human embryonic neuroprogenitor
(hNP) and human neural crest (hNC) cells by tracking the presence/absence of viable nuclei movement into a
defined circular area in each microplate well. These different measurements are assessed following 72 hour
incubations with test chemical to evaluate the potential to disrupt neural migration in developing human
embryos.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

1.2 nM
Key positive control:

NA

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

NA

3.

Additionally, this assay was annotated to the intended target family of cell cycle.
Data Interpretation

-------
The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The migration of neuroprogenitor and neural crest cells into the detection zone was assessed by
comparing the percent migration of proliferative Ki-67 cells to total migrating cells following 72 hours exposure.
This was accomplished by determining the percentage of total cells migrating into the detection zone, i.e. the
migration index (Ml), compared to the percentage of migrating cells that expressed the Ki-67 proliferative
marker within the detection zone, i.e. the proliferative index (PI). Normalized response values for each assay
endpoint were calculated as resp = 100 x (rval-bval) / (pval-bval) where rval, bval, and pval correspond to the
raw value, the plate level DMSO control median, and the plate level positive/negative control median,
respectively. In the parallel viability assessment, normalized response was calculated as resp = log2(rval/bval).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay
plate ID (apid), of the corrected values (cval) for neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.

-------
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 63	Number of chemicals tested: 58

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
14

Inactive hit count: 0
-------
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0.248

Neutral control median absolute deviation, by plate: nmad

0.021

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

9.44%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

NA

NA

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

((pmed - nmed) /sqrtfpmad2 + nmad2)

NA

NA

-------
Positive control signal-to-noise: ((pmed-nmed)/nmad)

NA

Positive control signal-to-background: (pmed/nmed)

NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed

NA

Negative control well median absolute deviation value, by plate: mmad

NA

Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

NA

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrtfmmad2 + nmad2)

NA

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

NA

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 5.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

-------
•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1827

Aru n A_M igration_h N P

1.	General Information

1.1	Assay Title: ArunA Biomedical's Oris Neuroprogenitor (hNP) Cell Migration Assay

1.2	Assay Summary: ArunA_Migration_hNP is a cell-based, single-readout assay that uses human H9-derived
neuroprogenitor stem cells (hNPl). Measurements were taken 72 hours after chemical dosing in a 96-well plate.
ArunA_Migration_hNP is an assay component measured from the ArunA_Migration_hNP assay. It is designed
to make measurements of cell migration, a form of distribution reporter, as detected with fluorescence
intensity signals by HCS Fluorescent Imaging technology. Data from the assay component
ArunA_Migration_hNP was analyzed at the endpoint, ArunA_Migration_hNP, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of distribution reporter,
loss-of-signal activity can be used to understand the cell migration. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the neurodevelopment intended target family, where the
subfamily is migration.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: ArunA Biomedical is a privately owned biotechnology company and Contract Research
Organization (CRO) formerly providing toxicology screening using neural stem cell-based assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This protocol describes the use of ArunA Biomedical's hNPl Neural Progenitor Cells in
conjunction with an Oris Cell Migration Assembly Kit- FLEX to measure the effect of neuroactive compounds and
biologies that modulate proliferation and migration of neural progenitor cells. Certain uses of these products
may be covered by U.S. Pat. No. 6,200,806; No. 7,531,354,B2 licensed to ARUNA and U.S. Pat. No. 7,842,499;
No. 7,018,838; No. 10/597,118; No. 11/342,413; No. 11/890,740; and No. 12/195,007 licensed to PLATYPUS.

1.9	Assay Throughput: 96-well plate. ArunA systems offer high throughput chemical screening in a 96-well format
for the human neuroprogenitor (hNP) and human neural crest (hNC) migration and cell titer endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in fluorescence intensity related to Ki-67 expression is indicative of the cell migration.

Chemical-induced perturbations to cellular key events across neurogenic outcomes, including migration
(neuroprogenitor and neural crest cells) and neural network formation (neurite length, neurite length, and
branch points for neurites), in vitro can inform on cell-based prioritization of neurodevelopmental hazard
potential.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is migration of neuroprogenitors and neural crest cells (NCCs). Impaired neuroprogenitor and NCC

-------
migration can lead to cerebral malformations and neurodevelopmental disorders, such as diencephalic-
mesencephalic dysplasia syndrome, cerebral palsy, cerebellar ataxia, and microcephaly.

2.3	Experimental System: adherent hNPl cell line used. The hN2 cell line is derived from neuroepithelial cells of
WA09 hESC (Thomson et al., 1998) origin according to a previously described protocol (Shin et al., 2005, Shin et
al., 2006). Importantly, as opposed to other methods of deriving neural progenitors through three-dimensional
neurosphere and embryoid body formations (Reubinoff et al., 2001, Zhang et al., 2001), these adherent
monolayer cultures are uniformly exposed to growth factors and/or morphogens throughout their propagation.
Neurogenic lineages from human embryonic stem cell line WA09 were locked into three neural differentiation
states: neuroprogenitor (hNPl - Cat no. 7009), neural crest (hNC - Cat no. 7029), and neural network (hNN -
Cat no. hNJL7014). Prior to differentiation into hN2 cells the population was confirmed karyotypically normal,
>95% nestin positive and <3% OCT-4 positive (Shin et al., 2006). The cells were produced in bulk by propagation
for an additional 2 weeks beyond the neuroepithelial stage by removal of bFGF from the media and
cryopreserved (ArunA Biomedical, Athens, GA) for end user applications. The hNP and hNC cell endpoints
consisted of cell titer and migratory measurements whereas hNN cell endpoints consisted of neuron count and
three neurite-specific metrics to assess network formation: neurite length, neurites per neuron, and branch
points for neurites. For this study, ArunA Bio extended the differentiation period of the hNN cells by
approximately two weeks more than in the original hN2 protocol. This allowed for increased neural network cell
axonation leading to better quantitation of network-specific endpoints. The utility of dissociated hN2 cultures
as an in vitro model for neurite outgrowth was assessed using automated high-content image analysis (HCA). In
addition, the molecular phenotype of these cells was examined using immunocytochemical staining.

2.4	Metabolic Competence: H9-derived cells are locked at different neuronal developmental states of interest to
DNT investigations of chemical exposures. Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: To assess the hNP and hNC migration and cell titer endpoints, 60,000 cells per well were
plated onto Matrigel in basal growth medium with LIF and bFGF in a 96-well plate format. Plates were incubated
for 16 h at 37C followed by a 72 h exposure to chemical in the test medium. For the migration endpoints, cells
were seeded and incubated in presence of 'seeding stoppers' to prevent cell migration and growth into the
detection zone. At the beginning of chemical exposure, stoppers were removed, and growth medium was
replaced with test medium. In the case of the stopper control wells, stoppers remained in place following
replacement of growth medium with test medium. Following 72 -h exposure to the test medium, cells were
stained at 37C for 30-60 min with calcein-AM. Cell viability in the detection zone was quantitated using a
Flexstation3 microplate reader (ex494 nm/em 517 nm). Corresponding cell titer endpoints were assessed for
the hNP and hNC cells using the Promega CellTiter Aqueous One Solution Cell Proliferation Assay (Cat no. G3581;
CellTiter 96). Finally, to gain insight into the mechanisms by which cells migrate into the detection zone, Ki-67
expression was quantified for 10 additional chemicals in the hNP and hNC systems. Additionally, cytochalasin D
was used as a positive control to inhibit cell migration. Supplementing the AB2 Basal Medium: 1.
Decontaminate the external surfaces of all supplement vials and the medium bottle with ethanol or isopropanol.
2. Aseptically open each supplement vial and add the amount indicated below to the basal medium with a
pipette. To make 100 ml of complete medium: AB2 Neural Medium 96 mL, ANS Supplement 2 mL, bFGF (50
ug/mL) 40 uL, LIF (10 ug/mL) 100 uL, L-Glutamine (200 mM) 1 mL, Penicillin (5,000 U/mL)/Streptomycin (5,000
Ug/mL) 1 mL. 3. Supplemented medium should be stored at 2-8C, protected from light. The complete medium
should be given a 2 week expiration date. Dispense the complete medium into aliquots to avoid repeated
heating prior to each use. Plate Coating Protocol for hNPl Neural Progenitor Expansion: To coat dishes perform
the following steps: 1. Thaw BD Matrigel at 2-8C overnight. Matrix will gel rapidly at 22C to 35C. Keep Matrigel
on ice and use pre-cooled pipettes, plates and tubes when preparing. Gelled Matrigel may be re-liquified if
placed at 2-8C on ice for 24 to 48 hours. 2. Handle using aseptic technique in a laminar flow hood. 3. Once BD
Matrigel Matrix is thawed, swirl vial to be sure that material is evenly dispersed. 4. Place thawed vial of BD
Matrigel Matrix in sterile area, decontaminate the external surfaces with ethanol or isopropanol and air dry. BD
Matrigel Matrix may be gently pipetted using a pre-cooled pipette to ensure homogeneity. 5. Dilute Matrigel
1:200 with cooled Dulbecco's Modified Eagle's Medium. Keep on ice. 6. Add 2 mL diluted Matrigel to a 35-mm
dish. Swirl to ensure the entire surface of the 35-mm dish is covered with the Matrigel solution. 7. Place dishes

-------
at 2-8Cfor 1-3 hours. 8. Rinse thoroughly with PBS. 9. Remove PBS and use immediately. Cell Thawing Protocol
for hNPl Neural Progenitor Expansion: To plate the cells perform the following steps: 1. Do not thaw the cells
until the recommended medium and appropriately coated plasticware and/or glassware are on hand. 2. Remove
the vial from liquid nitrogen and incubate in a 37C water bath. Closely monitor until the cells are completely
thawed. Maximum cell viability is dependent on the rapid and complete thawing of frozen cells. IMPORTANT:
Do not vortex the cells. Breaking cells down to single cell suspensions will significantly increase cell death. 3. As
soon as the cells are completely thawed, disinfect the outside of the vial with 70% ethanol or isopropanol.
Proceed immediately to the next step. 4. In a laminar flow hood, use a 1 or 2 mL pipette to transfer the cells to
a sterile 15 mL conical tube. Be careful to not introduce any bubbles during the transfer process. 5. Using a 10
mL pipette, slowly add dropwise 9 mL of fully supplemented AB2 Neural Medium (pre-warmed to 37C) to the
15 mL conical tube. IMPORTANT: Do not add the whole volume of medium at once to the cells. This may result
in decreased cell viability due to osmotic shock. 6. Gently mix the cell suspension by slow pipetting up and down
twice. Be careful to not introduce any bubbles. IMPORTANT: Do not vortex the cells. Breaking cells down to
single cell suspensions will significantly increase cell death. 7. Centrifuge the tube at room temperature at 200
x g for 4 minutes to pellet the cells. 8. Aspirate as much of the supernatant as possible. Steps 4-8 are necessary
to remove residual cryopreservative (DMSO). 9. Resuspend the cells in a total volume of 2 mL of fully
supplemented AB2 Neural Medium (pre-warmed to 37C). 10. Plate the 2 mL cell suspension of hNPl cells onto
a Matrigel-coated 35 mm dish. 11. Incubate the cells at 37C in a 5% C02 humidified incubator. 12. Exchange the
medium with fresh fully supplemented AB2 Neural Medium 24 hours post plating. Exchange with fresh medium
every other day thereafter. Use caution not to dislodge the cells; do not pipette media directly onto the cells
but rather onto the side of the culture dish. 13. Once the hNPl cells reach 100% confluence, they can be
dissociated manually for passaging (e.g., by cell scraping or by gentle and slow pipetting up and down to detach
the cells). The cells should be maintained at a high density at all times - the recommended passaging ratio is
1:2. Subculture of hNPl Cells: 1. Once the hNPl cells reach 100% confluence, carefully remove the medium
from the 35 mm dish. 2. Apply 2 mL fully supplemented AB2 Neural Medium (pre-warmed to 37C) to the cells
so that the cells can be harvested in fresh medium. 3. Using a pipette, manually detach the cells from the dish
by slow pipetting up and down the dish. Be careful to avoid introducing any bubbles. We recommend using a
200 uL or 1000 uL manual pipette to dislodge the attached cells. Alternatively, cells can be dislodged with a
sterile cell scraper. IMPORTANT: We do NOT recommend enzymatic methods for passaging the hNPl cells.
Doing so reduces the long term viability of the cells and can cause karyotypic abnormalities. 4. Plates should be
observed to ensure that all cells have been removed. This is most easily accomplished by working under a
dissection microscope within a laminar flow hood, but can also be achieved by frequent observation under a
bright field or phase contrast microscope. 5. Transfer the dissociated cells to a 50 mL conical tube. Inspect the
plate to ensure that all the cells have been removed. 6. If necessary, count the cells and calculate the cell
concentration. Cells can be centrifuged at 200 x g for 4 minutes in order to concentrate the cell suspension for
higher plating densities. 7. Plate the cells at the desired density into the appropriately coated flasks, plates or
wells in fully supplemented AB2 Neural Medium. We recommend keeping the cells at a high cell density by
passaging 1:2. 8. Incubate the cells at 37C in a 5% C02 humidified incubator. 9. Exchange the medium with fresh
fully supplemented AB2 Neural Medium 24 hours post plating. Exchange with fresh medium every other day
thereafter. Use caution not to dislodge the cells; do not pipette media directly onto the cells but rather onto
the side of the culture dish. Plate Coating Protocol for Cell Migration Assay: 1. Thaw BD Matrigel at 2-8C
overnight. Since it will gel rapidly at 22C to 35C, keep Matrigel on ice and use pre-cooled pipettes, plates and
tubes when preparing. Gelled Matrigel may re-liquefy if placed at 2-8C on ice for 24 to 48 hours. 2. Handle using
aseptic technique in a laminar flow hood. 3. Once the Matrigel is thawed, swirl vial to be sure that material is
evenly dispersed. 4. Place thawed vial of Matrigel in sterile area, decontaminate the external surfaces with
ethanol or isopropanol and air dry. Matrigel may be gently pipetted using a pre-cooled pipette to ensure
homogeneity. 5. Dilute Matrigel 1:200 with cooled AB2 Neural Culture Medium. Prepare 1 mL diluted Matrigel
for each column (8 wells) to be used. Keep on ice. 6. Add 100 uL of diluted Matrigel to each well intended for
use in the 96 well plate. 7. Tap the plate gently to ensure the entire surface of the well is covered with diluted
Matrigel. 8. Place dishes at 2-8C for 1-3 hours. 9. Remove the residual coating solution and rinse each well twice
with 200 uL of PBS per well. 10. Remove PBS and insert the Oris Cell Seeding Stoppers into the coated wells of
the 96-well plate. 11. Visually inspect to ensure that the Oris Cell Seeding Stoppers are firmly sealed. Cell
Migration Assay Protocol: 1. Harvest cells as described in steps 1-5 of section Subculture of hNPl Neural

-------
Progenitor cells. 2. Count cells and adjust cell suspension volume to the following concentration: 600,000
cells/mL 3. Plate 100 uL of suspended cells into each stoppered well for a cell density of 60,000 cells per well. 4.
Incubate the cells at 37C in a 5% C02 humidified incubator overnight (16-24 hours) to permit cell attachment.
5. Using the Oris Stopper Tool, remove all stoppers, except for those in "no migration controls" which will remain
in place until time of staining. 6. Carefully remove the seeding media from the wells and add 200 uL medium
containing the test compound per well. 7. Briefly examine the wells by phase contrast microscopy to ensure
continued adherence of the cells. 8. Incubate the cells at 37C/5% C02 for 72 hours to permit cell migration. 9.
After 72 hours, mix 5 uL Calcein AM, 5 uL Hoechst 33342, and 10 mL phenol red-free Neurobasal medium with
0.1% BSA. 10. Carefully remove stoppers from the "no migration controls". 11. Carefully remove the test
medium from all wells and add 100 uL of diluted Calcein/Hoechst solution to each well. 12. Incubate plate at
37C/5% C02for 30- 60 minutes with the lid on and in the dark (the darkness of a standard incubator will suffice).
13. For use with a fluorescence microplate reader, attach the Oris Detection Mask and read promptly for Calcein
fluorescence (ex 494 nm/ em 517 nm). 14. For image analysis, photomicrograph wells using epifluorescence
illumination with or without the Oris Detection mask. Images can then be analyzed using either area closure
with the calcein stain or number of cells (nuclei) using the Hoechst stain. ImageJ freeware available from the
NIH (http://rsbweb.nih.gov/ij/) can be used for migration data analysis as percent area closure or cellular
enumeration

Baseline median absolute deviation for the assay (bmad): 9.289
Response cutoff threshold used to determine hit calls: 27.866
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: The ArunA migration assay measures growth and survival in human embryonic neuroprogenitor
(hNP) and human neural crest (hNC) cells by tracking the presence/absence of viable nuclei movement into a
defined circular area in each microplate well. These different measurements are assessed following 72 hour
incubations with test chemical to evaluate the potential to disrupt neural migration in developing human
embryos.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

1.2 nM
Key positive control:

cytochalasin D

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

NA

3.

Additionally, this assay was annotated to the intended target family of neurodevelopment.
Data Interpretation

-------
The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The migration of neuroprogenitor and neural crest cells into the detection zone was assessed by
comparing the percent migration of proliferative Ki-67 cells to total migrating cells following 72 hours exposure.
This was accomplished by determining the percentage of total cells migrating into the detection zone, i.e. the
migration index (Ml), compared to the percentage of migrating cells that expressed the Ki-67 proliferative
marker within the detection zone, i.e. the proliferative index (PI). Normalized response values for each assay
endpoint were calculated as resp = 100 x (rval-bval) / (pval-bval) where rval, bval, and pval correspond to the
raw value, the plate level DMSO control median, and the plate level positive/negative control median,
respectively. In the parallel viability assessment, normalized response was calculated as resp = log2(rval/bval).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 15: pval.apid.medncbyconc.min (Calculate the positive
control value (pval) as the plate-wise minimum, by assay plate ID (apid), of the medians of the corrected
values (cval) for gain-of-signal single- or multiple-concentration negative control wells (wilt = m or o) by
apid, well type, and concentration.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

-------
5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 63	Number of chemicals tested: 58

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
21

Inactive hit count: Oihitc 0.9
30

WINING MODEL SELECTION

NA hit count: hitc^O
12

Number of sample-assay endpoints with winning hill model:

6
4

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

3

22

quadratic-polynomialfpoly2) model: 5

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:

0

1

18

-------
exponentials (exp5) model:

4

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

7708

Neutral control median absolute deviation, by plate: nmad

664.946

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

8.45%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

NA

NA

-------
Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - rimed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	514

Negative control well median absolute deviation value, by plate: mmad	237.957

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-10.216

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 4.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.

5.1

Potential Regulatory Applications

Context of Use: Examples of end use scenarios could include, but are not limited to:

-------
•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7.	Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1829

Aru n A_M igration_h N C

1.	General Information

1.1	Assay Title: ArunA Biomedical's Oris Neural Crest (hNC) Cell Migration Assay

1.2	Assay Summary: ArunA_Migration_hNC is a cell-based, single-readout assay that uses human H9-derived
embryonic neural crest stem cells (hNC). Measurements were taken 72 hours after chemical dosing in a 96-well
plate. ArunA_Migration_hNC is an assay component measured from the ArunA_Migration_hNC assay. It is
designed to make measurements of cell migration, a form of distribution reporter, as detected with
fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the assay component
ArunA_Migration_hNC was analyzed at the endpoint, ArunA_Migration_hNC, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of distribution reporter,
loss-of-signal activity can be used to understand the cell migration. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the neurodevelopment intended target family, where the
subfamily is migration.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: ArunA Biomedical is a privately owned biotechnology company and Contract Research
Organization (CRO) formerly providing toxicology screening using neural stem cell-based assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This protocol describes the use of ArunA Biomedical's hNPl Neural Progenitor Cells in
conjunction with an Oris Cell Migration Assembly Kit- FLEX to measure the effect of neuroactive compounds and
biologies that modulate proliferation and migration of neural progenitor cells. Certain uses of these products
may be covered by U.S. Pat. No. 6,200,806; No. 7,531,354,B2 licensed to ARUNA and U.S. Pat. No. 7,842,499;
No. 7,018,838; No. 10/597,118; No. 11/342,413; No. 11/890,740; and No. 12/195,007 licensed to PLATYPUS.

1.9	Assay Throughput: 96-well plate. ArunA systems offer high throughput chemical screening in a 96-well format
for the human neuroprogenitor (hNP) and human neural crest (hNC) migration and cell titer endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in fluorescence intensity related to Ki-67 expression is indicative of the cell migration.

Chemical-induced perturbations to cellular key events across neurogenic outcomes, including migration
(neuroprogenitor and neural crest cells) and neural network formation (neurite length, neurite length, and
branch points for neurites), in vitro can inform on cell-based prioritization of neurodevelopmental hazard
potential.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is migration of neuroprogenitors and neural crest cells (NCCs). Impaired neuroprogenitor and NCC

-------
migration can lead to cerebral malformations and neurodevelopmental disorders, such as diencephalic-
mesencephalic dysplasia syndrome, cerebral palsy, cerebellar ataxia, and microcephaly.

2.3	Experimental System: adherent hNC cell line used. The hN2 cell line is derived from neuroepithelial cells of
WA09 hESC (Thomson et al., 1998) origin according to a previously described protocol (Shin et al., 2005, Shin et
al., 2006). Importantly, as opposed to other methods of deriving neural progenitors through three-dimensional
neurosphere and embryoid body formations (Reubinoff et al., 2001, Zhang et al., 2001), these adherent
monolayer cultures are uniformly exposed to growth factors and/or morphogens throughout their propagation.
Neurogenic lineages from human embryonic stem cell line WA09 were locked into three neural differentiation
states: neuroprogenitor (hNPl - Cat no. 7009), neural crest (hNC - Cat no. 7029), and neural network (hNN -
Cat no. hNJL7014). Prior to differentiation into hN2 cells the population was confirmed karyotypically normal,
>95% nestin positive and <3% OCT-4 positive (Shin et al., 2006). The cells were produced in bulk by propagation
for an additional 2 weeks beyond the neuroepithelial stage by removal of bFGF from the media and
cryopreserved (ArunA Biomedical, Athens, GA) for end user applications. The hNP and hNC cell endpoints
consisted of cell titer and migratory measurements whereas hNN cell endpoints consisted of neuron count and
three neurite-specific metrics to assess network formation: neurite length, neurites per neuron, and branch
points for neurites. For this study, ArunA Bio extended the differentiation period of the hNN cells by
approximately two weeks more than in the original hN2 protocol. This allowed for increased neural network cell
axonation leading to better quantitation of network-specific endpoints. The utility of dissociated hN2 cultures
as an in vitro model for neurite outgrowth was assessed using automated high-content image analysis (HCA). In
addition, the molecular phenotype of these cells was examined using immunocytochemical staining.

2.4	Metabolic Competence: H9-derived cells are locked at different neuronal developmental states of interest to
DNT investigations of chemical exposures. Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: To assess the hNP and hNC migration and cell titer endpoints, 60,000 cells per well were
plated onto Matrigel in basal growth medium with LIF and bFGF in a 96-well plate format. Plates were incubated
for 16 h at 37C followed by a 72 h exposure to chemical in the test medium. For the migration endpoints, cells
were seeded and incubated in presence of 'seeding stoppers' to prevent cell migration and growth into the
detection zone. At the beginning of chemical exposure, stoppers were removed, and growth medium was
replaced with test medium. In the case of the stopper control wells, stoppers remained in place following
replacement of growth medium with test medium. Following 72 -h exposure to the test medium, cells were
stained at 37C for 30-60 min with calcein-AM. Cell viability in the detection zone was quantitated using a
Flexstation3 microplate reader (ex494 nm/em 517 nm). Corresponding cell titer endpoints were assessed for
the hNP and hNC cells using the Promega CellTiter Aqueous One Solution Cell Proliferation Assay (Cat no. G3581;
CellTiter 96). Finally, to gain insight into the mechanisms by which cells migrate into the detection zone, Ki-67
expression was quantified for 10 additional chemicals in the hNP and hNC systems. Additionally, cytochalasin D
was used as a positive control to inhibit cell migration. Supplementing the AB2 Basal Medium: 1.
Decontaminate the external surfaces of all supplement vials and the medium bottle with ethanol or isopropanol.
2. Aseptically open each supplement vial and add the amount indicated below to the basal medium with a
pipette. To make 100 ml of complete medium: AB2 Neural Medium 96 mL, ANS Supplement 2 mL, bFGF (50
ug/mL) 40 uL, LIF (10 ug/mL) 100 uL, L-Glutamine (200 mM) 1 mL, Penicillin (5,000 U/mL)/Streptomycin (5,000
Ug/mL) 1 mL. 3. Supplemented medium should be stored at 2-8C, protected from light. The complete medium
should be given a 2 week expiration date. Dispense the complete medium into aliquots to avoid repeated
heating prior to each use. Plate Coating Protocol for hNPl Neural Progenitor Expansion: To coat dishes perform
the following steps: 1. Thaw BD Matrigel at 2-8C overnight. Matrix will gel rapidly at 22C to 35C. Keep Matrigel
on ice and use pre-cooled pipettes, plates and tubes when preparing. Gelled Matrigel may be re-liquified if
placed at 2-8C on ice for 24 to 48 hours. 2. Handle using aseptic technique in a laminar flow hood. 3. Once BD
Matrigel Matrix is thawed, swirl vial to be sure that material is evenly dispersed. 4. Place thawed vial of BD
Matrigel Matrix in sterile area, decontaminate the external surfaces with ethanol or isopropanol and air dry. BD
Matrigel Matrix may be gently pipetted using a pre-cooled pipette to ensure homogeneity. 5. Dilute Matrigel
1:200 with cooled Dulbecco's Modified Eagle's Medium. Keep on ice. 6. Add 2 mL diluted Matrigel to a 35-mm
dish. Swirl to ensure the entire surface of the 35-mm dish is covered with the Matrigel solution. 7. Place dishes

-------
at 2-8Cfor 1-3 hours. 8. Rinse thoroughly with PBS. 9. Remove PBS and use immediately. Cell Thawing Protocol
for hNPl Neural Progenitor Expansion: To plate the cells perform the following steps: 1. Do not thaw the cells
until the recommended medium and appropriately coated plasticware and/or glassware are on hand. 2. Remove
the vial from liquid nitrogen and incubate in a 37C water bath. Closely monitor until the cells are completely
thawed. Maximum cell viability is dependent on the rapid and complete thawing of frozen cells. IMPORTANT:
Do not vortex the cells. Breaking cells down to single cell suspensions will significantly increase cell death. 3. As
soon as the cells are completely thawed, disinfect the outside of the vial with 70% ethanol or isopropanol.
Proceed immediately to the next step. 4. In a laminar flow hood, use a 1 or 2 mL pipette to transfer the cells to
a sterile 15 mL conical tube. Be careful to not introduce any bubbles during the transfer process. 5. Using a 10
mL pipette, slowly add dropwise 9 mL of fully supplemented AB2 Neural Medium (pre-warmed to 37C) to the
15 mL conical tube. IMPORTANT: Do not add the whole volume of medium at once to the cells. This may result
in decreased cell viability due to osmotic shock. 6. Gently mix the cell suspension by slow pipetting up and down
twice. Be careful to not introduce any bubbles. IMPORTANT: Do not vortex the cells. Breaking cells down to
single cell suspensions will significantly increase cell death. 7. Centrifuge the tube at room temperature at 200
x g for 4 minutes to pellet the cells. 8. Aspirate as much of the supernatant as possible. Steps 4-8 are necessary
to remove residual cryopreservative (DMSO). 9. Resuspend the cells in a total volume of 2 mL of fully
supplemented AB2 Neural Medium (pre-warmed to 37C). 10. Plate the 2 mL cell suspension of hNPl cells onto
a Matrigel-coated 35 mm dish. 11. Incubate the cells at 37C in a 5% C02 humidified incubator. 12. Exchange the
medium with fresh fully supplemented AB2 Neural Medium 24 hours post plating. Exchange with fresh medium
every other day thereafter. Use caution not to dislodge the cells; do not pipette media directly onto the cells
but rather onto the side of the culture dish. 13. Once the hNPl cells reach 100% confluence, they can be
dissociated manually for passaging (e.g., by cell scraping or by gentle and slow pipetting up and down to detach
the cells). The cells should be maintained at a high density at all times - the recommended passaging ratio is
1:2. Subculture of hNPl Cells: 1. Once the hNPl cells reach 100% confluence, carefully remove the medium
from the 35 mm dish. 2. Apply 2 mL fully supplemented AB2 Neural Medium (pre-warmed to 37C) to the cells
so that the cells can be harvested in fresh medium. 3. Using a pipette, manually detach the cells from the dish
by slow pipetting up and down the dish. Be careful to avoid introducing any bubbles. We recommend using a
200 uL or 1000 uL manual pipette to dislodge the attached cells. Alternatively, cells can be dislodged with a
sterile cell scraper. IMPORTANT: We do NOT recommend enzymatic methods for passaging the hNPl cells.
Doing so reduces the long term viability of the cells and can cause karyotypic abnormalities. 4. Plates should be
observed to ensure that all cells have been removed. This is most easily accomplished by working under a
dissection microscope within a laminar flow hood, but can also be achieved by frequent observation under a
bright field or phase contrast microscope. 5. Transfer the dissociated cells to a 50 mL conical tube. Inspect the
plate to ensure that all the cells have been removed. 6. If necessary, count the cells and calculate the cell
concentration. Cells can be centrifuged at 200 x g for 4 minutes in order to concentrate the cell suspension for
higher plating densities. 7. Plate the cells at the desired density into the appropriately coated flasks, plates or
wells in fully supplemented AB2 Neural Medium. We recommend keeping the cells at a high cell density by
passaging 1:2. 8. Incubate the cells at 37C in a 5% C02 humidified incubator. 9. Exchange the medium with fresh
fully supplemented AB2 Neural Medium 24 hours post plating. Exchange with fresh medium every other day
thereafter. Use caution not to dislodge the cells; do not pipette media directly onto the cells but rather onto
the side of the culture dish. Plate Coating Protocol for Cell Migration Assay: 1. Thaw BD Matrigel at 2-8C
overnight. Since it will gel rapidly at 22C to 35C, keep Matrigel on ice and use pre-cooled pipettes, plates and
tubes when preparing. Gelled Matrigel may re-liquefy if placed at 2-8C on ice for 24 to 48 hours. 2. Handle using
aseptic technique in a laminar flow hood. 3. Once the Matrigel is thawed, swirl vial to be sure that material is
evenly dispersed. 4. Place thawed vial of Matrigel in sterile area, decontaminate the external surfaces with
ethanol or isopropanol and air dry. Matrigel may be gently pipetted using a pre-cooled pipette to ensure
homogeneity. 5. Dilute Matrigel 1:200 with cooled AB2 Neural Culture Medium. Prepare 1 mL diluted Matrigel
for each column (8 wells) to be used. Keep on ice. 6. Add 100 uL of diluted Matrigel to each well intended for
use in the 96 well plate. 7. Tap the plate gently to ensure the entire surface of the well is covered with diluted
Matrigel. 8. Place dishes at 2-8C for 1-3 hours. 9. Remove the residual coating solution and rinse each well twice
with 200 uL of PBS per well. 10. Remove PBS and insert the Oris Cell Seeding Stoppers into the coated wells of
the 96-well plate. 11. Visually inspect to ensure that the Oris Cell Seeding Stoppers are firmly sealed. Cell
Migration Assay Protocol: 1. Harvest cells as described in steps 1-5 of section Subculture of hNPl Neural

-------
Progenitor cells. 2. Count cells and adjust cell suspension volume to the following concentration: 600,000
cells/mL 3. Plate 100 uL of suspended cells into each stoppered well for a cell density of 60,000 cells per well. 4.
Incubate the cells at 37C in a 5% C02 humidified incubator overnight (16-24 hours) to permit cell attachment.
5. Using the Oris Stopper Tool, remove all stoppers, except for those in "no migration controls" which will remain
in place until time of staining. 6. Carefully remove the seeding media from the wells and add 200 uL medium
containing the test compound per well. 7. Briefly examine the wells by phase contrast microscopy to ensure
continued adherence of the cells. 8. Incubate the cells at 37C/5% C02 for 72 hours to permit cell migration. 9.
After 72 hours, mix 5 uL Calcein AM, 5 uL Hoechst 33342, and 10 mL phenol red-free Neurobasal medium with
0.1% BSA. 10. Carefully remove stoppers from the "no migration controls". 11. Carefully remove the test
medium from all wells and add 100 uL of diluted Calcein/Hoechst solution to each well. 12. Incubate plate at
37C/5% C02for 30- 60 minutes with the lid on and in the dark (the darkness of a standard incubator will suffice).
13. For use with a fluorescence microplate reader, attach the Oris Detection Mask and read promptly for Calcein
fluorescence (ex 494 nm/ em 517 nm). 14. For image analysis, photomicrograph wells using epifluorescence
illumination with or without the Oris Detection mask. Images can then be analyzed using either area closure
with the calcein stain or number of cells (nuclei) using the Hoechst stain. ImageJ freeware available from the
NIH (http://rsbweb.nih.gov/ij/) can be used for migration data analysis as percent area closure or cellular
enumeration

Baseline median absolute deviation for the assay (bmad): 10.414
Response cutoff threshold used to determine hit calls: 31.243
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: The ArunA migration assay measures growth and survival in human embryonic neuroprogenitor
(hNP) and human neural crest (hNC) cells by tracking the presence/absence of viable nuclei movement into a
defined circular area in each microplate well. These different measurements are assessed following 72 hour
incubations with test chemical to evaluate the potential to disrupt neural migration in developing human
embryos.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

1.2 nM
Key positive control:

cytochalasin D

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

NA

3.

Additionally, this assay was annotated to the intended target family of neurodevelopment.
Data Interpretation

-------
The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The migration of neuroprogenitor and neural crest cells into the detection zone was assessed by
comparing the percent migration of proliferative Ki-67 cells to total migrating cells following 72 hours exposure.
This was accomplished by determining the percentage of total cells migrating into the detection zone, i.e. the
migration index (Ml), compared to the percentage of migrating cells that expressed the Ki-67 proliferative
marker within the detection zone, i.e. the proliferative index (PI). Normalized response values for each assay
endpoint were calculated as resp = 100 x (rval-bval) / (pval-bval) where rval, bval, and pval correspond to the
raw value, the plate level DMSO control median, and the plate level positive/negative control median,
respectively. In the parallel viability assessment, normalized response was calculated as resp = log2(rval/bval).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 15: pval.apid.medncbyconc.min (Calculate the positive
control value (pval) as the plate-wise minimum, by assay plate ID (apid), of the medians of the corrected
values (cval) for gain-of-signal single- or multiple-concentration negative control wells (wilt = m or o) by
apid, well type, and concentration.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

-------
5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 63	Number of chemicals tested: 58

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
27

Inactive hit count: Oihitc 0.9
29

WINING MODEL SELECTION

NA hit count: hitc^O
7

Number of sample-assay endpoints with winning hill model:

6
4

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

5

23

quadratic-polynomialfpoly2) model: 5

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:

0

0

10

-------
exponentials (exp5) model:

10

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

4067

Neutral control median absolute deviation, by plate: nmad

280.211

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

8.49%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

NA

NA

-------
Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - rimed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	165

Negative control well median absolute deviation value, by plate: mmad	85.991

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-10.15

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 10.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.

5.1

Potential Regulatory Applications

Context of Use: Examples of end use scenarios could include, but are not limited to:

-------
•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7.	Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1831

ArunA_NOG_NucleusCount

1.	General Information

1.1	Assay Title: ArunA Biomedical's Neurite Outgrowth (NOG) Assay for Nucleus Count

1.2	Assay Summary: ArunA_NOG (Neurite Outgrowth) is a cell-based, image-based assay that uses human H9-
derived embryonic differentiated neurons (hNN). Measurements were taken 48 hours after chemical dosing in
a 96-well plate. ArunA_NOG_NucleusCount is an assay component measured from the ArunA_NOG assay. It is
designed to make measurements of viability related to the number of neurons, using a form of viability reporter,
as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the assay
component ArunA_NOG_NucleusCount was analyzed at the endpoint, ArunA_NOG_NucleusCount, in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of viability reporter, loss-of-signal activity can be used to understand viability. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the neurodevelopment intended target family,
where the subfamily is neurite outgrowth.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: ArunA Biomedical is a privately owned biotechnology company and Contract Research
Organization (CRO) formerly providing toxicology screening using neural stem cell-based assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: hN2 cells and growth media were provided through Material Transfer Agreement #466-
08 between the U.S. EPA and ArunA Biomedical, Inc.

1.9	Assay Throughput: 96-well plate. ArunA systems offer high throughput chemical screening in a 96-well format
for the human neuroprogenitor (hNP) and human neural crest (hNC) migration and cell titer endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in fluorescence intensity related to the number of Hoechst 33,258 labelled nuceli is
indicative of the viability of the system.

Chemical-induced perturbations to cellular key events across neurogenic outcomes, including migration
(neuroprogenitor and neural crest cells) and neural network formation (neurite length, neurite length, and
branch points for neurites), in vitro can inform on cell-based prioritization of neurodevelopmental hazard
potential.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is migration of neuroprogenitors and neural crest cells (NCCs). Impaired neuroprogenitor and NCC
migration can lead to cerebral malformations and neurodevelopmental disorders, such as diencephalic-
mesencephalic dysplasia syndrome, cerebral palsy, cerebellar ataxia, and microcephaly.

-------
2.3	Experimental System: adherent hNN cell line used. The hN2 cell line is derived from neuroepithelial cells of
WA09 hESC (Thomson et al., 1998) origin according to a previously described protocol (Shin et al., 2005, Shin et
al., 2006). Importantly, as opposed to other methods of deriving neural progenitors through three-dimensional
neurosphere and embryoid body formations (Reubinoff et al., 2001, Zhang et al., 2001), these adherent
monolayer cultures are uniformly exposed to growth factors and/or morphogens throughout their propagation.
Neurogenic lineages from human embryonic stem cell line WA09 were locked into three neural differentiation
states: neuroprogenitor (hNPl - Cat no. 7009), neural crest (hNC - Cat no. 7029), and neural network (hNN -
Cat no. hNJL7014). Prior to differentiation into hN2 cells the population was confirmed karyotypically normal,
>95% nestin positive and <3% OCT-4 positive (Shin et al., 2006). The cells were produced in bulk by propagation
for an additional 2 weeks beyond the neuroepithelial stage by removal of bFGF from the media and
cryopreserved (ArunA Biomedical, Athens, GA) for end user applications. The hNP and hNC cell endpoints
consisted of cell titer and migratory measurements whereas hNN cell endpoints consisted of neuron count and
three neurite-specific metrics to assess network formation: neurite length, neurites per neuron, and branch
points for neurites. For this study, ArunA Bio extended the differentiation period of the hNN cells by
approximately two weeks more than in the original hN2 protocol. This allowed for increased neural network cell
axonation leading to better quantitation of network-specific endpoints. The utility of dissociated hN2 cultures
as an in vitro model for neurite outgrowth was assessed using automated high-content image analysis (HCA). In
addition, the molecular phenotype of these cells was examined using immunocytochemical staining.

2.4	Metabolic Competence: H9-derived cells are locked at different neuronal developmental states of interest to
DNT investigations of chemical exposures. Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Chemical treatment: Differentiated hNN cells were seeded and immediately exposed to test
medium for 48 h. Following chemical exposure, cell bodies were stained with Hoechst 33,258 to quantitate
viable neuron count and neurites were labeled with blll-tubulin/DyLightl 488. High content imaging assessed
the neurite outgrowth endpoints: neurite total length per neuron (um), neurite count per neuron, and branch
points per neurite using the methods described in (Harrill et al 2010). Measurements of hN2 morphology:
(Beta)11l-TubuIin stained cell cultures were allowed to warm to room temperature. Plates were then loaded into
a Cellomics ArrayScan VTI HCS reader high-content imaging system (ThermoFisher Scientific, Waltham, MA) for
automated image acquisition and morphometric analyses. This system consists of an epifluorescent microscope
with an EXFO X-cite 120 metal-halide arc lamp, motorized imaging objectives, stage and excitation/emission
filter wheel and a 12-bit high-resolution CCD camera connected to a Dell Intel Xenon computer terminal with 2
GHz processor. Image acquisition and storage was performed using the vHCS Scan software package, version
6.6.1.4. Matched fluorescent images of Hoechst-stained nuclei and (beta)lll-tubulin/DyLight 488 immunolabeled
cells were acquired using 365/515 (channel 1) and 475/515 (channel 2) nm excitation/emission filter couplings,
respectively, with a 20x objective (Zeiss, Inc., Thornwood, NY). Fixed integration times for image acquisition in
each channel were determined by manual sampling of control-treated wells across multiple plates. A matching
pseudocolored composite image of Hoechst-stained nuclei (blue) and (beta)lM-tubulin/DyLight 488 labeled cell
bodies and neurites (green). The Neural Profiling BioApplication performs automated image analysis in a
sequential manner as follows. Briefly, nuclei were identified in channel 1 as bright objects on a dark background.
Nuclei with size and intensity values outside of the ranges determined a priori for viable cells were identified in
the channel 1 image and rejected from further analyses. Spatial coordinates from the channel 1 image were
then superimposed on the matching channel 2 image. Cell body masks in channel 2 were then cast based on
positional data from channel 1 nuclei and a set of user-defined geometric and signal intensity-based parameters.
Cell bodies corresponding to valid neurons were then selected and invalid cell bodies rejected. Parameters for
valid cell body selection include the presence of exactly one nucleus within the cell body mask, a requirement
that the nucleus met the gating criteria imposed in channel 1, a requirement that at least 25% of the nucleus
perimeter is bounded by DyLight 488 labeled cytoplasm and a requirement that the total cell body area not
exceed 4000 um2. Neurites emerging from the selected cell bodies were then individually traced and measured.
For this study, neurites were defined as processes >10 um in length. Neurites were separated from cell bodies
at points when the half-width of the labeled cytoplasm was less 3.6 um across. In the case of neurites with an
ambiguous origin (i.e. appearing to emerge from or contact multiple cell bodies) the Neural Profiling
BioApplication traced the neurite from all potential origin points and retained the longest neurite for

-------
measurements of length and number of neurites per neuron. This effectively prevented repeated sampling of
the same neurite segment within each image. Morphometric data from high-content image analysis (HCA)
included measurements of the average number of neurites per neuron and total neurite length per neuron.
Data for both endpoints were collected on cell-by-cell basis. The number of neurites and the cumulative length
of all neurites associated with each cell body (i.e. total neurite length) were calculated for each cell meeting the
selection criteria outlined above. Cell-level measurements were then averaged to obtain a mean measurement
for the average number of neurites per neuron and total neurite length per neuron for the cell populations
sampled within each well.

Baseline median absolute deviation for the assay (bmad): 0.152
Response cutoff threshold used to determine hit calls: 0.456
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: The ArunA neurite outgrowth assay monitors changes in neurite length and number of branch points
(both total number of branch points and number formed per neuron) in human neural network cells (hNN)
derived from human embryonic stem cells. These different measurements are assessed following 48 hour
incubations with test chemical to help predict the potential to disrupt neural network formation in developing
human embryos.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: High content imaging assessed the neurite outgrowth endpoints: neurite total length per neuron
(nm), neurite count per neuron, and branch points per neurite. Plate-level raw data, provided by each assay

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

1.2 nM
Key positive control:

NA

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

NA

-------
source, were received by EPA from each contractor and analyzed using the ToxCast Pipeline (tcpl). Normalized
response values for each assay endpoint were calculated as resp = log2(rval/bval) where rval, bval, and pval
correspond to the raw value, the plate level DMSO control median, and the plate level positive/negative control
median, respectively.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay
plate ID (apid), of the corrected values (cval) for neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were

-------
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 60

Active hit count: hitc>0.9
14

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1589.5

Neutral control median absolute deviation, by plate: nmad	125.28

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	10.3%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

-------
((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 7.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is

-------
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1833

ArunA_NOG_NeuriteLength

1.	General Information

1.1	Assay Title: ArunA Biomedical's Neurite Outgrowth (NOG) Assay for Neurite Length

1.2	Assay Summary: ArunA_NOG (Neurite Outgrowth) is a cell-based, image-based assay that uses human H9-
derived embryonic differentiated neurons (hNN). Measurements were taken 48 hours after chemical dosing in
a 96-well plate. ArunA_NOG_NeuriteLength is an assay component measured from the ArunA_NOG assay. It is
designed to make measurements of neurite outgrowth related to neurite length, using a form of morphology
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the
assay component ArunA_NOG_NeuriteLength was analyzed at the endpoint, ArunA_NOG_NeuriteLength, in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of morphology reporter, loss-of-signal activity can be used to understand developmental effects. To generalize
the intended target to other relatable targets, this assay endpoint is annotated to the neurodevelopment
intended target family, where the subfamily is neurite outgrowth.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: ArunA Biomedical is a privately owned biotechnology company and Contract Research
Organization (CRO) formerly providing toxicology screening using neural stem cell-based assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: hN2 cells and growth media were provided through Material Transfer Agreement #466-
08 between the U.S. EPA and ArunA Biomedical, Inc.

1.9	Assay Throughput: 96-well plate. ArunA systems offer high throughput chemical screening in a 96-well format
for the human neuroprogenitor (hNP) and human neural crest (hNC) migration and cell titer endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in fluorescence intensity related to blll-tubulin/DyLightl 488 antibody labelling is
indicative of the neurite outgrowth.

Chemical-induced perturbations to cellular key events across neurogenic outcomes, including migration
(neuroprogenitor and neural crest cells) and neural network formation (neurite length, neurite length, and
branch points for neurites), in vitro can inform on cell-based prioritization of neurodevelopmental hazard
potential.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is migration of neuroprogenitors and neural crest cells (NCCs). Impaired neuroprogenitor and NCC
migration can lead to cerebral malformations and neurodevelopmental disorders, such as diencephalic-
mesencephalic dysplasia syndrome, cerebral palsy, cerebellar ataxia, and microcephaly.

-------
2.3	Experimental System: adherent hNN cell line used. The hN2 cell line is derived from neuroepithelial cells of
WA09 hESC (Thomson et al., 1998) origin according to a previously described protocol (Shin et al., 2005, Shin et
al., 2006). Importantly, as opposed to other methods of deriving neural progenitors through three-dimensional
neurosphere and embryoid body formations (Reubinoff et al., 2001, Zhang et al., 2001), these adherent
monolayer cultures are uniformly exposed to growth factors and/or morphogens throughout their propagation.
Neurogenic lineages from human embryonic stem cell line WA09 were locked into three neural differentiation
states: neuroprogenitor (hNPl - Cat no. 7009), neural crest (hNC - Cat no. 7029), and neural network (hNN -
Cat no. hNJL7014). Prior to differentiation into hN2 cells the population was confirmed karyotypically normal,
>95% nestin positive and <3% OCT-4 positive (Shin et al., 2006). The cells were produced in bulk by propagation
for an additional 2 weeks beyond the neuroepithelial stage by removal of bFGF from the media and
cryopreserved (ArunA Biomedical, Athens, GA) for end user applications. The hNP and hNC cell endpoints
consisted of cell titer and migratory measurements whereas hNN cell endpoints consisted of neuron count and
three neurite-specific metrics to assess network formation: neurite length, neurites per neuron, and branch
points for neurites. For this study, ArunA Bio extended the differentiation period of the hNN cells by
approximately two weeks more than in the original hN2 protocol. This allowed for increased neural network cell
axonation leading to better quantitation of network-specific endpoints. The utility of dissociated hN2 cultures
as an in vitro model for neurite outgrowth was assessed using automated high-content image analysis (HCA). In
addition, the molecular phenotype of these cells was examined using immunocytochemical staining.

2.4	Metabolic Competence: H9-derived cells are locked at different neuronal developmental states of interest to
DNT investigations of chemical exposures. Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Chemical treatment: Differentiated hNN cells were seeded and immediately exposed to test
medium for 48 h. Following chemical exposure, cell bodies were stained with Hoechst 33,258 to quantitate
viable neuron count and neurites were labeled with blll-tubulin/DyLightl 488. High content imaging assessed
the neurite outgrowth endpoints: neurite total length per neuron (um), neurite count per neuron, and branch
points per neurite using the methods described in (Harrill et al 2010). Measurements of hN2 morphology:
(Beta)11l-TubuIin stained cell cultures were allowed to warm to room temperature. Plates were then loaded into
a Cellomics ArrayScan VTI HCS reader high-content imaging system (ThermoFisher Scientific, Waltham, MA) for
automated image acquisition and morphometric analyses. This system consists of an epifluorescent microscope
with an EXFO X-cite 120 metal-halide arc lamp, motorized imaging objectives, stage and excitation/emission
filter wheel and a 12-bit high-resolution CCD camera connected to a Dell Intel Xenon computer terminal with 2
GHz processor. Image acquisition and storage was performed using the vHCS Scan software package, version
6.6.1.4. Matched fluorescent images of Hoechst-stained nuclei and (beta)lll-tubulin/DyLight 488 immunolabeled
cells were acquired using 365/515 (channel 1) and 475/515 (channel 2) nm excitation/emission filter couplings,
respectively, with a 20x objective (Zeiss, Inc., Thornwood, NY). Fixed integration times for image acquisition in
each channel were determined by manual sampling of control-treated wells across multiple plates. A matching
pseudocolored composite image of Hoechst-stained nuclei (blue) and (beta)lM-tubulin/DyLight 488 labeled cell
bodies and neurites (green). The Neural Profiling BioApplication performs automated image analysis in a
sequential manner as follows. Briefly, nuclei were identified in channel 1 as bright objects on a dark background.
Nuclei with size and intensity values outside of the ranges determined a priori for viable cells were identified in
the channel 1 image and rejected from further analyses. Spatial coordinates from the channel 1 image were
then superimposed on the matching channel 2 image. Cell body masks in channel 2 were then cast based on
positional data from channel 1 nuclei and a set of user-defined geometric and signal intensity-based parameters.
Cell bodies corresponding to valid neurons were then selected and invalid cell bodies rejected. Parameters for
valid cell body selection include the presence of exactly one nucleus within the cell body mask, a requirement
that the nucleus met the gating criteria imposed in channel 1, a requirement that at least 25% of the nucleus
perimeter is bounded by DyLight 488 labeled cytoplasm and a requirement that the total cell body area not
exceed 4000 um2. Neurites emerging from the selected cell bodies were then individually traced and measured.
For this study, neurites were defined as processes >10 um in length. Neurites were separated from cell bodies
at points when the half-width of the labeled cytoplasm was less 3.6 um across. In the case of neurites with an
ambiguous origin (i.e. appearing to emerge from or contact multiple cell bodies) the Neural Profiling
BioApplication traced the neurite from all potential origin points and retained the longest neurite for

-------
measurements of length and number of neurites per neuron. This effectively prevented repeated sampling of
the same neurite segment within each image. Morphometric data from high-content image analysis (HCA)
included measurements of the average number of neurites per neuron and total neurite length per neuron.
Data for both endpoints were collected on cell-by-cell basis. The number of neurites and the cumulative length
of all neurites associated with each cell body (i.e. total neurite length) were calculated for each cell meeting the
selection criteria outlined above. Cell-level measurements were then averaged to obtain a mean measurement
for the average number of neurites per neuron and total neurite length per neuron for the cell populations
sampled within each well.

Baseline median absolute deviation for the assay (bmad): 0.175
Response cutoff threshold used to determine hit calls: 0.524
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: The ArunA neurite outgrowth assay monitors changes in neurite length and number of branch points
(both total number of branch points and number formed per neuron) in human neural network cells (hNN)
derived from human embryonic stem cells. These different measurements are assessed following 48 hour
incubations with test chemical to help predict the potential to disrupt neural network formation in developing
human embryos.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: High content imaging assessed the neurite outgrowth endpoints: neurite total length per neuron
(nm), neurite count per neuron, and branch points per neurite. Plate-level raw data, provided by each assay

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

1.2 nM
Key positive control:

NA

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

NA

-------
source, were received by EPA from each contractor and analyzed using the ToxCast Pipeline (tcpl). Normalized
response values for each assay endpoint were calculated as resp = log2(rval/bval) where rval, bval, and pval
correspond to the raw value, the plate level DMSO control median, and the plate level positive/negative control
median, respectively.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay
plate ID (apid), of the corrected values (cval) for neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were

-------
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 60

Active hit count: hitc>0.9
14

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	21.125

Neutral control median absolute deviation, by plate: nmad	3.626

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	12.47%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

-------
((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 8.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is

-------
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1835

ArunA_NOG_NeuritesPerNeuron

1.	General Information

1.1	Assay Title: ArunA Biomedical's Neurite Outgrowth (NOG) Assay for Neurites Per Neuron

1.2	Assay Summary: ArunA_NOG (Neurite Outgrowth) is a cell-based, image-based assay that uses human H9-
derived embryonic differentiated neurons (hNN). Measurements were taken 48 hours after chemical dosing in
a 96-well plate. ArunA_NOG_NeuritesPerNeuron is an assay component measured from the ArunA_NOG assay.
It is designed to make measurements of neurite outgrowth related to number of neurites per neuron, using a
form of morphology reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging
technology. Data from the assay component ArunA_NOG_NeuritesPerNeuron was analyzed at the endpoint,
ArunA_NOG_NeuritesPerNeuron, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of morphology reporter, loss-of-signal activity can be used to
understand developmental effects. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the neurodevelopment intended target family, where the subfamily is neurite
outgrowth.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: ArunA Biomedical is a privately owned biotechnology company and Contract Research
Organization (CRO) formerly providing toxicology screening using neural stem cell-based assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: hN2 cells and growth media were provided through Material Transfer Agreement #466-
08 between the U.S. EPA and ArunA Biomedical, Inc.

1.9	Assay Throughput: 96-well plate. ArunA systems offer high throughput chemical screening in a 96-well format
for the human neuroprogenitor (hNP) and human neural crest (hNC) migration and cell titer endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in fluorescence intensity related to blll-tubulin/DyLightl 488 antibody labelling is
indicative of the neurite outgrowth.

Chemical-induced perturbations to cellular key events across neurogenic outcomes, including migration
(neuroprogenitor and neural crest cells) and neural network formation (neurite length, neurite length, and
branch points for neurites), in vitro can inform on cell-based prioritization of neurodevelopmental hazard
potential.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is migration of neuroprogenitors and neural crest cells (NCCs). Impaired neuroprogenitor and NCC
migration can lead to cerebral malformations and neurodevelopmental disorders, such as diencephalic-
mesencephalic dysplasia syndrome, cerebral palsy, cerebellar ataxia, and microcephaly.

-------
2.3	Experimental System: adherent hNN cell line used. The hN2 cell line is derived from neuroepithelial cells of
WA09 hESC (Thomson et al., 1998) origin according to a previously described protocol (Shin et al., 2005, Shin et
al., 2006). Importantly, as opposed to other methods of deriving neural progenitors through three-dimensional
neurosphere and embryoid body formations (Reubinoff et al., 2001, Zhang et al., 2001), these adherent
monolayer cultures are uniformly exposed to growth factors and/or morphogens throughout their propagation.
Neurogenic lineages from human embryonic stem cell line WA09 were locked into three neural differentiation
states: neuroprogenitor (hNPl - Cat no. 7009), neural crest (hNC - Cat no. 7029), and neural network (hNN -
Cat no. hNJL7014). Prior to differentiation into hN2 cells the population was confirmed karyotypically normal,
>95% nestin positive and <3% OCT-4 positive (Shin et al., 2006). The cells were produced in bulk by propagation
for an additional 2 weeks beyond the neuroepithelial stage by removal of bFGF from the media and
cryopreserved (ArunA Biomedical, Athens, GA) for end user applications. The hNP and hNC cell endpoints
consisted of cell titer and migratory measurements whereas hNN cell endpoints consisted of neuron count and
three neurite-specific metrics to assess network formation: neurite length, neurites per neuron, and branch
points for neurites. For this study, ArunA Bio extended the differentiation period of the hNN cells by
approximately two weeks more than in the original hN2 protocol. This allowed for increased neural network cell
axonation leading to better quantitation of network-specific endpoints. The utility of dissociated hN2 cultures
as an in vitro model for neurite outgrowth was assessed using automated high-content image analysis (HCA). In
addition, the molecular phenotype of these cells was examined using immunocytochemical staining.

2.4	Metabolic Competence: H9-derived cells are locked at different neuronal developmental states of interest to
DNT investigations of chemical exposures. Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Chemical treatment: Differentiated hNN cells were seeded and immediately exposed to test
medium for 48 h. Following chemical exposure, cell bodies were stained with Hoechst 33,258 to quantitate
viable neuron count and neurites were labeled with blll-tubulin/DyLightl 488. High content imaging assessed
the neurite outgrowth endpoints: neurite total length per neuron (um), neurite count per neuron, and branch
points per neurite using the methods described in (Harrill et al 2010). Measurements of hN2 morphology:
(Beta)11l-TubuIin stained cell cultures were allowed to warm to room temperature. Plates were then loaded into
a Cellomics ArrayScan VTI HCS reader high-content imaging system (ThermoFisher Scientific, Waltham, MA) for
automated image acquisition and morphometric analyses. This system consists of an epifluorescent microscope
with an EXFO X-cite 120 metal-halide arc lamp, motorized imaging objectives, stage and excitation/emission
filter wheel and a 12-bit high-resolution CCD camera connected to a Dell Intel Xenon computer terminal with 2
GHz processor. Image acquisition and storage was performed using the vHCS Scan software package, version
6.6.1.4. Matched fluorescent images of Hoechst-stained nuclei and (beta)lll-tubulin/DyLight 488 immunolabeled
cells were acquired using 365/515 (channel 1) and 475/515 (channel 2) nm excitation/emission filter couplings,
respectively, with a 20x objective (Zeiss, Inc., Thornwood, NY). Fixed integration times for image acquisition in
each channel were determined by manual sampling of control-treated wells across multiple plates. A matching
pseudocolored composite image of Hoechst-stained nuclei (blue) and (beta)lM-tubulin/DyLight 488 labeled cell
bodies and neurites (green). The Neural Profiling BioApplication performs automated image analysis in a
sequential manner as follows. Briefly, nuclei were identified in channel 1 as bright objects on a dark background.
Nuclei with size and intensity values outside of the ranges determined a priori for viable cells were identified in
the channel 1 image and rejected from further analyses. Spatial coordinates from the channel 1 image were
then superimposed on the matching channel 2 image. Cell body masks in channel 2 were then cast based on
positional data from channel 1 nuclei and a set of user-defined geometric and signal intensity-based parameters.
Cell bodies corresponding to valid neurons were then selected and invalid cell bodies rejected. Parameters for
valid cell body selection include the presence of exactly one nucleus within the cell body mask, a requirement
that the nucleus met the gating criteria imposed in channel 1, a requirement that at least 25% of the nucleus
perimeter is bounded by DyLight 488 labeled cytoplasm and a requirement that the total cell body area not
exceed 4000 um2. Neurites emerging from the selected cell bodies were then individually traced and measured.
For this study, neurites were defined as processes >10 um in length. Neurites were separated from cell bodies
at points when the half-width of the labeled cytoplasm was less 3.6 um across. In the case of neurites with an
ambiguous origin (i.e. appearing to emerge from or contact multiple cell bodies) the Neural Profiling

-------
BioApplication traced the neurite from all potential origin points and retained the longest neurite for
measurements of length and number of neurites per neuron. This effectively prevented repeated sampling of
the same neurite segment within each image. Morphometric data from high-content image analysis (HCA)
included measurements of the average number of neurites per neuron and total neurite length per neuron.
Data for both endpoints were collected on cell-by-cell basis. The number of neurites and the cumulative length
of all neurites associated with each cell body (i.e. total neurite length) were calculated for each cell meeting the
selection criteria outlined above. Cell-level measurements were then averaged to obtain a mean measurement
for the average number of neurites per neuron and total neurite length per neuron for the cell populations
sampled within each well.

Baseline median absolute deviation for the assay (bmad): 0.138
Response cutoff threshold used to determine hit calls: 0.415
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: The ArunA neurite outgrowth assay monitors changes in neurite length and number of branch points
(both total number of branch points and number formed per neuron) in human neural network cells (hNN)
derived from human embryonic stem cells. These different measurements are assessed following 48 hour
incubations with test chemical to help predict the potential to disrupt neural network formation in developing
human embryos.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.
3.1 Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

1.2 nM
Key positive control:

NA

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

NA

-------
3.2 Data Analysis: High content imaging assessed the neurite outgrowth endpoints: neurite total length per neuron
(urn), neurite count per neuron, and branch points per neurite. Plate-level raw data, provided by each assay
source, were received by EPA from each contractor and analyzed using the ToxCast Pipeline (tcpl). Normalized
response values for each assay endpoint were calculated as resp = log2(rval/bval) where rval, bval, and pval
correspond to the raw value, the plate level DMSO control median, and the plate level positive/negative control
median, respectively.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay
plate ID (apid), of the corrected values (cval) for neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);

-------
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 60	Number of chemicals tested: 58

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	2.05

Neutral control median absolute deviation, by plate: nmad	0.215

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.3%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

-------
{(mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 5.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

-------
researcli/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1838

ArunA_NOG_BranchPointsPerNeurite

1.	General Information

1.1	Assay Title: ArunA Biomedical's Neurite Outgrowth (NOG) Assay for Branch Points Per Neurite

1.2	Assay Summary: ArunA_NOG (Neurite Outgrowth) is a cell-based, image-based assay that uses human H9-
derived embryonic differentiated neurons (hNN). Measurements were taken 48 hours after chemical dosing in
a 96-well plate. ArunA_NOG_BranchPointsPerNeurite is an assay component measured from the ArunA_NOG
assay. It is designed to make measurements of neurite outgrowth related to branch points per neurite, using a
form of morphology reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging
technology. Data from the assay component ArunA_NOG_BranchPointsPerNeurite was analyzed at the
endpoint, ArunA_NOG_BranchPointsPerNeurite, with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of morphology reporter, gain-of-signal activity can be used to
understand developmental effects. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the neurodevelopment intended target family, where the subfamily is neurite
outgrowth.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: ArunA Biomedical is a privately owned biotechnology company and Contract Research
Organization (CRO) formerly providing toxicology screening using neural stem cell-based assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: hN2 cells and growth media were provided through Material Transfer Agreement #466-
08 between the U.S. EPA and ArunA Biomedical, Inc.

1.9	Assay Throughput: 96-well plate. ArunA systems offer high throughput chemical screening in a 96-well format
for the human neuroprogenitor (hNP) and human neural crest (hNC) migration and cell titer endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in fluorescence intensity related to blll-tubulin/DyLightl 488 antibody labelling is
indicative of the neurite outgrowth.

Chemical-induced perturbations to cellular key events across neurogenic outcomes, including migration
(neuroprogenitor and neural crest cells) and neural network formation (neurite length, neurite length, and
branch points for neurites), in vitro can inform on cell-based prioritization of neurodevelopmental hazard
potential.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is migration of neuroprogenitors and neural crest cells (NCCs). Impaired neuroprogenitor and NCC
migration can lead to cerebral malformations and neurodevelopmental disorders, such as diencephalic-
mesencephalic dysplasia syndrome, cerebral palsy, cerebellar ataxia, and microcephaly.

-------
2.3	Experimental System: adherent hNN cell line used. The hN2 cell line is derived from neuroepithelial cells of
WA09 hESC (Thomson et al., 1998) origin according to a previously described protocol (Shin et al., 2005, Shin et
al., 2006). Importantly, as opposed to other methods of deriving neural progenitors through three-dimensional
neurosphere and embryoid body formations (Reubinoff et al., 2001, Zhang et al., 2001), these adherent
monolayer cultures are uniformly exposed to growth factors and/or morphogens throughout their propagation.
Neurogenic lineages from human embryonic stem cell line WA09 were locked into three neural differentiation
states: neuroprogenitor (hNPl - Cat no. 7009), neural crest (hNC - Cat no. 7029), and neural network (hNN -
Cat no. hNJL7014). Prior to differentiation into hN2 cells the population was confirmed karyotypically normal,
>95% nestin positive and <3% OCT-4 positive (Shin et al., 2006). The cells were produced in bulk by propagation
for an additional 2 weeks beyond the neuroepithelial stage by removal of bFGF from the media and
cryopreserved (ArunA Biomedical, Athens, GA) for end user applications. The hNP and hNC cell endpoints
consisted of cell titer and migratory measurements whereas hNN cell endpoints consisted of neuron count and
three neurite-specific metrics to assess network formation: neurite length, neurites per neuron, and branch
points for neurites. For this study, ArunA Bio extended the differentiation period of the hNN cells by
approximately two weeks more than in the original hN2 protocol. This allowed for increased neural network cell
axonation leading to better quantitation of network-specific endpoints. The utility of dissociated hN2 cultures
as an in vitro model for neurite outgrowth was assessed using automated high-content image analysis (HCA). In
addition, the molecular phenotype of these cells was examined using immunocytochemical staining.

2.4	Metabolic Competence: H9-derived cells are locked at different neuronal developmental states of interest to
DNT investigations of chemical exposures. Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Chemical treatment: Differentiated hNN cells were seeded and immediately exposed to test
medium for 48 h. Following chemical exposure, cell bodies were stained with Hoechst 33,258 to quantitate
viable neuron count and neurites were labeled with blll-tubulin/DyLightl 488. High content imaging assessed
the neurite outgrowth endpoints: neurite total length per neuron (um), neurite count per neuron, and branch
points per neurite using the methods described in (Harrill et al 2010). Measurements of hN2 morphology:
(Beta)11l-TubuIin stained cell cultures were allowed to warm to room temperature. Plates were then loaded into
a Cellomics ArrayScan VTI HCS reader high-content imaging system (ThermoFisher Scientific, Waltham, MA) for
automated image acquisition and morphometric analyses. This system consists of an epifluorescent microscope
with an EXFO X-cite 120 metal-halide arc lamp, motorized imaging objectives, stage and excitation/emission
filter wheel and a 12-bit high-resolution CCD camera connected to a Dell Intel Xenon computer terminal with 2
GHz processor. Image acquisition and storage was performed using the vHCS Scan software package, version
6.6.1.4. Matched fluorescent images of Hoechst-stained nuclei and (beta)lll-tubulin/DyLight 488 immunolabeled
cells were acquired using 365/515 (channel 1) and 475/515 (channel 2) nm excitation/emission filter couplings,
respectively, with a 20x objective (Zeiss, Inc., Thornwood, NY). Fixed integration times for image acquisition in
each channel were determined by manual sampling of control-treated wells across multiple plates. A matching
pseudocolored composite image of Hoechst-stained nuclei (blue) and (beta)lM-tubulin/DyLight 488 labeled cell
bodies and neurites (green). The Neural Profiling BioApplication performs automated image analysis in a
sequential manner as follows. Briefly, nuclei were identified in channel 1 as bright objects on a dark background.
Nuclei with size and intensity values outside of the ranges determined a priori for viable cells were identified in
the channel 1 image and rejected from further analyses. Spatial coordinates from the channel 1 image were
then superimposed on the matching channel 2 image. Cell body masks in channel 2 were then cast based on
positional data from channel 1 nuclei and a set of user-defined geometric and signal intensity-based parameters.
Cell bodies corresponding to valid neurons were then selected and invalid cell bodies rejected. Parameters for
valid cell body selection include the presence of exactly one nucleus within the cell body mask, a requirement
that the nucleus met the gating criteria imposed in channel 1, a requirement that at least 25% of the nucleus
perimeter is bounded by DyLight 488 labeled cytoplasm and a requirement that the total cell body area not
exceed 4000 um2. Neurites emerging from the selected cell bodies were then individually traced and measured.
For this study, neurites were defined as processes >10 um in length. Neurites were separated from cell bodies
at points when the half-width of the labeled cytoplasm was less 3.6 um across. In the case of neurites with an
ambiguous origin (i.e. appearing to emerge from or contact multiple cell bodies) the Neural Profiling

-------
BioApplication traced the neurite from all potential origin points and retained the longest neurite for
measurements of length and number of neurites per neuron. This effectively prevented repeated sampling of
the same neurite segment within each image. Morphometric data from high-content image analysis (HCA)
included measurements of the average number of neurites per neuron and total neurite length per neuron.
Data for both endpoints were collected on cell-by-cell basis. The number of neurites and the cumulative length
of all neurites associated with each cell body (i.e. total neurite length) were calculated for each cell meeting the
selection criteria outlined above. Cell-level measurements were then averaged to obtain a mean measurement
for the average number of neurites per neuron and total neurite length per neuron for the cell populations
sampled within each well.

Baseline median absolute deviation for the assay (bmad): 0.228
Response cutoff threshold used to determine hit calls: 0.683
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: The ArunA neurite outgrowth assay monitors changes in neurite length and number of branch points
(both total number of branch points and number formed per neuron) in human neural network cells (hNN)
derived from human embryonic stem cells. These different measurements are assessed following 48 hour
incubations with test chemical to help predict the potential to disrupt neural network formation in developing
human embryos.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.
3.1 Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

1.2 nM
Key positive control:

NA

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

NA

-------
3.2 Data Analysis: High content imaging assessed the neurite outgrowth endpoints: neurite total length per neuron
(urn), neurite count per neuron, and branch points per neurite. Plate-level raw data, provided by each assay
source, were received by EPA from each contractor and analyzed using the ToxCast Pipeline (tcpl). Normalized
response values for each assay endpoint were calculated as resp = log2(rval/bval) where rval, bval, and pval
correspond to the raw value, the plate level DMSO control median, and the plate level positive/negative control
median, respectively.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay
plate ID (apid), of the corrected values (cval) for neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);

-------
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 60	Number of chemicals tested: 58

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.16

Neutral control median absolute deviation, by plate: nmad	0.03

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	18.15%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

-------
{(mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

-------
researcli/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 63

ATG_Ahr_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human Aryl Hydrocarbon Receptor (Ahr)

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_Ahr_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_Ahr_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_Ahr_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene AHR. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the dna binding intended target family, where the subfamily is basic helix-loop-helix protein.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene
response element AhRE, which is responsive to the endogenous human aryl hydrocarbon receptor
[GeneSymbokAHR | GenelD:196 | Uniprot_SwissProt_Accession:P35869],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

6-formylindolo carbazole
Baseline median absolute deviation for the assay (bmad): 0.197

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Response cutoff threshold used to determine hit calls: 0.983

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
774

Inactive hit count: 0
-------
gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

412

537

1214

quadratic-polynomialfpoly2) model: 680

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

47

477

809

16

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

-------
Neutral control well median response value, by plate: nmed

0.313

Neutral control median absolute deviation, by plate: nmad	0.129

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	41.21%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 477.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointlD:64

ATG_AP_1_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human AP-1 Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_AP_1_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_AP_1_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_AP_1_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene FOS and JUN. Furthermore, this assay endpoint can be
referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves a reporter gene function. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the dna binding intended target family, where the subfamily is basic leucine zipper.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene AP-1,
which is responsive to the endogenous human FBJ murine osteosarcoma viral oncogene homolog and jun proto-
oncogene [GeneSymbol:FOS & JUN | GenelD:2353 & 3725 | Uniprot_SwissProt_Accession:P01100 & P05412],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2

Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.121

Response cutoff threshold used to determine hit calls: 0.604

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
738

3776

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

142
328

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

366

1754

quadratic-polynomialfpoly2) model: 849

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

94

20

578

331

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.451

Neutral control median absolute deviation, by plate: nmad	0.151

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	33.53%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 331.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 65

ATG_AP_2_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human AP-2 Gene Activation Assay

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_AP_2_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_AP_2_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_AP_2_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene TFAP2A and TFAP2B and TFAP2D. Furthermore, this
assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the dna binding intended target family, where the subfamily is basic
helix-turn-helix leucine zipper.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene AP-2,
which is responsive to the endogenous human transcription factor AP-2 alpha (activating enhancer binding
protein 2 alpha) and transcription factor AP-2 beta (activating enhancer binding protein 2 beta) and transcription
factor AP-2 delta (activating enhancer binding protein 2 delta) [GeneSymbol:TFAP2A & TFAP2B & TFAP2D |
GenelD:7020 & 7021 & 83741 | Uniprot_SwissProt_Accession:P05549 & Q92481 & Q7Z6R9],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

-------
Baseline median absolute deviation for the assay (bmad): 0.064

Response cutoff threshold used to determine hit calls: 0.321

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
213

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.29

Neutral control median absolute deviation, by plate: nmad	0.157

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	12.18%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 235.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 66

ATG_BRE_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human BRE Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_BRE_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_BRE_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_BRE_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene SMAD1. Furthermore, this assay endpoint can be
referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves a reporter gene function. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the dna binding intended target family, where the subfamily is Smad protein.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene
response element BRE, which is responsive to the endogenous human SMAD family member 1
[GeneSymbokSMADl | GenelD:4086 | Uniprot_SwissProt_Accession:Q15797],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.183

Response cutoff threshold used to determine hit calls: 0.916

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
456

4058

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

123
455

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

383

1487

quadratic-polynomialfpoly2) model: 775

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

56

22

876

285

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.259

Neutral control median absolute deviation, by plate: nmad	0.105

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	40.64%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 285.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 67

ATG_C_EBP_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human CCAAT/enhancer binding protein (C/EBP), beta

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_C_EBP_CIS is one of 52
assay component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_C_EBP_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_C_EBP_CIS, was
analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a
type of inducible reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the
reporter gene at the transcription factor-level as they relate to the gene CEBPB. Furthermore, this assay
endpoint can be referred to as a primary readout, because this assay has produced multiple assay endpoints
where this one serves a reporter gene function. To generalize the intended target to other relatable targets,
this assay endpoint is annotated to the dna binding intended target family, where the subfamily is basic leucine
zipper.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene C/EBP,
which is responsive to the endogenous human CCAAT/enhancer binding protein (C/EBP), beta
[GeneSymbokCEBPB | GenelD:1051 | Uniprot_SwissProt_Accession:P17676],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.1
Response cutoff threshold used to determine hit calls: 0.502

-------
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514

Number of chemicals tested: 4060

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
268

Inactive hit count: Oihitc 0.9
4246

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

136
602

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

342

1411

quadratic-polynomialfpoly2) model: 757

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

41

9

938

226

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.157

Neutral control median absolute deviation, by plate: nmad	0.274

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	23.71%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 226.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 68

ATG_CMV_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human CMV Gene Activation (Internal Marker)

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_CMV_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_CMV_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_CMV_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the background control
at the transcription factor-level as they relate to the gene . Furthermore, this assay endpoint can be referred
to as a secondary readout, because this assay has produced multiple assay endpoints where this one serves a
background control function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the background measurement intended target family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene CMV,
which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.135

Response cutoff threshold used to determine hit calls: 0.675

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
675

3839

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

145
294

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

371

1859

quadratic-polynomialfpoly2) model: 798

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

63

321

596

15

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.451

Neutral control median absolute deviation, by plate: nmad	0.138

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	30.57%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 321.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 69

ATG_CRE_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human cAMP responsive element binding protein 3

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_CRE_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_CRE_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_CRE_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene CREB3. Furthermore, this assay endpoint can be
referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves a reporter gene function. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the dna binding intended target family, where the subfamily is basic leucine zipper.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene
response element CRE, which is responsive to the endogenous human cAMP responsive element binding protein
3 [GeneSymbol:CREB3 | GenelD:10488 | Uniprot_SwissProt_Accession:043889],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

Forskolin

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.146

Response cutoff threshold used to determine hit calls: 0.73

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
391

4123

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

126
546

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

380

1450

quadratic-polynomialfpoly2) model: 688

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

61

23

925

263

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.519

Neutral control median absolute deviation, by plate: nmad	0.172

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	33.14%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 263.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 70

ATG_DR4_I_XR_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human LXRE Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_DR4_LXR_CIS is one of 52
assay component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_DR4_LXR_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_DR4_LXR_CIS,
was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using
a type of inducible reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the
reporter gene at the transcription factor-level as they relate to the gene NR1H2 and NR1H3. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene
response element LXRE, which is responsive to the endogenous human nuclear receptor subfamily 1, group H,
member 2 and nuclear receptor subfamily 1, group H, member 3 [GeneSymbol:NRlH2 & NR1H3 | GenelD:7376
& 10062 | Uniprot_SwissProt_Accession:P55055 & Q13133],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.119
Response cutoff threshold used to determine hit calls: 0.597

-------
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514

Number of chemicals tested: 4060

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
775

Inactive hit count: Oihitc 0.9
3739

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

145
332

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

363

1740

quadratic-polynomialfpoly2) model: 860

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

71

318

615

18

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.996

Neutral control median absolute deviation, by plate: nmad	0.38

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	38.11%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 318.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointlD:71

ATG_DR5_RAR_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human RARE Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_DR5_RAR_CIS is one of 52
assay component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_DR5_RAR_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_DR5_CIS, was
analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a
type of inducible reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the
reporter gene at the transcription factor-level as they relate to the gene RARA and RARB and RARG.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene
response element RARE, which is responsive to the endogenous human retinoic acid receptor, alpha and retinoic
acid receptor, beta and retinoic acid receptor, gamma [GeneSymbokRARA & RARB & RARG | GenelD:5914 &
5915 & 5916 | Uniprot_SwissProt_Accession:P10276 & P10826 & P13631],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

9-cis-Retinoic acid

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.12
Response cutoff threshold used to determine hit calls: 0.599

-------
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514

Number of chemicals tested: 4060

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
396

Inactive hit count: Oihitc 0.9
4118

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

146
503

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

336

1451

quadratic-polynomialfpoly2) model: 896

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

29

7

759

335

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.464

Neutral control median absolute deviation, by plate: nmad	0.104

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	22.37%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 335.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 72

ATG_E_Box_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human Ebox Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_E_Box_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_E_Box_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_E_Box_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene USF1. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the dna binding intended target family, where the subfamily is basic helix-loop-helix protein.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene Ebox,
which is responsive to the endogenous human upstream transcription factor 1 [GeneSymbokUSFl |
GenelD:7391 | Uniprot_SwissProt_Accession:P22415],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.095

Response cutoff threshold used to determine hit calls: 0.477

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
364

4150

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

106
357

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

300

1829

quadratic-polynomialfpoly2) model: 820

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

49

8

723

270

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.9

Neutral control median absolute deviation, by plate: nmad	0.261

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	28.99%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 270.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 73

ATG_E2F_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human E2F transcription factor 1 Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_E2F_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_E2F_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_E2F_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene E2F1. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the dna binding intended target family, where the subfamily is E2F transcription factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene E2F,
which is responsive to the endogenous human E2F transcription factor 1 [GeneSymbol:E2Fl | GenelD:1869 |
Uniprot_SwissProt_Accession:Q01094],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.097

Response cutoff threshold used to determine hit calls: 0.484

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
95

4419

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

53
322

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

213

2256

quadratic-polynomialfpoly2) model: 849

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

29

200

539

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.295

Neutral control median absolute deviation, by plate: nmad	0.03

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	10.05%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 200.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointlD:74

ATG_EGR_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human early growth response 1 (EGR1)

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_EGR_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_EGR_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_EGR_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene EGR1. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the dna binding intended target family, where the subfamily is zinc finger.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene EGR,
which is responsive to the endogenous human early growth response 1 [GeneSymbol:EGRl | GenelD:1958 |
Uniprot_SwissProt_Accession:P18146],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.14

Response cutoff threshold used to determine hit calls: 0.701

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
482

4032

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

101
331

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

366

1946

quadratic-polynomialfpoly2) model: 692

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

61

289

661

15

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.362

Neutral control median absolute deviation, by plate: nmad	0.076

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	20.89%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 289.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 75

ATG_ERE_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for Estrogen Response Element (ERE)

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_ERE_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_ERE_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_ERE_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene ESR1. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the nuclear receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene
response element ERE, which is responsive to the endogenous human estrogen receptor 1 [GeneSymbol:ESRl
| GenelD:2099 | Uniprot_SwissProt_Accession:P03372],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

17b-Estradiol

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.101

Response cutoff threshold used to determine hit calls: 0.507

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast ER Pathway Model: Estrogen receptor assays used in ToxCast ER Pathway model

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.), 18: resp.shiftneg.3bmad (Shift all
the normalized response values (resp) less than -3 multiplied by the baseline median absolute deviation
(bmad) to 0; if resp < -3*bmad, resp = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
1102

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.882

Neutral control median absolute deviation, by plate: nmad	0.211

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	23.87%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 395.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 76

ATG_Ets_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human Ets Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_Ets_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_Ets_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_Ets_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene ETS1. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the dna binding intended target family, where the subfamily is winged helix-turn-helix.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene Ets,
which is responsive to the endogenous human v-ets avian erythroblastosis virus E26 oncogene homolog 1
[GeneSymbokETSl | GenelD:2113 | Uniprot_SwissProt_Accession:P14921],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.073

Response cutoff threshold used to determine hit calls: 0.364

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
180

4334

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

66
311

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

254

2226

quadratic-polynomialfpoly2) model: 758

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

37

2

616

192

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.707

Neutral control median absolute deviation, by plate: nmad	0.07

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.86%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 192.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 77

ATG_FoxA2_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human forkhead box A2 (FOXA2)

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_FoxA2_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_FoxA2_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_FoxA2_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene FOXA2. Furthermore, this assay endpoint can be
referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves a reporter gene function. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the dna binding intended target family, where the subfamily is forkhead box protein.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene FoxA,
which is responsive to the endogenous human forkhead box A2 [GeneSymbol:FOXA2 | GenelD:3170 |
Uniprot_SwissProt_Accession:Q9Y261],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.097

Response cutoff threshold used to determine hit calls: 0.486

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
98

4416

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

70
452

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

238

1865

quadratic-polynomialfpoly2) model: 682

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

33

208

913

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.798

Neutral control median absolute deviation, by plate: nmad	0.107

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.38%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 208.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 78

ATG_FoxO_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human FoxO Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_FoxO_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_FoxO_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_FoxO_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene FOXOl and F0X03. Furthermore, this assay endpoint
can be referred to as a primary readout, because this assay has produced multiple assay endpoints where this
one serves a reporter gene function. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the dna binding intended target family, where the subfamily is forkhead box protein.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene FoxO,
which is responsive to the endogenous human forkhead box 01 and forkhead box 03 [GeneSymbokFOXOl &
F0X03 | GenelD:2308 & 2309 | Uniprot_SwissProt_Accession:Q12778 & 043524],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. The HepG2 cell line is a permanent cell culture isolated
from the liver tumor lobectomy of a 15-yr-old Caucasian male from Argentina in 1975 (Aden et al. 1979), which
has been cloned and transfected with a library of multiple reporter transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2
(Attagene, personal communication). The parental HepG2 cell line has been shown by others to retain the
potential for Phase I and Phase II metabolic responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6,
2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 (Westerink and Schoonen 2007a) with CYP1A2, CYP2C9, CYP2D6, CYP2E1
and CYP3A activities reported at levels similar to human hepatocytes although variable depending on source
and culture conditions (Hewitt and Hewitt 2004); some enzymes (e.g., CYP2W1) have even been observed at
higher rates than in primary hepatocytes (Guo et al. 2010). Phase II enzyme activities identified in HepG2 cells
include SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 (Hart et al. 2010, Walle et al. 2000,
Westerink and Schoonen 2007b) and UGTs (1A1, 1A6 and 2B7) (Hart et al. 2010). In addition, HepG2 cells can
potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme et al. 2010) and
Nrf2, a transcription factor which regulates genes containing antioxidant response element (ARE) sequences in
their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding cassette (ABC)
xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated in part by Nrf2
TF DNA-binding) (Adachi et al. 2007).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.094

Response cutoff threshold used to determine hit calls: 0.471

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
159

4355

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

45
266

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

212

2410

quadratic-polynomialfpoly2) model: 776

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

42

532

1

178

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.385

Neutral control median absolute deviation, by plate: nmad	0.031

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	8.09%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 178.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 79

ATG_GATA_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human GATA Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GATA_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_GATA_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_GATA_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene GATA1. Furthermore, this assay endpoint can be
referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves a reporter gene function. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the dna binding intended target family, where the subfamily is GATA proteins.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene GATA,
which is responsive to the endogenous human GATA binding protein 1 (globin transcription factor 1)
[GeneSymbokGATAl | GenelD:2623 | Uniprot_SwissProt_Accession:P15976],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.066

Response cutoff threshold used to determine hit calls: 0.329

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
114

4400

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

37
304

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

2436

193

quadratic-polynomialfpoly2) model: 753

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

31

549

159

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.737

Neutral control median absolute deviation, by plate: nmad	0.049

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	6.64%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 159.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 80

ATG_GU_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human GLI Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GU_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_GU_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_GU_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene GUI. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the dna binding intended target family, where the subfamily is zinc finger.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene GLI,
which is responsive to the endogenous human GLI family zinc finger 1 [GeneSymbokGLIl | GenelD:2735 |
Uniprot_SwissProt_Accession:P08151],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.077

Response cutoff threshold used to determine hit calls: 0.385

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
203

4311

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

48
307

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

2415

178

quadratic-polynomialfpoly2) model: 814

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

492

0

56

152

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.529

Neutral control median absolute deviation, by plate: nmad	0.043

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	8.13%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 152.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointlD:81

ATG_GRE_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human GRE Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GRE_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_GRE_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_GRE_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene NR3C1. Furthermore, this assay endpoint can be
referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves a reporter gene function. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the nuclear receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene
response element GRE, which is responsive to the endogenous human nuclear receptor subfamily 3, group C,
member 1 (glucocorticoid receptor) [GeneSymbol:NR3Cl | GenelD:2908 |
Uniprot_SwissProt_Accession:P04150],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

Dexamethasone

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.083
Response cutoff threshold used to determine hit calls: 0.416

-------
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514

Number of chemicals tested: 4060

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
147

Inactive hit count: Oihitc 0.9
4367

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

43
289

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

240

2388

quadratic-polynomialfpoly2) model: 781

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

45

2

501

173

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.651

Neutral control median absolute deviation, by plate: nmad	0.108

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	16.63%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 173.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 82

ATG_HIFla_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human hypoxia inducible factor 1 (HIF1A)

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_HIFla_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_HIFla_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_HIFla_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene HIF1A. Furthermore, this assay endpoint can be
referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves a reporter gene function. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the dna binding intended target family, where the subfamily is basic helix-loop-helix
protein.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene HIFla,
which is responsive to the endogenous human hypoxia inducible factor 1, alpha subunit (basic helix-loop-helix
transcription factor) [GeneSymbokHIFIA | GenelD:3091 | Uniprot_SwissProt_Accession:Q16665],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.155
Response cutoff threshold used to determine hit calls: 0.773

-------
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514

Number of chemicals tested: 4060

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
455

Inactive hit count: Oihitc 0.9
4059

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

104
339

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

301

1727

quadratic-polynomialfpoly2) model: 957

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

42

6

599

387

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.422

Neutral control median absolute deviation, by plate: nmad	0.082

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	19.32%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 387.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 83

ATG_HNF6_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human HNF6 Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_HNF6_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_HNF6_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_HNF6_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene ONECUT1. Furthermore, this assay endpoint can be
referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves a reporter gene function. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the dna binding intended target family, where the subfamily is homeobox protein.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene HNF6,
which is responsive to the endogenous human one cut homeobox 1 [GeneSymbol:ONECUTl | GenelD:3175 |
Uniprot_SwissProt_Accession:Q9UBC0],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.072

Response cutoff threshold used to determine hit calls: 0.36

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
158

4356

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

34
285

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

205

2357

quadratic-polynomialfpoly2) model: 877

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

489

50

2

163

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.721

Neutral control median absolute deviation, by plate: nmad	0.053

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	7.4%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 163.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointlD:84

ATG_HSE_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human heat shock transcription factor 1 (HSE)

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_HSE_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_HSE_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_HSE_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene HSF1. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the dna binding intended target family, where the subfamily is heat shock protein.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene HSE,
which is responsive to the endogenous human heat shock transcription factor 1 [GeneSymbokHSFl |
GenelD:3297 | Uniprot_SwissProt_Accession:Q00613],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

Geldanamycin

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.096

Response cutoff threshold used to determine hit calls: 0.478

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
513

4001

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

59
294

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

246

2055

quadratic-polynomialfpoly2) model: 881

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

218

3

594

112

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.582

Neutral control median absolute deviation, by plate: nmad	0.065

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.21%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 218.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 85

ATG_IR1_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human IR1 Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_IR1_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_IR1_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_IR1_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene NR1H4. Furthermore, this assay endpoint can be
referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves a reporter gene function. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the nuclear receptor intended target family, where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene IR1,
which is responsive to the endogenous human nuclear receptor subfamily 1, group H, member 4
[GeneSymbol:NRlH4 | GenelD:9971 | Uniprot_SwissProt_Accession:Q96Rll],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.099

Response cutoff threshold used to determine hit calls: 0.493

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
394

4120

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

100
367

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

289

1791

quadratic-polynomialfpoly2) model: 874

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

43

6

704

288

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.605

Neutral control median absolute deviation, by plate: nmad	0.17

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	28.18%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 288.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 86

ATG_ISRE_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human interferon regulatory factor 1

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_ISRE_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_ISRE_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_ISRE_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene IRF1. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the dna binding intended target family, where the subfamily is interferon regulatory factors.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene
response element ISRE, which is responsive to the endogenous human interferon regulatory factor 1
[GeneSymboklRFl | GenelD:3659 | Uniprot_SwissProt_Accession:P10914],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.142

Response cutoff threshold used to determine hit calls: 0.712

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
490

4024

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

84
309

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

320

2028

quadratic-polynomialfpoly2) model: 847

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

80

237

544

13

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.313

Neutral control median absolute deviation, by plate: nmad	0.099

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	31.74%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 237.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 87

ATG_M_06_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for M06 Gene Activation (Internal Marker)

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_M_06_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_M_06_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_M_06_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the background control
at the transcription factor-level as they relate to the gene . Furthermore, this assay endpoint can be referred
to as a secondary readout, because this assay has produced multiple assay endpoints where this one serves a
background control function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the background measurement intended target family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene M_06,
which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.024

Response cutoff threshold used to determine hit calls: 0.263

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
14

4500

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

18
339

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

200

2448

quadratic-polynomial(poly2) model: 610

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

24

676

1

146

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.047

Neutral control median absolute deviation, by plate: nmad	0.027

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	2.55%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 146.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 88

ATG_M_19_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for M19 Gene Activation (Internal Marker)

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_M_19_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_M_19_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_M_19_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the background control
at the transcription factor-level as they relate to the gene . Furthermore, this assay endpoint can be referred
to as a secondary readout, because this assay has produced multiple assay endpoints where this one serves a
background control function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the background measurement intended target family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene M_19,
which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.055

Response cutoff threshold used to determine hit calls: 0.277

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
57

4457

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

22
355

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

2484

187

quadratic-polynomialfpoly2) model: 585

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

32

650

1

146

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.906

Neutral control median absolute deviation, by plate: nmad	0.052

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.73%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 146.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 89

ATG_M_32_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for M32 Gene Activation (Internal Marker)

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_M_32_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_M_32_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_M_32_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the background control
at the transcription factor-level as they relate to the gene . Furthermore, this assay endpoint can be referred
to as a secondary readout, because this assay has produced multiple assay endpoints where this one serves a
background control function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the background measurement intended target family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene M_32,
which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.053

Response cutoff threshold used to determine hit calls: 0.265

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
70

4444

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

58
428

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

2226

189

quadratic-polynomialfpoly2) model: 583

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

803

3

17

155

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	3.87

Neutral control median absolute deviation, by plate: nmad	0.213

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.52%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 155.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 90

ATG_M_61_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for M61 Gene Activation (Internal Marker)

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_M_61_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_M_61_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_M_61_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the background control
at the transcription factor-level as they relate to the gene . Furthermore, this assay endpoint can be referred
to as a secondary readout, because this assay has produced multiple assay endpoints where this one serves a
background control function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the background measurement intended target family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene M_61,
which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.024

Response cutoff threshold used to determine hit calls: 0.263

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
14

4500

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

18
340

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

200

2440

quadratic-polynomial(poly2) model: 610

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

24

680

1

149

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.047

Neutral control median absolute deviation, by plate: nmad	0.027

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	2.55%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 149.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointlD:91

ATG_MRE_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human MRE Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_MRE_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_MRE_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_MRE_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene MTF1. Furthermore, this assay endpoint can be
referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves a reporter gene function. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the dna binding intended target family, where the subfamily is zinc finger.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene
response element MRE, which is responsive to the endogenous human metal-regulatory transcription factor 1
[GeneSymbokMTFl | GenelD:4520 | Uniprot_SwissProt_Accession:Q14872],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.121

Response cutoff threshold used to determine hit calls: 0.604

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
793

3721

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

128
299

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

371

1850

quadratic-polynomialfpoly2) model: 707

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

44

293

627

143

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.533

Neutral control median absolute deviation, by plate: nmad	0.104

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	19.47%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 293.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 92

ATG_Myb_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human Myb Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_Myb_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_Myb_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_Myb_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene MYB. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the dna binding intended target family, where the subfamily is MYB proteins.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene Myb,
which is responsive to the endogenous human v-myb avian myeloblastosis viral oncogene homolog
[GeneSymbokMYB | GenelD:4602 | Uniprot_SwissProt_Accession:P10242],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2

Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.059

Response cutoff threshold used to determine hit calls: 0.296

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
126

4388

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

29
285

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

2590

165

quadratic-polynomialfpoly2) model: 736

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

471

32

2

152

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.898

Neutral control median absolute deviation, by plate: nmad	0.049

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.45%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 152.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 93

ATG_Myc_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human Myc Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_Myc_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_Myc_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_Myc_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene MYC. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the dna binding intended target family, where the subfamily is basic helix-loop-helix leucine zipper.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene Myc,
which is responsive to the endogenous human v-myc avian myelocytomatosis viral oncogene homolog
[GeneSymbokMYC | GenelD:4609 | Uniprot_SwissProt_Accession:P01106],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.098

Response cutoff threshold used to determine hit calls: 0.489

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
195

4319

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

54
330

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

270

2111

quadratic-polynomialfpoly2) model: 808

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

39

3

642

205

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.005

Neutral control median absolute deviation, by plate: nmad	0.169

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	16.82%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 205.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointlD:94

ATG_NF_kB_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human NF-kB Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_NF_kB_CIS is one of 52
assay component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_NF_kB_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_NF_kB_CIS, was
analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a
type of inducible reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the
reporter gene at the transcription factor-level as they relate to the gene NFKB1. Furthermore, this assay
endpoint can be referred to as a primary readout, because this assay has produced multiple assay endpoints
where this one serves a reporter gene function. To generalize the intended target to other relatable targets,
this assay endpoint is annotated to the dna binding intended target family, where the subfamily is NF-kappa B.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene NF-kB,
which is responsive to the endogenous human nuclear factor of kappa light polypeptide gene enhancer in B-
cells 1 [GeneSymbol:NFKBl | GenelD:4790 | Uniprot_SwissProt_Accession:P19838],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.128

Response cutoff threshold used to determine hit calls: 0.642

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
354

4160

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

131
377

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

350

1666

quadratic-polynomialfpoly2) model: 845

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

51

9

744

289

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.522

Neutral control median absolute deviation, by plate: nmad	0.166

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	31.81%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 289.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 95

ATG_NFI_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human NFI Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_NFI_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_NFI_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_NFI_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene NFIA. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the dna binding intended target family, where the subfamily is nuclear factor I.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene NFI,
which is responsive to the endogenous human nuclear factor l/A [GeneSymbol:NFIA | GenelD:4774 |
Uniprot_SwissProt_Accession:Q12857],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.085

Response cutoff threshold used to determine hit calls: 0.425

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
283

4231

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

61
323

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

294

1993

quadratic-polynomialfpoly2) model: 867

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

4

60

634

226

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.872

Neutral control median absolute deviation, by plate: nmad	0.142

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	16.32%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 226.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 96

ATG_NRF1_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human NRF1 Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_NRF1_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_NRF1_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_NRF1_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene NRF1. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the dna binding intended target family, where the subfamily is nuclear respiratory factors.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene NRF1,
which is responsive to the endogenous human nuclear respiratory factor 1 [GeneSymbokNRFl | GenelD:4899 |
Uniprot_SwissProt_Accession:Q16656],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.081

Response cutoff threshold used to determine hit calls: 0.403

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
198

4316

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

65
351

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

245

2082

quadratic-polynomialfpoly2) model: 816

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

47

3

653

200

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.753

Neutral control median absolute deviation, by plate: nmad	0.098

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	12.99%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 200.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 97

ATG_N RF2_ARE_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human ARE Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_NRF2_ARE_CIS is one of 52
assay component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_NRF2_ARE_CIS was analyzed into 1 assay endpoint.	This assay endpoint,
ATG_NRF2_ARE_CIS, was analyzed with bidirectional fitting relative to DMSO as the negative control and
baseline of activity. Using a type of inducible reporter, measures of mRNA for gain or loss-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene NFE2L2.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the dna binding intended target family, where the
subfamily is basic leucine zipper.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene
response element ARE, which is responsive to the endogenous human nuclear factor, erythroid 2-like 2
[GeneSymbol:NFE2L2 | GenelD:4780 | Uniprot_SwissProt_Accession:Q16236],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.156
Response cutoff threshold used to determine hit calls: 0.78

-------
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514

Number of chemicals tested: 4060

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1559

Inactive hit count: Oihitc 0.9
2955

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

267
259

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

480

1323

quadratic-polynomialfpoly2) model: 879

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

64

51

629

510

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.254

Neutral control median absolute deviation, by plate: nmad	0.073

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	28.6%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 510.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 98

ATG_Oct_M LP_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human Oct Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_Oct_MLP_CIS is one of 52
assay component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_Oct_MLP_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_Oct_MLP_CIS,
was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using
a type of inducible reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the
reporter gene at the transcription factor-level as they relate to the gene POU2F1. Furthermore, this assay
endpoint can be referred to as a primary readout, because this assay has produced multiple assay endpoints
where this one serves a reporter gene function. To generalize the intended target to other relatable targets,
this assay endpoint is annotated to the dna binding intended target family, where the subfamily is POU domain
protein.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene Oct,
which is responsive to the endogenous human POU class 2 homeobox 1 [GeneSymbol:POU2Fl | GenelD:5451
| Uniprot_SwissProt_Accession:P14859],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.136
Response cutoff threshold used to determine hit calls: 0.68

-------
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514

Number of chemicals tested: 4060

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
594

Inactive hit count: Oihitc 0.9
3920

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

122
429

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

419

1485

quadratic-polynomialfpoly2) model: 784

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

95

27

788

313

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.711

Neutral control median absolute deviation, by plate: nmad	0.325

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	45.67%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 313.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 99

ATG_p53_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human p53 Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_p53_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_p53_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_p53_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene TP53. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the dna binding intended target family, where the subfamily is tumor suppressor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene p53,
which is responsive to the endogenous human tumor protein p53 [GeneSymbol:TP53 | GenelD:7157 |
Uniprot_SwissProt_Accession:P04637],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2

Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.128

Response cutoff threshold used to determine hit calls: 0.641

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
403

4111

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

105
472

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

321

1613

quadratic-polynomialfpoly2) model: 954

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

71

5

663

258

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.429

Neutral control median absolute deviation, by plate: nmad	0.154

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	35.94%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 258.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 100

ATG_Pax6_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human paired box 6 (PAX6)

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_Pax6_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_Pax6_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_Pax6_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene PAX6. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the dna binding intended target family, where the subfamily is paired box protein.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene Pax,
which is responsive to the endogenous human paired box 6 [GeneSymbol:PAX6 | GenelD:5080 |
Uniprot_SwissProt_Accession:P26367],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.128

Response cutoff threshold used to determine hit calls: 0.64

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
443

4071

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

105
356

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

336

2053

quadratic-polynomialfpoly2) model: 628

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

81

242

650

11

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.482

Neutral control median absolute deviation, by plate: nmad	0.096

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	19.99%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 242.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 101

ATG_PBREM_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human PBREM Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_PBREM_CIS is one of 52
assay component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_PBREM_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_PBREM_CIS, was
analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a
type of inducible reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the
reporter gene at the transcription factor-level as they relate to the gene NR1I3. Furthermore, this assay
endpoint can be referred to as a primary readout, because this assay has produced multiple assay endpoints
where this one serves a reporter gene function. To generalize the intended target to other relatable targets,
this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is non-
steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene PBREM,
which is responsive to the endogenous human nuclear receptor subfamily 1, group I, member 3
[GeneSymbol:NRll3 | GenelD:9970 | Uniprot_SwissProt_Accession:Q14994],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.096
Response cutoff threshold used to determine hit calls: 0.481

-------
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514

Number of chemicals tested: 4060

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
353

Inactive hit count: Oihitc 0.9
4161

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

133
457

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

341

1569

quadratic-polynomialfpoly2) model: 813

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

34

9

805

301

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.51

Neutral control median absolute deviation, by plate: nmad	0.071

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.95%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 301.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 102

ATG_PPRE_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human Peroxisome Proliferator-activated Response
Element

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_PPRE_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_PPRE_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_PPRE_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene PPARA and PPARD and PPARG. Furthermore, this assay
endpoint can be referred to as a primary readout, because this assay has produced multiple assay endpoints
where this one serves a reporter gene function. To generalize the intended target to other relatable targets,
this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is non-
steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene
response element PPRE, which is responsive to the endogenous human peroxisome proliferator-activated
receptor alpha and peroxisome proliferator-activated receptor delta and peroxisome proliferator-activated
receptor gamma [GeneSymbol:PPARA & PPARD & PPARG | GenelD:5465 & 5467 & 5468 |
Uniprot_SwissProt_Accession:Q07869 & Q03181 & P37231],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

Rosiglitazone

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

-------
Baseline median absolute deviation for the assay (bmad): 0.18

Response cutoff threshold used to determine hit calls: 0.9

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
613

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.24

Neutral control median absolute deviation, by plate: nmad	0.676

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	54.52%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 291.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 103

ATG_PXRE_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human PXRE Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_PXRE_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_PXRE_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_PXRE_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene NR1I2. Furthermore, this assay endpoint can be
referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves a reporter gene function. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the nuclear receptor intended target family, where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene
response element PXRE, which is responsive to the endogenous human nuclear receptor subfamily 1, group I,
member 2 [GeneSymbol:NRll2 | GenelD:8856 | Uniprot_SwissProt_Accession:075469],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

Rifampicin

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.135

Response cutoff threshold used to determine hit calls: 0.675

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
2349

2165

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

324
645

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

281

896

quadratic-polynomialfpoly2) model:	1185

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

49

476

594

12

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.449

Neutral control median absolute deviation, by plate: nmad	0.15

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	33.35%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 594.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 104

ATG_RORE_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human RORE Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_RORE_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_RORE_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_RORE_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene RORA and RORB and RORC. Furthermore, this assay
endpoint can be referred to as a primary readout, because this assay has produced multiple assay endpoints
where this one serves a reporter gene function. To generalize the intended target to other relatable targets,
this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene
response element RORE, which is responsive to the endogenous human RAR-related orphan receptor A and
RAR-related orphan receptor B and RAR-related orphan receptor C [GeneSymbokRORA & RORB & RORC |
GenelD:6095 & 6096 & 6097 | Uniprot_SwissProt_Accession:P35398 & Q92753 & P51449],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.138
Response cutoff threshold used to determine hit calls: 0.688

-------
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514

Number of chemicals tested: 4060

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
363

Inactive hit count: Oihitc 0.9
4151

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

123
395

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

390

1717

quadratic-polynomialfpoly2) model: 687

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

36

8

814

292

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.358

Neutral control median absolute deviation, by plate: nmad	0.087

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	24.43%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 292.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 105

ATG_Sox_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human SOX Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_Sox_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_Sox_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_Sox_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene SOX1. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the dna binding intended target family, where the subfamily is HMG box protein.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene SOX,
which is responsive to the endogenous human SRY (sex determining region Y)-box 1 [GeneSymbokSOXl |
GenelD:6656 | Uniprot_SwissProt_Accession:000570],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.088

Response cutoff threshold used to determine hit calls: 0.441

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
218

4296

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

60
327

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

219

2258

quadratic-polynomialfpoly2) model: 815

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

66

3

526

188

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.765

Neutral control median absolute deviation, by plate: nmad	0.076

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.88%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 188.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 106

ATG_Spl_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human Spl Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_Spl_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_Spl_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_Spl_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene SP1. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the dna binding intended target family, where the subfamily is zinc finger.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene Spl,
which is responsive to the endogenous human Spl transcription factor [GeneSymbol:SPl | GenelD:6667 |
Uniprot_SwissProt_Accession:P08047],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2

Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.106

Response cutoff threshold used to determine hit calls: 0.529

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
357

4157

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

80
307

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

265

2081

quadratic-polynomialfpoly2) model: 853

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

60

234

564

18

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.571

Neutral control median absolute deviation, by plate: nmad	0.119

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	20.77%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 234.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 107

ATG_SREBP_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human SREBP Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_SREBP_CIS is one of 52
assay component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_SREBP_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_SREBP_CIS, was
analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a
type of inducible reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the
reporter gene at the transcription factor-level as they relate to the gene SREBF1. Furthermore, this assay
endpoint can be referred to as a primary readout, because this assay has produced multiple assay endpoints
where this one serves a reporter gene function. To generalize the intended target to other relatable targets,
this assay endpoint is annotated to the dna binding intended target family, where the subfamily is basic helix-
loop-helix leucine zipper.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene SREBP,
which is responsive to the endogenous human sterol regulatory element binding transcription factor 1
[GeneSymbokSREBFl | GenelD:6720 | Uniprot_SwissProt_Accession:P36956],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.073
Response cutoff threshold used to determine hit calls: 0.365

-------
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514

Number of chemicals tested: 4060

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
523

Inactive hit count: Oihitc 0.9
3991

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

100
348

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

370

1845

quadratic-polynomial(poly2) model: 761

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

51

9

700

278

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.988

Neutral control median absolute deviation, by plate: nmad	0.216

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	10.89%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 278.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 108

ATG_STAT3_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human signal transducer and activator of transcription 3
(STAT3)

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_STAT3_CIS is one of 52
assay component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_STAT3_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_STAT3_CIS, was
analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a
type of inducible reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the
reporter gene at the transcription factor-level as they relate to the gene STAT3. Furthermore, this assay
endpoint can be referred to as a primary readout, because this assay has produced multiple assay endpoints
where this one serves a reporter gene function. To generalize the intended target to other relatable targets,
this assay endpoint is annotated to the dna binding intended target family, where the subfamily is stat protein.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene STAT,
which is responsive to the endogenous human signal transducer and activator of transcription 3 (acute-phase
response factor) [GeneSymbol:STAT3 | GenelD:6774 | Uniprot_SwissProt_Accession:P40763],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.088
Response cutoff threshold used to determine hit calls: 0.439

-------
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514

Number of chemicals tested: 4060

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
84

Inactive hit count: Oihitc 0.9
4430

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

31
332

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

2430

179

quadratic-polynomialfpoly2) model: 765

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

33

530

1

161

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.612

Neutral control median absolute deviation, by plate: nmad	0.043

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	7.03%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 161.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 109

ATG_TA_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human TA Gene Activation (Basal Promoter)

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_TA_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_TA_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_TA_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the background control
at the transcription factor-level as they relate to the gene . Furthermore, this assay endpoint can be referred
to as a secondary readout, because this assay has produced multiple assay endpoints where this one serves a
background control function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the background measurement intended target family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene TA,
which is used as a basal promoter.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.152

Response cutoff threshold used to determine hit calls: 0.759

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
207

4307

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

83
379

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

231

2155

quadratic-polynomialfpoly2) model: 694

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

34

6

660

220

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.286

Neutral control median absolute deviation, by plate: nmad	0.064

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	22.29%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 220.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 110

ATG_TAL_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human TAL Gene Activation (Basal Promoter)

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_TAL_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_TAL_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_TAL_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the background control
at the transcription factor-level as they relate to the gene . Furthermore, this assay endpoint can be referred
to as a secondary readout, because this assay has produced multiple assay endpoints where this one serves a
background control function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the background measurement intended target family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene TAL,
which is used as a basal promoter.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.137

Response cutoff threshold used to determine hit calls: 0.687

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
137

4377

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

45
309

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

206

2289

quadratic-poly nomialfpoly 2) model: 811

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

4

35

554

209

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.213

Neutral control median absolute deviation, by plate: nmad	0.042

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	19.49%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 209.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay EndpointID: 111

ATG_TCF_b_cat_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human TCF/b-cat Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_TCF_b_cat_CIS is one of 52
assay component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_TCF_b_cat_CIS was analyzed into 1 assay endpoint.	This assay endpoint,
ATG_TCF_b_cat_CIS, was analyzed with bidirectional fitting relative to DMSO as the negative control and
baseline of activity. Using a type of inducible reporter, measures of mRNA for gain or loss-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene TCF7
and TCF7L2 and LEF1 and TCF7L1. Furthermore, this assay endpoint can be referred to as a primary readout,
because this assay has produced multiple assay endpoints where this one serves a reporter gene function. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the dna binding
intended target family, where the subfamily is HMG box protein.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene TCF/b-
cat, which is responsive to the endogenous human transcription factor 7 (T-cell specific, HMG-box) and
transcription factor 7-like 2 (T-cell specific, HMG-box) and lymphoid enhancer-binding factor 1 and transcription
factor 7-like 1 (T-cell specific, HMG-box) [GeneSymbol:TCF7 & TCF7L2 & LEF1 & TCF7L1 | GenelD:6932 & 6934
& 51176 & 83439 | Uniprot_SwissProt_Accession:P36402 & Q9NQB0 & Q9UJU2 & Q9HCS4],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

-------
Baseline median absolute deviation for the assay (bmad): 0.139

Response cutoff threshold used to determine hit calls: 0.695

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
492

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.422

Neutral control median absolute deviation, by plate: nmad	0.248

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	58.67%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 257.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 112

ATG_TGFb_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human transforming growth factor (TGF)

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_TGFb_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_TGFb_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_TGFb_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene TGFB1. Furthermore, this assay endpoint can be
referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves a reporter gene function. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the growth factor intended target family, where the subfamily is transforming growth
factor beta.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene TGF,
which is responsive to the endogenous human transforming growth factor, beta 1 [GeneSymbokTGFBl |
GenelD:7040 | Uniprot_SwissProt_Accession:P01137],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.156
Response cutoff threshold used to determine hit calls: 0.781

-------
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of growth factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514

Number of chemicals tested: 4060

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
213

Inactive hit count: Oihitc 0.9
4301

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

90
484

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

263

1828

quadratic-polynomialfpoly2) model: 749

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

42

2

788

216

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.255

Neutral control median absolute deviation, by plate: nmad	0.058

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	22.68%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 216.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 113

ATG_VDRE_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human VDRE Gene Activation

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_VDRE_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_VDRE_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_VDRE_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene VDR. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the nuclear receptor intended target family, where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene
response element VDRE, which is responsive to the endogenous human vitamin D (1,25- dihydroxyvitamin D3)
receptor [GeneSymbokVDR | GenelD:7421 | Uniprot_SwissProt_Accession:P11473],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.146

Response cutoff threshold used to determine hit calls: 0.73

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
996

3518

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

239
388

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

397

1393

quadratic-polynomialfpoly2) model: 767

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

47

408

20

803

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.533

Neutral control median absolute deviation, by plate: nmad	0.122

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	22.81%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 408.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 114

ATG_Xbpl_CIS

1.	General Information

1.1	Assay Title: Attagene CIS-FACTORIAL HepG2 Assay for human X-box binding protein 1 (XBP1)

1.2	Assay Summary: ATG_CIS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_Xbpl_CIS is one of 52 assay
component(s) measured or calculated from the ATG_CIS assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay component
ATG_Xbpl_CIS was analyzed into 1 assay endpoint. This assay endpoint, ATG_Xbpl_CIS, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of mRNA for gain or loss-of-signal activity can be used to understand the reporter gene at
the transcription factor-level as they relate to the gene XBP1. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the dna binding intended target family, where the subfamily is basic leucine zipper.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the cis-acting reporter gene Xbpl,
which is responsive to the endogenous human X-box binding protein 1 [GeneSymbol:XBPl | GenelD:7494 |
Uniprot_SwissProt_Accession:P17861],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2

Scientific Principles: Within the CIS-FACTORIAL version, RTU transcription is controlled by a cis-regulating
element (promoter). The specificity of a RTU is determined by the presence of a TF binding site in the promoter.

-------
Importantly, as all members of TF family recognize the same binding sequence, CIS-FACTORIAL evaluates
activities of TF families.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

300 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.118

Response cutoff threshold used to determine hit calls: 0.591

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514	Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
492

4022

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

104
408

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

381

1653

quadratic-polynomialfpoly2) model: 111

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

82

282

762

13

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.715

Neutral control median absolute deviation, by plate: nmad	0.208

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	29.03%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 282.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 115

ATG_AR_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human androgen receptor (AR)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_AR_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_AR_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_AR_TRANS, was
analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene AR. Furthermore, this
assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-AR, also known as human androgen receptor
[GeneSymbokAR | GenelD:367 | Uniprot_SwissProt_Accession:P10275],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

6a-Fluorotestosterone

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.171

-------
Response cutoff threshold used to determine hit calls: 0.857

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast AR Pathway Model: Androgen receptor assays used in ToxCast AR Pathway model. See
10.1016/j.yrtph.2020.104764 and 10.1021/acs.chemrestox.6b00347

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
54

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.159

Neutral control median absolute deviation, by plate: nmad	0.268

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	23.15%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 163.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 116

ATG_CAR_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human CAR Gene Activation

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_CAR_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_CAR_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_CAR_TRANS, was
analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene NR1I3. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-CAR, also known as human nuclear receptor subfamily
1, group I, member 3 [GeneSymbol:NRll3 | GenelD:9970 | Uniprot_SwissProt_Accession:Q14994],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.203

-------
Response cutoff threshold used to determine hit calls: 1.013

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
19

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	3.252

Neutral control median absolute deviation, by plate: nmad	1.221

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	37.55%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 214.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 117

ATG_ERa_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human estrogen receptor, alpha (ERa)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_ERa_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_ERa_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_ERa_TRANS, was
analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene ESR1. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-ERa, also known as human estrogen receptor 1
[GeneSymbokESRl | GenelD:2099 | Uniprot_SwissProt_Accession:P03372],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

17b-Estradiol

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.221

-------
Response cutoff threshold used to determine hit calls: 1.107

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast ER Pathway Model: Estrogen receptor assays used in ToxCast ER Pathway model

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.), 18: resp.shiftneg.3bmad (Shift all
the normalized response values (resp) less than -3 multiplied by the baseline median absolute deviation
(bmad) to 0; if resp < -3*bmad, resp = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
965

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.743

Neutral control median absolute deviation, by plate: nmad	0.555

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	31.86%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 434.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 118

ATG_ERRa_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human estrogen-related receptor, alpha (ERRa)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_ERRa_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_ERRa_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_ERRa_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene ESRRA. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-ERRa, also known as human estrogen-related receptor
alpha [GeneSymbol:ESRRA | GenelD:2101 | Uniprot_SwissProt_Accession:P11474],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.234

-------
Response cutoff threshold used to determine hit calls: 1.172

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
6

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.424

Neutral control median absolute deviation, by plate: nmad	0.407

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	28.59%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 135.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 119

ATG_E R Rg_TRAN S

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human estrogen-related receptor, gamma (ERRg)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_ERRg_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_ERRg_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_ERRg_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene ESRRG. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-ERRg, also known as human estrogen-related receptor
gamma [GeneSymbokESRRG | GenelD:2104 | Uniprot_SwissProt_Accession:P62508],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.272

-------
Response cutoff threshold used to determine hit calls: 1.358

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
11

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	3.067

Neutral control median absolute deviation, by plate: nmad	1.462

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	47.66%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 168.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 120

ATG_FXR_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for FXR Gene Activation

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_FXR_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_FXR_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_FXR_TRANS, was
analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene NR1H4. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-FXR, also known as human nuclear receptor subfamily
1, group H, member 4 [GeneSymbol:NRlH4 | GenelD:9971 | Uniprot_SwissProt_Accession:Q96Rll],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

CDCA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.166

-------
Response cutoff threshold used to determine hit calls: 0.828

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
98

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.224

Neutral control median absolute deviation, by plate: nmad	0.293

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	23.93%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 176.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 121

ATG_GAL4_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for GAL4 Gene Activation (Internal Marker)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GAL4_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_GAL4_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_GAL4_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the background control at the transcription factor-level as they relate to the gene . Furthermore,
this assay endpoint can be referred to as a secondary readout, because this assay has produced multiple assay
endpoints where this one serves a background control function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the background measurement intended target family,
where the subfamily is baseline control.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-gal4, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a

-------
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.156

Response cutoff threshold used to determine hit calls: 0.778

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

-------
Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4478

Number of chemicals tested: 3827

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
3

Inactive hit count: Oihitc 0.9
2167

WINING MODEL SELECTION

NA hit count: hitc^O
2308

Number of sample-assay endpoints with winning hill model:

43
318

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

2556

159

quadratic-polynomialfpoly2) model: 640

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

488

20

3

199

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.713

Neutral control median absolute deviation, by plate: nmad	0.489

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	28.56%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 199.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 122

ATG_GR_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human glucocorticoid receptor (GR)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GR_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_GR_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_GR_TRANS, was
analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene NR3C1. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-GR, also known as human nuclear receptor subfamily 3,
group C, member 1 (glucocorticoid receptor) [GeneSymbol:NR3Cl | GenelD:2908 |
Uniprot_SwissProt_Accession:P04150],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.162

-------
Response cutoff threshold used to determine hit calls: 0.811

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
82

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.552

Neutral control median absolute deviation, by plate: nmad	0.367

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	23.64%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 149.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 123

ATG_H N F4a_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human hepatocyte nuclear factor 4, alpha (HNF4a)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_HNF4a_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_HNF4a_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_HNF4a_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene HNF4A. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-HNF4a, also known as human hepatocyte nuclear factor
4, alpha [GeneSymbol:HNF4A | GenelD:3172 | Uniprot_SwissProt_Accession:P41235],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.283

-------
Response cutoff threshold used to determine hit calls: 1.417

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
30

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	2.873

Neutral control median absolute deviation, by plate: nmad	1.353

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	47.08%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 180.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 124

ATG_Hpa5_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human Hpa5 Gene Activation (Basal Promoter)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_Hpa5_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_Hpa5_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_Hpa5_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the background control at the transcription factor-level as they relate to the gene . Furthermore,
this assay endpoint can be referred to as a secondary readout, because this assay has produced multiple assay
endpoints where this one serves a background control function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the background measurement intended target family,
where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-Hpa5, which is used as a basal promoter.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a

-------
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.129 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.167

Response cutoff threshold used to determine hit calls: 0.834

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

-------
Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 320

Number of chemicals tested: 310

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: Oihitc 0.9
187

WINING MODEL SELECTION

NA hit count: hitc^O
133

Number of sample-assay endpoints with winning hill model:

1

26

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

4

188

quadratic-polynomialfpoly2) model: 57

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

33

1

10

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 10.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 125

ATG_l_XRa_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human LXRa Gene Activation

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_LXRa_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_LXRa_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_LXRa_TRANS, was
analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene NR1H3. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-LXRa, also known as human nuclear receptor subfamily
1, group H, member 3 [GeneSymbol:NRlH3 | GenelD:10062 | Uniprot_SwissProt_Accession:Q13133],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

T0901317

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.229

-------
Response cutoff threshold used to determine hit calls: 1.144

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
57

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.497

Neutral control median absolute deviation, by plate: nmad	0.373

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	24.91%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 210.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 126

ATG_LXRb_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human LXRb Gene Activation

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_LXRb_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_LXRb_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_LXRb_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene NR1H2. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-LXRb, also known as human nuclear receptor subfamily
1, group H, member 2 [GeneSymbol:NRlH2 | GenelD:7376 | Uniprot_SwissProt_Accession:P55055],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

T0901317

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.199

-------
Response cutoff threshold used to determine hit calls: 0.995

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
46

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.988

Neutral control median absolute deviation, by plate: nmad	0.282

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	28.53%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 229.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 127

ATG_M_06_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for M06 Gene Activation (Internal Marker)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_M_06_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_M_06_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_M_06_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the background control at the transcription factor-level as they relate to the gene . Furthermore,
this assay endpoint can be referred to as a secondary readout, because this assay has produced multiple assay
endpoints where this one serves a background control function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the background measurement intended target family,
where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-M_06, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a

-------
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.035

Response cutoff threshold used to determine hit calls: 0.263

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

-------
Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4798

Number of chemicals tested: 4060

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
13

Inactive hit count: Oihitc 0.9
3218

WINING MODEL SELECTION

NA hit count: hitc^O
1567

Number of sample-assay endpoints with winning hill model:

26
379

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

2756

148

quadratic-polynomialfpoly2) model: 592

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

21

655

169

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.991

Neutral control median absolute deviation, by plate: nmad	0.031

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	3.14%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 169.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 128

ATG_M_19_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for M19 Gene Activation (Internal Marker)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_M_19_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_M_19_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_M_19_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the background control at the transcription factor-level as they relate to the gene . Furthermore,
this assay endpoint can be referred to as a secondary readout, because this assay has produced multiple assay
endpoints where this one serves a background control function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the background measurement intended target family,
where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-M_19, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a

-------
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.07

Response cutoff threshold used to determine hit calls: 0.348

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

-------
Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4798

Number of chemicals tested: 4060

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
6

Inactive hit count: Oihitc 0.9
2568

WINING MODEL SELECTION

NA hit count: hitc^O
2224

Number of sample-assay endpoints with winning hill model:

38
379

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

2769

146

quadratic-poly nomialfpoly 2) model: 581

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

654

15

164

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.019

Neutral control median absolute deviation, by plate: nmad	0.061

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.97%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 164.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 129

ATG_M_32_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for M32 Gene Activation (Internal Marker)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_M_32_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_M_32_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_M_32_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the background control at the transcription factor-level as they relate to the gene . Furthermore,
this assay endpoint can be referred to as a secondary readout, because this assay has produced multiple assay
endpoints where this one serves a background control function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the background measurement intended target family,
where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-M_32, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a

-------
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.065

Response cutoff threshold used to determine hit calls: 0.327

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

-------
Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4798

Number of chemicals tested: 4060

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
11

Inactive hit count: Oihitc 0.9
3013

WINING MODEL SELECTION

NA hit count: hitc^O
1774

Number of sample-assay endpoints with winning hill model:

29
387

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

2713

159

quadratic-polynomialfpoly2) model: 633

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

21

632

1

171

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	4.223

Neutral control median absolute deviation, by plate: nmad	0.266

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	6.3%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 171.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 130

ATG_M_61_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for M61 Gene Activation (Internal Marker)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_M_61_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_M_61_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_M_61_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the background control at the transcription factor-level as they relate to the gene . Furthermore,
this assay endpoint can be referred to as a secondary readout, because this assay has produced multiple assay
endpoints where this one serves a background control function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the background measurement intended target family,
where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-M_61, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a

-------
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.035

Response cutoff threshold used to determine hit calls: 0.263

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

-------
Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 4798

Number of chemicals tested: 4060

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
11

Inactive hit count: Oihitc 0.9
3221

WINING MODEL SELECTION

NA hit count: hitc^O
1566

Number of sample-assay endpoints with winning hill model:

26
378

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

2758

148

quadratic-polynomial(poly2) model: 591

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

20

656

169

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.991

Neutral control median absolute deviation, by plate: nmad	0.031

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	3.14%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 169.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 131

ATG_N U RR1_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human NURR1 Gene Activation

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_NURR1_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_NURR1_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_NURR1_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene NR4A2. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-NURR1, also known as human nuclear receptor
subfamily 4, group A, member 2 [GeneSymbol:NR4A2 | GenelD:4929 | Uniprot_SwissProt_Accession:P43354],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.174

-------
Response cutoff threshold used to determine hit calls: 0.868

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
218

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	4.282

Neutral control median absolute deviation, by plate: nmad	1.686

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	39.38%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 260.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 132

ATG_PPARa_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human peroxisome proliferator-activated receptor,
alpha (PPARalpha)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_PPARa_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_PPARa_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_PPARa_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene PPARA. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-PPARa, also known as human peroxisome proliferator-
activated receptor alpha [GeneSymbokPPARA | GenelD:5465 | Uniprot_SwissProt_Accession:Q07869],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

GW0742

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.232

-------
Response cutoff threshold used to determine hit calls: 1.162

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
280

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	3.284

Neutral control median absolute deviation, by plate: nmad	1.296

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	39.45%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 297.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 133

ATG_PPARd_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human peroxisome proliferator-activated receptor,
delta (PPARd)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_PPARd_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_PPARd_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_PPARd_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene PPARD. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-PPARd, also known as human peroxisome proliferator-
activated receptor delta [GeneSymbokPPARD | GenelD:5467 | Uniprot_SwissProt_Accession:Q03181],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

GW7647

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.219

-------
Response cutoff threshold used to determine hit calls: 1.095

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
53

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.79

Neutral control median absolute deviation, by plate: nmad	0.327

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	41.41%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 189.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 134

ATG_PRARg_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human peroxisome proliferator-activated receptor,
gamma (PPARg)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_PPARg_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_PPARg_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_PPARg_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene PPARG. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-PPARg, also known as human peroxisome proliferator-
activated receptor gamma [GeneSymbokPPARG | GenelD:5468 | Uniprot_SwissProt_Accession:P37231],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

Rosiglitazone

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.23

-------
Response cutoff threshold used to determine hit calls: 1.149

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1084

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	2.558

Neutral control median absolute deviation, by plate: nmad	1.131

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	44.22%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 418.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 135

ATG_PXR_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human PXR Gene Activation

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_PXR_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_PXR_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_PXR_TRANS, was
analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene NR1I2. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-PXR, also known as human nuclear receptor subfamily
1, group I, member 2 [GeneSymbol:NRll2 | GenelD:8856 | Uniprot_SwissProt_Accession:075469],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

T0901317

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.258

-------
Response cutoff threshold used to determine hit calls: 1.291

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1095

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.49

Neutral control median absolute deviation, by plate: nmad	0.497

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	33.33%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 476.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 136

ATG_RARa_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human retinoic acid receptor, alpha (RXRa)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_RARa_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_RARa_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_RARa_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene RARA. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-RARa, also known as human retinoic acid receptor,
alpha [GeneSymbol:RARA | GenelD:5914 | Uniprot_SwissProt_Accession:P10276],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

Retinoic Acid

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.23

-------
Response cutoff threshold used to determine hit calls: 1.149

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
58

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	8.189

Neutral control median absolute deviation, by plate: nmad	5.048

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	61.65%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 256.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 137

ATG_RARb_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human retinoic acid receptor, beta (RXRb)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_RARb_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_RARb_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_RARb_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene RARB. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-RARb, also known as human retinoic acid receptor, beta
[GeneSymbokRARB | GenelD:5915 | Uniprot_SwissProt_Accession:P10826],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

Retinoic Acid

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.216

-------
Response cutoff threshold used to determine hit calls: 1.079

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
13

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	4.912

Neutral control median absolute deviation, by plate: nmad	2.712

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	55.21%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 214.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 138

ATG_RARgJTRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human retinoic acid receptor, gamma (RXRg)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_RARg_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_RARg_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_RARg_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene RARG. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-RARg, also known as human retinoic acid receptor,
gamma [GeneSymbokRARG | GenelD:5916 | Uniprot_SwissProt_Accession:P13631],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

Retinoic Acid

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.232

-------
Response cutoff threshold used to determine hit calls: 1.159

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
44

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	5.914

Neutral control median absolute deviation, by plate: nmad	2.745

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	46.42%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 244.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 139

ATG_RORb_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human RAR-related orphan receptor B (RORb)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_RORb_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_RORb_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_RORb_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene RORB. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-RORb, also known as human RAR-related orphan
receptor B [GeneSymbokRORB | GenelD:6096 | Uniprot_SwissProt_Accession:Q92753],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.315

-------
Response cutoff threshold used to determine hit calls: 1.574

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
6

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	4.272

Neutral control median absolute deviation, by plate: nmad	2.418

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	56.6%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 179.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 140

ATG_RO Rg_TRAN S

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human RAR-related orphan receptor G (RORg)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_RORg_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_RORg_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_RORg_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene RORC. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-RORg, also known as human RAR-related orphan
receptor C [GeneSymbohRORC | GenelD:6097 | Uniprot_SwissProt_Accession:P51449],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

NA

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.235

-------
Response cutoff threshold used to determine hit calls: 1.174

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
21

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	2.014

Neutral control median absolute deviation, by plate: nmad	0.677

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	33.6%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 160.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 141

ATG_RXRa_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human retinoid X receptor, alpha (RXRa)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_RXRa_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_RXRa_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_RXRa_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene RXRA. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-RXRa, also known as human retinoid X receptor, alpha
[GeneSymbokRXRA | GenelD:6256 | Uniprot_SwissProt_Accession:P19793],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

9-cis-Retinoic acid

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.185

-------
Response cutoff threshold used to determine hit calls: 0.923

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
111

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.088

Neutral control median absolute deviation, by plate: nmad	0.354

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	32.57%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 198.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 142

ATG_RXRb_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human retinoid X receptor, beta (RXRb)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_RXRb_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_RXRb_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_RXRb_TRANS, was
analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene RXRB. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-RXRb, also known as human retinoid X receptor, beta
[GeneSymbokRXRB | GenelD:6257 | Uniprot_SwissProt_Accession:P28702],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

9-cis-Retinoic acid

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.201

-------
Response cutoff threshold used to determine hit calls: 1.003

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
625

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.952

Neutral control median absolute deviation, by plate: nmad	0.692

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	35.44%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 394.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 143

ATG_TH Ra 1_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human thyroid receptor alpha (THRal)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_THRal_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_THRal_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_THRal_TRANS,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene THRA. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-THRa, also known as human thyroid hormone receptor,
alpha [GeneSymbol:THRA | GenelD:7067 | Uniprot_SwissProt_Accession:P10827],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

T3

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.221

-------
Response cutoff threshold used to determine hit calls: 1.104

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Thyroid Bioactivity: Assays related to the thyroid adverse outcome pathway network

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
87

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.936

Neutral control median absolute deviation, by plate: nmad	0.281

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	30%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 213.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 144

ATG_VDR_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human vitamin D receptor (VDR)

1.2	Assay Summary: ATG_TRANS is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_VDR_TRANS is one of 30
assay component(s) measured or calculated from the ATG_TRANS assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. Data from the assay
component ATG_VDR_TRANS was analyzed into 1 assay endpoint. This assay endpoint, ATG_VDR_TRANS, was
analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity can be used to
understand the reporter gene at the transcription factor-level as they relate to the gene VDR. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a reporter gene function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-VDR, also known as human vitamin D (1,25-
dihydroxyvitamin D3) receptor [GeneSymbokVDR | GenelD:7421 | Uniprot_SwissProt_Accession:P11473],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

1.23 nM
Key positive control:

la,25-Dihydroxyvitamin D3
Baseline median absolute deviation for the assay (bmad): 0.17

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

-------
Response cutoff threshold used to determine hit calls: 0.852

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 4514

Number of chemicals tested: 4060

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
30

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.966

Neutral control median absolute deviation, by plate: nmad	0.259

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	26.78%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 133.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1204

ATG_XTT_Cytotoxicity

1.	General Information

1.1	Assay Title: Cytotoxicity Assessment in the Attagene TRANS-FACTORIAL HepG2 Assay

1.2	Assay Summary: ATG_XTT_Cytotoxicity is a cell-based, single-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_XTT_Cytotoxicity is
one of one assay component(s) measured or calculated from the ATG_XTT_Cytotoxicity assay. It is designed to
make measurements of cell number, a form of viability reporter, as detected with fluorescence intensity signals
by XTT cytotoxicity assay technology. Data from the assay component ATG_XTT_Cytotoxicity was analyzed into
1 assay endpoint. This assay endpoint, ATG_XTT_Cytotoxicity, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of viability reporter,
loss-of-signal activity can be used to understand changes in the viability. Furthermore, this assay endpoint can
be referred to as a primary readout, because the performed assay has only produced 1 assay endpoint. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell cycle
intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals produced from an enzymatic reaction involving the key
substrate [XTT reagent] are correlated to the viability of the mitochondria in the system.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

-------
2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

2.94 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

376 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 15.145
Response cutoff threshold used to determine hit calls: 75.725
Detection technology used: XTT cytotoxicity assay (Fluorescence)

2.6 Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription

-------
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

17: sublOO (Center data around zero by subtracting the corrected response value (cval) from 100; 100 -
cval. Typically used if data was pre-normalized around 100 with responses decreasing to 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.), 6: resp.multnegl (Multiply the
normalized response value (resp) by -1; -l*resp.)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 3834	Number of chemicals tested: 3402

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
160

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 239.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1348

ATG_N U R77_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for NUR77 orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_NUR77_TRANS2 is one
of 24 assay components measured from the ATG_TRANS2 assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint,
ATG_NUR77_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene NR4A1.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.172

Response cutoff threshold used to determine hit calls: 0.859

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.013

Neutral control median absolute deviation, by plate: nmad	0.076

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	7.46%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1349

ATG_GCN F_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for GCNF orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GCNF_TRANS2 is one
of 24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_GCNF_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene NR6A1. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.278

Response cutoff threshold used to determine hit calls: 1.392

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.44

Neutral control median absolute deviation, by plate: nmad	0.073

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	16.53%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1350

ATG_COU P_TF2_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for COUP-TFII orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_COUP_TF2_TRANS2 is
one of 24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_COUP_TF2_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene NR2F2. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.236

Response cutoff threshold used to determine hit calls: 1.181

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.581

Neutral control median absolute deviation, by plate: nmad	0.064

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.1%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1351

ATG_PN R_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for PNR orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_PNR_TRANS2 is one of
24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_PNR_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene NR2E3. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.265

Response cutoff threshold used to determine hit calls: 1.326

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.552

Neutral control median absolute deviation, by plate: nmad	0.07

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	12.76%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1352

ATG_LRH 1_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for LRH1 orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_LRH1_TRANS2 is one
of 24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_LRH1_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene NR5A2. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.332

Response cutoff threshold used to determine hit calls: 1.659

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.692

Neutral control median absolute deviation, by plate: nmad	0.517

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	30.59%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1353

ATG_Rev_ERB_A_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for Rev-ERB-alpha orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_Rev_ERB_A_TRANS2
is one of 24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_Rev_ERB_A_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene NR1D1. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.183

Response cutoff threshold used to determine hit calls: 0.914

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.829

Neutral control median absolute deviation, by plate: nmad	0.062

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	7.43%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1354

ATG_H N F4g_TRAN S2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for HNF4g orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_HNF4g_TRANS2 is one
of 24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_HNF4g_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene HNF4G. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.369

Response cutoff threshold used to determine hit calls: 1.844

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.554

Neutral control median absolute deviation, by plate: nmad	0.454

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	29.18%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1355

ATG_ERRb_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for human estrogen-related receptor, beta (ERRb)

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_ERRb_TRANS2 is one
of 24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_ERRb_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene ESRRB. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.457

Response cutoff threshold used to determine hit calls: 2.284

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.716

Neutral control median absolute deviation, by plate: nmad	0.478

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	27.87%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1356

ATG_M R_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for MR orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_MR_TRANS2 is one of
24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_MR_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene NR3C2. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.164

Response cutoff threshold used to determine hit calls: 0.82

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
2

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.851

Neutral control median absolute deviation, by plate: nmad	0.098

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.51%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1357

ATG_COU P_TF1_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for COUP-TFI orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_COUP_TFl_TRANS2 is
one of 24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_COUP_TFl_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene NR2F1. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.325

Response cutoff threshold used to determine hit calls: 1.623

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.385

Neutral control median absolute deviation, by plate: nmad	0.035

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.05%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1358

ATG_N0R1_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for NOR1 orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_NORl_TRANS2 is one
of 24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_NORl_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene NR4A3. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.177

Response cutoff threshold used to determine hit calls: 0.887

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	2.082

Neutral control median absolute deviation, by plate: nmad	0.207

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.93%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1359

ATG_TR4_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for TR4 orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_TR4_TRANS2 is one of
24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_TR4_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene NR2C2. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.2
Response cutoff threshold used to determine hit calls: 1.001
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.748

Neutral control median absolute deviation, by plate: nmad	0.119

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	15.95%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1360

ATG_DAX1_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for DAX1 orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_DAX1_TRANS2 is one
of 24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_DAX1_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene NR0B1. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.194

Response cutoff threshold used to determine hit calls: 0.971

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.777

Neutral control median absolute deviation, by plate: nmad	0.076

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.73%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1361

ATG_Rev_ERB_B_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for Rev-Erb beta orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_Rev_ERB_B_TRANS2
is one of 24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_Rev_ERB_B_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene NR1D2. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.206

Response cutoff threshold used to determine hit calls: 1.029

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.642

Neutral control median absolute deviation, by plate: nmad	0.067

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	10.39%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1362

ATG_R0Ra_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for ROR alpha orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_RORa_TRANS2 is one
of 24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_RORa_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene RORA. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.265

Response cutoff threshold used to determine hit calls: 1.324

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.362

Neutral control median absolute deviation, by plate: nmad	0.199

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	14.58%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1363

ATG_PR_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for human progesterone receptor (PR)

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_PR_TRANS2 is one of
24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_PR_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene PGR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.145

Response cutoff threshold used to determine hit calls: 0.727

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
4

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.917

Neutral control median absolute deviation, by plate: nmad	0.123

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.41%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1364

ATG_RX Rg_TRAN S2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for human retinoid X receptor gamma (RXRg)

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_RXRg_TRANS2 is one
of 24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_RXRg_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene RXRG. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.203

Response cutoff threshold used to determine hit calls: 1.015

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.872

Neutral control median absolute deviation, by plate: nmad	0.082

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.35%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1365

ATG_SF_1_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for SF-1 orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_SF_1_TRANS2 is one
of 24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_SF_1_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene NR5A1. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.36

Response cutoff threshold used to determine hit calls: 1.802

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	3.499

Neutral control median absolute deviation, by plate: nmad	1.05

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	30.02%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1366

ATG_SHP_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for SHP orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_SHP_TRANS2 is one of
24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_SHP_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene NR0B2. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.194

Response cutoff threshold used to determine hit calls: 0.969

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.665

Neutral control median absolute deviation, by plate: nmad	0.048

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	7.25%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1367

ATG_ERb_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for human estrogen receptor, beta (Erb)

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_ERb_TRANS2 is one of
24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_ERb_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene ESR2. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.213

Response cutoff threshold used to determine hit calls: 1.063

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
5

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.663

Neutral control median absolute deviation, by plate: nmad	0.091

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.76%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 3.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1368

ATG_TI_X_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for TLX orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_TLX_TRANS2 is one of
24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_TLX_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene NR2E1. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.247

Response cutoff threshold used to determine hit calls: 1.235

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.522

Neutral control median absolute deviation, by plate: nmad	0.096

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	18.3%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1369

ATG_THRb_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for human thyroid receptor, beta (THRb)

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_THRb_TRANS2 is one
of 24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_THRb_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene THRB. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.278

Response cutoff threshold used to determine hit calls: 1.39

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Thyroid Bioactivity: Assays related to the thyroid adverse outcome pathway network

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.303

Neutral control median absolute deviation, by plate: nmad	0.039

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	12.72%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1370

ATG_EAR2_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for EAR2 orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_EAR2_TRANS2 is one
of 24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_EAR2_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene NR2F6. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.272

Response cutoff threshold used to determine hit calls: 1.36

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.294

Neutral control median absolute deviation, by plate: nmad	0.039

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.11%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1371

ATG_TR2_TRANS2

1.	General Information

1.1	Assay Title: Attagene TRANS2-FACTORIAL HepG2 Assay for TR2 orphan gene

1.2	Assay Summary: ATG_TRANS2 is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_TR2_TRANS2 is one of
24 assay components measured or calculated from the ATG_TRANS2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_TR2_TRANS2 was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene NR2C1. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is orphan.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factors.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.179

Response cutoff threshold used to determine hit calls: 0.897

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.958

Neutral control median absolute deviation, by plate: nmad	0.093

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.75%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1746

ATG_G PCR_AD0RA2A_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for adenosine A2a g-protein coupled receptor (ADORA2A)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_ADORA2A_TRANS is
one of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_GPCR_ADORA2A_TRANS was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA
for gain-of-signal activity can be used to understand the reporter gene at the transcription factor-level as they
relate to the gene ADORA2A. Furthermore, this assay endpoint can be referred to as a primary readout, because
this assay has produced multiple assay endpoints where this one serves a reporter gene function. To generalize
the intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target
family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
ADORA2A, which is responsive to the adenosine A2a receptor [GeneSymbol:ADORA2A| GenelD:135 |
Uniprot_SwissProt_Accession:P29274],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2

Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a

-------
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.244

Response cutoff threshold used to determine hit calls: 1.218

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

-------
Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24

Number of chemicals tested: 24

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: Oihitc 0.9
13

WINING MODEL SELECTION

NA hit count: hitc^O
11

Number of sample-assay endpoints with winning hill model:

0
3

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

16

quadratic-polynomialfpoly2) model: 2

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

0

3

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	11.939

Neutral control median absolute deviation, by plate: nmad	2.428

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	20.34%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1748

ATG_G PCR_AD0RA2 B_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human adenosine A2b g-protein coupled receptor
(ADORA2B)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_ADORA2B_TRANS is
one of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_ADORA2B_TRANS was analyzed in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene ADORA2B. Furthermore, this assay endpoint can be referred to as a primary readout, because this
assay has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
ADORA2B, which is responsive to the adenosine A2b receptor [GeneSymbol:ADORA2B| GenelD:136 |
Uniprot_SwissProt_Accession:NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a

-------
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.274

Response cutoff threshold used to determine hit calls: 1.37

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

-------
Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24

Number of chemicals tested: 24

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: Oihitc 0.9
21

WINING MODEL SELECTION

NA hit count: hitc^O
3

Number of sample-assay endpoints with winning hill model:

0
3

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

4

9

quadratic-polynomialfpoly2) model: 4

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

4

0

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	5.595

Neutral control median absolute deviation, by plate: nmad	0.851

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	15.21%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1750

ATG_G PCR_ADRA1A_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human adrenoceptor alpha 1A (ADRA1A)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_ADRA1A_TRANS is
one of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_ADRA1A_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene ADRA1A. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
ADRA1A, which is responsive to the adrenoceptor alpha 1A. [GeneSymbol:ADRAlA| GenelD:148 |
Uniprot_SwissProt_Accession:NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.183

Response cutoff threshold used to determine hit calls: 0.916

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
1

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.99

Neutral control median absolute deviation, by plate: nmad	0.202

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	20.43%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1752

ATG_G PCR_ADRA2 B_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human adrenoceptor alpha IB (ADRA1B)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_ADRA2B_TRANS is
one of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_ADRA2B_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene ADRA2B. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
ADRA2B, which is responsive to the adrenoceptor alpha 2B. [GeneSymbol:ADRA2B| GenelD:151 |
Uniprot_SwissProt_Accession:NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.201

Response cutoff threshold used to determine hit calls: 1.006

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.609

Neutral control median absolute deviation, by plate: nmad	0.225

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	14.01%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1754

ATG_GPCR_ADRB2_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human adrenoceptor beta 2 (ADRB2)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_ADRB2_TRANS is one
of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_ADRB2_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene ADRB2. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
ADRB2, which is responsive to the adrenoceptor beta 2, surface. [GeneSymbol:ADRB2| GenelD:154 |
Uniprot_SwissProt_Accession: P07550],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.212

Response cutoff threshold used to determine hit calls: 1.062

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	7.122

Neutral control median absolute deviation, by plate: nmad	1.593

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	22.37%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1756

ATG_GPCR_ADRB3_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human adrenoceptor beta 3 (ADRB3)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_ADRB3_TRANS is one
of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_ADRB3_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene ADRB3. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
ADRB3, which is responsive to the adrenoceptor beta 3. [GeneSymbol:ADRB3| GenelD:155 |
Uniprot_SwissProt_Accession: P13945],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.353

Response cutoff threshold used to determine hit calls: 1.765

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.288

Neutral control median absolute deviation, by plate: nmad	0.515

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	39.98%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1758

ATG_G PCR_CHRM3_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human cholinergic receptor, muscarinic 3 (CHRM3)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_CHRM3_TRANS is
one of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_CHRM3_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene CHRM3. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
CHRM3, which is responsive to the cholinergic receptor, muscarinic 3. [GeneSymbol:CHRM3| GenelD:1131 |
Uniprot_SwissProt_Accession: P20309],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.262

Response cutoff threshold used to determine hit calls: 1.311

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.64

Neutral control median absolute deviation, by plate: nmad	0.225

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	35.1%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1760

ATG_GPCR_DRD1_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human dopamine receptor D1 (DRD1)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_DRD1_TRANS is one
of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_DRD1_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene DRD1. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
DRD1, which is responsive to the dopamine receptor Dl. [GeneSymbol:DRDl| GenelD:1812 |
Uniprot_SwissProt_Accession: P21728],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.289

Response cutoff threshold used to determine hit calls: 1.444

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	9.804

Neutral control median absolute deviation, by plate: nmad	2.67

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	27.24%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1762

ATG_GPCR_DRD5_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human dopamine receptor D5 (DRD5)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_DRD5_TRANS is one
of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_DRD5_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene DRD5. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
DRD5, which is responsive to the dopamine receptor D5. [GeneSymbol:DRD5| GenelD:1816 |
Uniprot_SwissProt_Accession: NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.258

Response cutoff threshold used to determine hit calls: 1.291

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	32.421

Neutral control median absolute deviation, by plate: nmad	4.422

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.64%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1764

ATG_G PCR_EDN RA_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human endothelin receptor type A (EDNRA)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_EDNRA_TRANS is
one of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_EDNRA_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene EDNRA. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
EDNRA, which is responsive to the endothelin receptor type A. [GeneSymbol:EDNRA| GenelD: 1909 |
Uniprot_SwissProt_Accession: P25101],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.361

Response cutoff threshold used to determine hit calls: 1.805

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.534

Neutral control median absolute deviation, by plate: nmad	0.159

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	29.85%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1766

ATG_G PCR_GCG R_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for glucagon receptor (GCGR)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_GCGR_TRANS is one
of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_GCGR_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene GCGR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
GCGR, which is responsive to the glucagon receptor. [GeneSymbol: GCGR| GenelD: 2642 |
Uniprot_SwissProt_Accession: NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.244

Response cutoff threshold used to determine hit calls: 1.219

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.526

Neutral control median absolute deviation, by plate: nmad	0.06

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.43%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1768

ATG_G PCR_G PBAR1_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for G protein-coupled bile acid receptor 1 (GPBAR1)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_GPBAR1_TRANS is
one of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_GPBAR1_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene GPBAR1. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
GPBAR1, which is responsive to the G protein-coupled bile acid receptor 1. [GeneSymbol: GPBAR1| GenelD:
1501306 | Uniprot_SwissProt_Accession: NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.412

Response cutoff threshold used to determine hit calls: 2.061

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	9.115

Neutral control median absolute deviation, by plate: nmad	2.93

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	32.15%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1770

ATG_G PCR_G PR40_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human free fatty acid receptor 1 (FFAR1)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_GPR40_TRANS is one
of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_GPR40_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene GPR40. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
FFAR1, which is responsive to the free fatty acid receptor 1. [GeneSymbol: FFAR1| GenelD: 2864 |
Uniprot_SwissProt_Accession: NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.228

Response cutoff threshold used to determine hit calls: 1.141

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.974

Neutral control median absolute deviation, by plate: nmad	0.157

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	16.13%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1772

ATG_G PCR_GQ_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human G protein subunit alpha q (GNAQ)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_GQ_TRANS is one of
35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_GPCR_GQ_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene GNAQ.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
GNAQ, which is responsive to the G protein subunit alpha q. [GeneSymbol: GNAQ| GenelD: 2776 |
Uniprot_SwissProt_Accession: NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.24

Response cutoff threshold used to determine hit calls: 1.199

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.649

Neutral control median absolute deviation, by plate: nmad	0.36

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	21.85%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1774

ATG_G PCR_GS1_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human GNAS complex locus (GNAS1)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_GS1_TRANS is one
of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_GS1_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene GNAS. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
GNAS, which is responsive to the GNAS complex locus. [GeneSymbol: GNAS| GenelD: 2778 |
Uniprot_SwissProt_Accession: NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.231

Response cutoff threshold used to determine hit calls: 1.155

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	2.062

Neutral control median absolute deviation, by plate: nmad	0.208

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	10.07%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1776

ATG_G PCR_GS_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human GNAS complex locus (GNAS)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_GS_TRANS is one of
35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_GPCR_GS_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene GNAS.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
GNAS, which is responsive to the GNAS complex locus. [GeneSymbol: GNAS| GenelD: 2778 |
Uniprot_SwissProt_Accession: NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.216

Response cutoff threshold used to determine hit calls: 1.078

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	2.553

Neutral control median absolute deviation, by plate: nmad	0.159

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	6.24%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1778

ATG_G PCR_H RH 1_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human histamine receptor HI (HRH1)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_HRH1_TRANS is one
of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_HRH1_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene HRH1. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
HRH1, which is responsive to the histamine receptor HI. [GeneSymbol: HRH1| GenelD: 3269 |
Uniprot_SwissProt_Accession: NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.22

Response cutoff threshold used to determine hit calls: 1.099

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.523

Neutral control median absolute deviation, by plate: nmad	0.134

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	25.65%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1780

ATG_G PCR_HTR6_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human 5-hydroxytryptamine (serotonin) receptor 6,
G(HTR6)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_HTR6_TRANS is one
of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_HTR6_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene HTR6. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
HTR6, which is responsive to the 5-hydroxytryptamine (serotonin) receptor 6, G. [GeneSymbol: HTR61 GenelD:
3362 | Uniprot_SwissProt_Accession: NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a

-------
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.204

Response cutoff threshold used to determine hit calls: 1.019

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

-------
Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24

Number of chemicals tested: 24

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: Oihitc 0.9
17

WINING MODEL SELECTION

NA hit count: hitc^O
7

Number of sample-assay endpoints with winning hill model:

0
0

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

19

quadratic-polynomialfpoly2) model: 2

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

2

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	19.196

Neutral control median absolute deviation, by plate: nmad	3.67

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	19.12%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1782

ATG_G PCR_HTR7_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human 5-hydroxytryptamine (serotonin) receptor 7
(HTR7)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_HTR7_TRANS is one
of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_HTR7_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene HTR7. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
HTR7, which is responsive to the 5-hydroxytryptamine (serotonin) receptor 7. [GeneSymbol: HTR7| GenelD:
3363 | Uniprot_SwissProt_Accession: NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a

-------
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.157

Response cutoff threshold used to determine hit calls: 0.784

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

-------
Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24

Number of chemicals tested: 24

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: Oihitc 0.9
11

WINING MODEL SELECTION

NA hit count: hitc^O
13

Number of sample-assay endpoints with winning hill model:

0
4

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

10

quadratic-polynomialfpoly2) model: 4

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

5

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	6.481

Neutral control median absolute deviation, by plate: nmad	0.956

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	14.74%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1784

ATG_G PCR_LPAR4_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human lysophosphatidic acid receptor 4 (LPAR4)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_LPAR4_TRANS is one
of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_LPAR4_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene LPAR4. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
LPAR4, which is responsive to the lysophosphatidic acid receptor 4. [GeneSymbol: LPAR4| GenelD: 2846 |
Uniprot_SwissProt_Accession: NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.161

Response cutoff threshold used to determine hit calls: 0.804

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.086

Neutral control median absolute deviation, by plate: nmad	0.202

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	18.57%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1786

ATG_G PCR_MC1R_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human melanocortin 1 receptor ( MC1R)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_MC1R_TRANS is one
of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_MC1R_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene MC1R. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
MC1R, which is responsive to the melanocortin 1 receptor (alpha melanocyte stimulating hormone receptor).
[GeneSymbol: MC1R| GenelD: 4157 | Uniprot_SwissProt_Accession: NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.261

Response cutoff threshold used to determine hit calls: 1.307

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	10.349

Neutral control median absolute deviation, by plate: nmad	1.062

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	10.26%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1788

ATG_G PCR_MC2 R_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human melanocortin 2 receptor (MC2R)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_MC2R_TRANS is one
of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_MC2R_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene MC2R. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
MC2R, which is responsive to the melanocortin 2 receptor (alpha melanocyte stimulating hormone receptor).
[GeneSymbol: MC2R| GenelD: 4158 | Uniprot_SwissProt_Accession: NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.207

Response cutoff threshold used to determine hit calls: 1.036

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
1

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.796

Neutral control median absolute deviation, by plate: nmad	0.263

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	33.06%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1790

ATG_G PCR_MC3 R_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human melanocortin 3 receptor (MC3R)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_MC3R_TRANS is one
of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_MC3R_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene MC3R. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
MC3R, which is responsive to the melanocortin 3 receptor (alpha melanocyte stimulating hormone receptor).
[GeneSymbol: MC3R| GenelD: 4159 | Uniprot_SwissProt_Accession: NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.16

Response cutoff threshold used to determine hit calls: 0.8

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.07

Neutral control median absolute deviation, by plate: nmad	0.191

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	17.87%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1792

ATG_G PCR_MC4R_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human melanocortin 4 receptor (MC4R)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_MC4R_TRANS is one
of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_MC4R_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene MC4R. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
MC4R, which is responsive to the melanocortin 4 receptor (alpha melanocyte stimulating hormone receptor).
[GeneSymbol: MC4R| GenelD: 4160 | Uniprot_SwissProt_Accession: NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.234

Response cutoff threshold used to determine hit calls: 1.17

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.909

Neutral control median absolute deviation, by plate: nmad	0.417

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	21.83%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1794

ATG_G PCR_PTG DR_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human prostaglandin D2 receptor (PTGDR)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_PTGDR_TRANS is one
of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_PTGDR_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene PTGDR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
PTGDR, which is responsive to the prostaglandin D2 receptor (DP). [GeneSymbol: PTGDR| GenelD: 5729 |
Uniprot_SwissProt_Accession: NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.359

Response cutoff threshold used to determine hit calls: 1.795

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	9.43

Neutral control median absolute deviation, by plate: nmad	2.12

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	22.48%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1796

ATG_G PCR_PTG I R_TRANS

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL HepG2 Assay for human prostaglandin 12 receptor (PTGIR)

1.2	Assay Summary: ATG_GPCR is a cell-based, multiplexed-readout assay that uses HepG2, a human liver cell line,
with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_GPCR_PTGIR_TRANS is one
of 35 assay components measured or calculated from the ATG_GPCR assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay
endpoint ATG_GPCR_PTGIR_TRANS was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene PTGIR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
PTGIR, which is responsive to the prostaglandin 12 (prostacyclin) receptor (IP). [GeneSymbol: PTGIR| GenelD:
5739 | Uniprot_SwissProt_Accession: NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0412 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

10 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.304

Response cutoff threshold used to determine hit calls: 1.52

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 5: bmad5 (Add a cutoff value
of 5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 24	Number of chemicals tested: 24

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.512

Neutral control median absolute deviation, by plate: nmad	0.181

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	35.33%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: RomanovS, Medvedev A, Gambarian M, Poltoratskaya N, Moeser M, Medvedeva L, Gambarian
M, Diatchenko L, Makarov S. Homogeneous reporter system enables quantitative functional assessment of
multiple transcription factors. Nat Methods. 2008 Mar;5(3):253-60. doi: 10.1038/nmeth.ll86. Epub 2008 Feb
24. PubMed PMID: 18297081., Martin MT, Dix DJ, Judson RS, Kavlock RJ, Reif DM, Richard AM, Rotroff DM,
Romanov S, Medvedev A, Poltoratskaya N, Gambarian M, Moeser M, Makarov SS, Houck KA. Impact of
environmental chemicals on key transcription regulators and correlation to toxicity end points within EPA's
ToxCast program. Chem Res Toxicol. 2010 Mar 15;23(3):578-90. doi: 10.1021/tx900325g. PubMed PMID:
20143881., Medvedev A, Moeser M, Medvedeva L, Martsen E, Granick A, Raines L, Zeng M, Makarov S Jr, Houck
KA, Makarov SS. Evaluating biological activity of compounds by transcription factor activity profiling. Sci Adv.
2018 Sep 26;4(9):eaar4666. doi: 10.1126/sciadv.aar4666. PMID: 30263952; PMCID: PMC6157966.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1913

ATG_chAR_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for chicken androgen receptor (AR)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_chAR_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_chAR_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene AR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element AR, which is responsive to the chicken androgen receptor. [GeneSymbokar | GenelD:4221651
Uniprot_SwissProt_Accession:Q2ACE0],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.835 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.192
Response cutoff threshold used to determine hit calls: 0.576
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
17

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.48

Neutral control median absolute deviation, by plate: nmad	0.09

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	6.06%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 5.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1915

ATG_frAR_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for frog androgen receptor (AR)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_frAR_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_frAR_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene AR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element AR, which is responsive to the frog androgen receptor. [GeneSymbol:ar.L | GenelD:399456|
Uniprot_SwissProt_Accession:P70048],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.835 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.154
Response cutoff threshold used to determine hit calls: 0.463
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
8

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.032

Neutral control median absolute deviation, by plate: nmad	0.094

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.12%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 9.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1917

ATG_hAR_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for human androgen receptor (AR)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_hAR_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_hAR_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene AR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element AR, which is responsive to the human androgen receptor. [GeneSymbol:AR | GenelD:367 |
Uniprot_SwissProt_Accession:P10275],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.835 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.2
Response cutoff threshold used to determine hit calls: 0.599
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
4

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.977

Neutral control median absolute deviation, by plate: nmad	0.119

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	6%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 10.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1919

ATG_trAR_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for turtle androgen receptor (AR)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_trAR_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_trAR_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene AR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element AR, which is responsive to the turtle androgen receptor. [GeneSymbokAR |
GenelD:101947425 | Uniprot_SwissProt_Accession:NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.835 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.186
Response cutoff threshold used to determine hit calls: 0.558
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
8

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.721

Neutral control median absolute deviation, by plate: nmad	0.077

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	4.48%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 16.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1921

ATG_zfAR_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for zebrafish androgen receptor (AR)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_zfAR_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_zfAR_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene AR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element AR, which is responsive to the zebrafish androgen receptor. [GeneSymbokAR |
GenelD:100005148 | Uniprot_SwissProt_Accession:NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.835 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.164
Response cutoff threshold used to determine hit calls: 0.491
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
7

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.928

Neutral control median absolute deviation, by plate: nmad	0.043

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	4.63%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 9.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1923

ATG_frERl_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for frog estrogen receptor (ESR1)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_frERl_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_frERl_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the frog estrogen receptor. [GeneSymbol:esrl.L | GenelD: 398734
| Uniprot_SwissProt_Accession:Q6W5G7],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.835 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.28
Response cutoff threshold used to determine hit calls: 0.839
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
45

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.978

Neutral control median absolute deviation, by plate: nmad	0.191

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.67%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 16.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1925

ATG_zfERl_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for zebrafish estrogen receptor
(ESR1)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_zfERl_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_zfERl_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the zebrafish estrogen receptor. [GeneSymbokesrl | GenelD:
2592521 Uniprot_SwissProt_Accession:P57717],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.835 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.172

Response cutoff threshold used to determine hit calls: 0.517

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
25

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.317

Neutral control median absolute deviation, by plate: nmad	0.136

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	10.35%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 13.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1927

ATG_zfER2a_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for zebrafish estrogen receptor
(ESR2A)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_zfER2a_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_zfER2a_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the zebrafish estrogen receptor. [GeneSymbol:esr2a | GenelD:
3177341 Uniprot_SwissProt_Accession:Q7ZU32],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.835 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.23

Response cutoff threshold used to determine hit calls: 0.69

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
32

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.206

Neutral control median absolute deviation, by plate: nmad	0.167

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.83%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 11.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1929

ATG_zfER2b_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for zebrafish estrogen receptor
(ESR2B)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_zfER2b_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_zfER2b_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the zebrafish estrogen receptor. [GeneSymbol:esr2b | GenelD:
3177331 Uniprot_SwissProt_Accession:Q5PR29],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.835 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.181

Response cutoff threshold used to determine hit calls: 0.544

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
82

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.912

Neutral control median absolute deviation, by plate: nmad	0.15

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	16.41%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 22.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1931

ATG_frER2_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for frog estrogen receptor (ESR2)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_frER2_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_frER2_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the frog estrogen receptor. [GeneSymbol:esr2.L | GenelD:
100174814| Uniprot_SwissProt_Accession:A0AlL8FA50],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.835 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.153
Response cutoff threshold used to determine hit calls: 0.46
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
76

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.124

Neutral control median absolute deviation, by plate: nmad	0.054

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	4.81%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 21.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1933

ATG_chERa_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for chicken estrogen receptor (ESR1)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_chERa_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_chERa_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the chicken estrogen receptor. [GeneSymbol:ESRl | GenelD:
3960991 Uniprot_SwissProt_Accession:P06212],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.835 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.251
Response cutoff threshold used to determine hit calls: 0.753
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
57

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	11.996

Neutral control median absolute deviation, by plate: nmad	0.668

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.57%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 11.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1935

ATG_hERa_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for human estrogen receptor (ESR1)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_hERa_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_hERa_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the human estrogen receptor. [GeneSymbol:ESRl | GenelD: 20991
Uniprot_SwissProt_Accession:P03372],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.835 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.246
Response cutoff threshold used to determine hit calls: 0.738
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
62

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	7.515

Neutral control median absolute deviation, by plate: nmad	0.832

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.08%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 13.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1937

ATG_trERa_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for turtle estrogen receptor (ESR1)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_trERa_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_trERa_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the turtle estrogen receptor. [GeneSymbokESRl | GenelD:
1019335331 Uniprot_SwissProt_Accession:B6ElV9],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.835 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.257
Response cutoff threshold used to determine hit calls: 0.771
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
62

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	6.32

Neutral control median absolute deviation, by plate: nmad	0.85

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.44%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 11.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1939

ATG_hERb_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for human estrogen receptor (ESR2)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_hERb_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_hERb_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the human estrogen receptor. [GeneSymbol:ESR2 | GenelD: 2100|
Uniprot_SwissProt_Accession:Q92731],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.835 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.276
Response cutoff threshold used to determine hit calls: 0.828
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
40

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.674

Neutral control median absolute deviation, by plate: nmad	0.158

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.44%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 14.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1941

ATG_GAL4_XSP1

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for GAL4 Gene Activation (Internal
Marker)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_GAL4_XSP1 is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. This
assay endpoint, ATG_GAL4_XSP1, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the background control at the transcription factor-level as they relate
to the gene. Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has
produced multiple assay endpoints where this one serves a background control function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the background measurement
intended target family, where the subfamily is baseline control.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). A trans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NR and modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.835 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.13
Response cutoff threshold used to determine hit calls: 0.39
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	2.864

Neutral control median absolute deviation, by plate: nmad	0.346

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	12.09%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 7.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1943

ATG_M_06_XSP1

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for M06 Gene Activation (Internal
Marker)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_M_06_XSP1 is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. This
assay endpoint, ATG_M_06_XSP1, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the background control at the transcription factor-level as they relate
to the gene. Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has
produced multiple assay endpoints where this one serves a background control function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the background measurement
intended target family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-M_06, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). A trans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NR and modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.835 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.035
Response cutoff threshold used to determine hit calls: 0.263
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.026

Neutral control median absolute deviation, by plate: nmad	0.014

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	1.37%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1945

ATG_M_19_XSP1

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for M19 Gene Activation (Internal
Marker)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_M_19_XSP1 is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. This
assay endpoint, ATG_M_19_XSP1, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the background control at the transcription factor-level as they relate
to the gene. Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has
produced multiple assay endpoints where this one serves a background control function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the background measurement
intended target family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-M_19, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). A trans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NR and modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.835 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.079
Response cutoff threshold used to determine hit calls: 0.263
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.949

Neutral control median absolute deviation, by plate: nmad	0.027

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	2.89%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1947

ATG_M_32_XSP1

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for M32 Gene Activation (Internal
Marker)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_M_32_XSP1 is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. This
assay endpoint, ATG_M_32_XSP1, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the background control at the transcription factor-level as they relate
to the gene. Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has
produced multiple assay endpoints where this one serves a background control function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the background measurement
intended target family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-M_32, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). A trans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NR and modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.835 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.069
Response cutoff threshold used to determine hit calls: 0.263
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	3.932

Neutral control median absolute deviation, by plate: nmad	0.174

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	4.43%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 3.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1949

ATG_M_61_XSP1

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for M61 Gene Activation (Internal
Marker)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_M_61_XSP1 is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. This
assay endpoint, ATG_M_61_XSP1, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the background control at the transcription factor-level as they relate
to the gene. Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has
produced multiple assay endpoints where this one serves a background control function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the background measurement
intended target family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-M_61, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). A trans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NR and modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.835 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.035
Response cutoff threshold used to determine hit calls: 0.263
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.026

Neutral control median absolute deviation, by plate: nmad	0.014

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	1.37%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1951

ATG_hPPARg_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for human peroxisome proliferator
activated receptor gamma (PPARg)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_hPPARg_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_hPPARg_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene PPARg. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element PPARg, which is responsive to the human peroxisome proliferator activated receptor gamma.
[GeneSymbokPPARG | GenelD:5468| Uniprot_SwissProt_Accession:P37231],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.835 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.239

Response cutoff threshold used to determine hit calls: 0.716

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

NA

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
48

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	3.95

Neutral control median absolute deviation, by plate: nmad	0.218

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.52%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 13.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1953

ATG_mPPARg_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for mouse peroxisome proliferator
activated receptor gamma (PPARg)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_mPPARg_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_mPPARg_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene PPARg. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element PPARg, which is responsive to the mouse peroxisome proliferator activated receptor gamma.
[GeneSymbokPparg | GenelD: 190161 Uniprot_SwissProt_Accession:P37238],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.835 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.196

Response cutoff threshold used to determine hit calls: 0.587

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

NA

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
59

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	3.135

Neutral control median absolute deviation, by plate: nmad	0.775

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	24.73%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 10.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1955

ATG_zfPPARgJ(SPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for zebrafish peroxisome
proliferator activated receptor gamma (PPARg)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_zfPPARg_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_zfPPARg_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene PPARg. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element PPARg, which is responsive to the zebrafish peroxisome proliferator activated receptor
gamma. [GeneSymbol: pparg | GenelD: 5570371 Uniprot_SwissProt_Accession:A6XMH6],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.835 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.153

Response cutoff threshold used to determine hit calls: 0.46

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
8

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.924

Neutral control median absolute deviation, by plate: nmad	0.05

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.38%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 5.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1957

ATG_m PXR_XSP1

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for mouse pregnane X receptor (PXR)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_mPXR_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_mPXR_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene PXR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element PXR, which is responsive to the mouse pregnane X receptor. [GeneSymbol: NR1IA | GenelD:
88561 Uniprot_SwissProt_Accession:075469],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.835 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.211
Response cutoff threshold used to determine hit calls: 0.632
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
59

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	3.362

Neutral control median absolute deviation, by plate: nmad	0.311

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.24%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 7.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1959

ATG_frTRa_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for frog thyroid hormone receptor,
alpha (TRa)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_frTRa_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_frTRa_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene TR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element TRa, which is responsive to the frog thyroid hormone receptor, alpha L homeolog.
[GeneSymbol: thra.L | GenelD: 3979421 Uniprot_SwissProt_Accession: P15204],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.835 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.368

Response cutoff threshold used to determine hit calls: 1.104

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
6

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.761

Neutral control median absolute deviation, by plate: nmad	0.168

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.56%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 6.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1961

ATG_hTRa_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for human thyroid hormone
receptor, alpha (Tra)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_hTRa_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_hTRa_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene TR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element TRa, which is responsive to the human thyroid hormone receptor, alpha. [GeneSymbol: THRA
| GenelD: 70671 Uniprot_SwissProt_Accession: P10827],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.835 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.274

Response cutoff threshold used to determine hit calls: 0.823

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

NA

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
10

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.567

Neutral control median absolute deviation, by plate: nmad	0.248

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	15.8%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 9.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1963

ATG_trTRa_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for turtle thyroid hormone receptor,
alpha (TRa)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_trTRa_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_trTRa_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene TR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element TRa, which is responsive to the turtle thyroid hormone receptor, alpha. [GeneSymbol: THRA
| GenelD: 1019490571 Uniprot_SwissProt_Accession: NA],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.835 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.269

Response cutoff threshold used to determine hit calls: 0.808

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
8

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.74

Neutral control median absolute deviation, by plate: nmad	0.144

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	19.45%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 6.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1965

ATG_zfTRa_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for zebrafish thyroid hormone
receptor, alpha (TRa)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_zfTRa_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_zfTRa_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene TR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element TRa, which is responsive to the zebrafish thyroid hormone receptor, alpha a. [GeneSymbol:
thraa | GenelD: 30670| Uniprot_SwissProt_Accession: Q98867],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.835 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.249

Response cutoff threshold used to determine hit calls: 0.746

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
8

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.591

Neutral control median absolute deviation, by plate: nmad	0.022

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	3.76%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 4.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1967

ATG_hTRb_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for human thyroid hormone
receptor, beta (TRb)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_hTRb_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_hTRb_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene TR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element TRa, which is responsive to the human thyroid hormone receptor, beta. [GeneSymbol: THRB
| GenelD: 70681 Uniprot_SwissProt_Accession: P10828],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.835 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.286

Response cutoff threshold used to determine hit calls: 0.859

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

NA

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
10

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.694

Neutral control median absolute deviation, by plate: nmad	0.079

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.44%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 6.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1969

ATG_zfTRb_XSPl

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for zebrafish thyroid hormone
receptor, beta (TRb)

1.2	Assay Summary: ATG_XSPl_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xspl included 3 nM 6-
alpha-fluorotestosterone to partially stimulate the androgen receptor. ATG_zfTRb_XSPl is one of 29 assay
components measured or calculated from the ATG_XSPl_multi-species_TRANS assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_zfTRb_XSPl was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene TR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element TRa, which is responsive to the zebrafish thyroid hormone receptor, beta. [GeneSymbol: thrb
| GenelD: 306071 Uniprot_SwissProt_Accession: Q9PVE4],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.835 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.247

Response cutoff threshold used to determine hit calls: 0.741

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 100

Number of chemicals tested: 98

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
10

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.687

Neutral control median absolute deviation, by plate: nmad	0.074

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	10.79%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 4.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1971

ATG_XTT_Cytotoxicity_XSPl

1.	General Information

1.1	Assay Title: Cytotoxicity Assessment in the Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay

1.2	Assay Summary: ATG_XSPl_XTT_Cytotoxicity_multi-species is a cell-based, single-readout assay that uses
HepG2, a human liver cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate.
Xspl included 3 nM 6-alpha-fluorotestosterone to partially stimulate the androgen receptor.
ATG_XTT_Cytotoxicity_XSPl is an assay component measured from the ATG_XSPl_XTT_Cytotoxicity_multi-
species assay. It is designed to make measurements of cell number, a form of viability reporter, as detected with
fluorescence intensity signals by XTT cytotoxicity assay technology. This assay endpoint,
ATG_XTT_Cytotoxicity_XSPl, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of viability reporter, loss-of-signal activity can be used to
understand changes in the viability. Furthermore, this assay endpoint can be referred to as a primary readout,
because the performed assay has only produced 1 assay endpoint. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily
is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals produced from an enzymatic reaction involving the key
substrate [XTT reagent] are correlated to the viability of the mitochondria in the system.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.835 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 9.54
Response cutoff threshold used to determine hit calls: 28.62
Detection technology used: XTT cytotoxicity assay (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

17: sublOO (Center data around zero by subtracting the corrected response value (cval) from 100; 100 -
cval. Typically used if data was pre-normalized around 100 with responses decreasing to 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

-------
Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 100	Number of chemicals tested: 98

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
15

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 11.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1973

ATG_M_06_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for M06 Gene Activation (Internal
Marker)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_M_06_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
This assay endpoint, ATG_M_06_XSP2, was analyzed in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the background control at the transcription factor-level as they
relate to the gene. Furthermore, this assay endpoint can be referred to as a secondary readout, because this
assay has produced multiple assay endpoints where this one serves a background control function. To generalize
the intended target to other relatable targets, this assay endpoint is annotated to the background measurement
intended target family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-M_06, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). A trans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NR and modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.0928 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.035
Response cutoff threshold used to determine hit calls: 0.263
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 190

Number of chemicals tested: 188

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.024

Neutral control median absolute deviation, by plate: nmad	0.006

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0.61%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 7.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1975

ATG_trERa_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for turtle estrogen receptor (ERa)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_trERa_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_trERa_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the turtle estrogen receptor. [GeneSymbokESRl | GenelD:
1019335331 Uniprot_SwissProt_Accession:B6ElV9],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.3
Response cutoff threshold used to determine hit calls: 0.9
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
154

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	3.891

Neutral control median absolute deviation, by plate: nmad	0.937

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	19.07%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 33.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1977

ATG_hERb_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for human estrogen receptor (ERb)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_hERb_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_hERb_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the human estrogen receptor. [GeneSymbol:ESR2 | GenelD: 2100|
Uniprot_SwissProt_Accession:Q92731],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.275
Response cutoff threshold used to determine hit calls: 0.824
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
95

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.685

Neutral control median absolute deviation, by plate: nmad	0.128

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	7.61%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 31.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1979

ATG_trAR_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for turtle androgen receptor (AR)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_trAR_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_trAR_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene AR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element AR, which is responsive to the turtle androgen receptor. [GeneSymbokAR |
GenelD:101947425 | Uniprot_SwissProt_Accession:NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.206
Response cutoff threshold used to determine hit calls: 0.619
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
16

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.541

Neutral control median absolute deviation, by plate: nmad	0.146

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.67%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 33.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1981

ATG_GAL4_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for GAL4 Gene Activation (Internal
Marker)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_GAL4_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
This assay endpoint, ATG_GAL4_XSP2, was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the background control at the transcription factor-level as they
relate to the gene. Furthermore, this assay endpoint can be referred to as a secondary readout, because this
assay has produced multiple assay endpoints where this one serves a background control function. To generalize
the intended target to other relatable targets, this assay endpoint is annotated to the background measurement
intended target family, where the subfamily is baseline control.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). A trans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NR and modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.0928 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.125
Response cutoff threshold used to determine hit calls: 0.376
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 190

Number of chemicals tested: 188

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
2

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	2.267

Neutral control median absolute deviation, by plate: nmad	0.078

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	3.55%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 10.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1983

ATG_zfERl_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for zebrafish estrogen receptor (ER1)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_zfERl_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_zfERl_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the zebrafish estrogen receptor. [GeneSymbokesrl | GenelD:
2592521 Uniprot_SwissProt_Accession:P57717],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.174
Response cutoff threshold used to determine hit calls: 0.521
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
84

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.332

Neutral control median absolute deviation, by plate: nmad	0.233

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	19.68%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 34.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1985

ATG_chERa_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for chicken estrogen receptor (ERa)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_chERa_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_chERa_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the chicken estrogen receptor. [GeneSymbol:ESRl | GenelD:
3960991 Uniprot_SwissProt_Accession:P06212],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.258
Response cutoff threshold used to determine hit calls: 0.773
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
159

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	7.272

Neutral control median absolute deviation, by plate: nmad	1.42

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	17.35%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 43.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1987

ATG_zfER2a_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for zebrafish estrogen receptor
(ER2a)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_zfER2a_XSP2 is one
of 29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed
to make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_zfER2a_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the zebrafish estrogen receptor. [GeneSymbol:esr2a | GenelD:
3177341 Uniprot_SwissProt_Accession:Q7ZU32],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.823 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.189

Response cutoff threshold used to determine hit calls: 0.568

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
120

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.249

Neutral control median absolute deviation, by plate: nmad	0.173

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	14.7%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 35.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1989

ATG_hAR_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for human androgen receptor (AR)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_hAR_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_hAR_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene AR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element AR, which is responsive to the human androgen receptor. [GeneSymbol:AR | GenelD:367 |
Uniprot_SwissProt_Accession:P10275],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.226
Response cutoff threshold used to determine hit calls: 0.678
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
9

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.464

Neutral control median absolute deviation, by plate: nmad	0.101

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	6.93%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 34.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1991

ATG_chAR_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for chicken androgen receptor (AR)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_chAR_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_chAR_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene AR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element AR, which is responsive to the chicken androgen receptor. [GeneSymbokar | GenelD:4221651
Uniprot_SwissProt_Accession:Q2ACE0],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.21
Response cutoff threshold used to determine hit calls: 0.631
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
26

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.195

Neutral control median absolute deviation, by plate: nmad	0.059

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	4.97%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 20.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1993

ATG_frERl_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for frog estrogen receptor (ER)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_frERl_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_frERl_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the frog estrogen receptor. [GeneSymbol:esrl.L | GenelD: 398734
| Uniprot_SwissProt_Accession:Q6W5G7],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.252
Response cutoff threshold used to determine hit calls: 0.757
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
127

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.443

Neutral control median absolute deviation, by plate: nmad	0.169

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	10.8%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 30.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1995

ATG_frAR_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for frog androgen receptor (AR)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_frAR_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_frAR_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene AR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element AR, which is responsive to the frog androgen receptor. [GeneSymbol:ar.L | GenelD:399456|
Uniprot_SwissProt_Accession:P70048],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.191
Response cutoff threshold used to determine hit calls: 0.572
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
15

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.95

Neutral control median absolute deviation, by plate: nmad	0.069

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	7.21%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 24.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1997

ATG_zfAR_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for zebrafish androgen receptor (AR)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_zfAR_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_zfAR_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene AR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element AR, which is responsive to the zebrafish androgen receptor. [GeneSymbokAR |
GenelD:100005148 | Uniprot_SwissProt_Accession:NA],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.274
Response cutoff threshold used to determine hit calls: 0.823
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
2

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.113

Neutral control median absolute deviation, by plate: nmad	0.105

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.58%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 43.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1999

ATG_zfER2b_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for zebrafish estrogen receptor
(ER2b)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_zfER2b_XSP2 is one
of 29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed
to make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_zfER2b_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the zebrafish estrogen receptor. [GeneSymbol:esr2b | GenelD:
3177331 Uniprot_SwissProt_Accession:Q5PR29],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.823 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.173

Response cutoff threshold used to determine hit calls: 0.52

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
219

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.004

Neutral control median absolute deviation, by plate: nmad	0.105

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	12.11%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 56.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2001

ATG_hERa_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for human estrogen receptor (ERa)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_hERa_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_hERa_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the human estrogen receptor. [GeneSymbol:ESRl | GenelD: 20991
Uniprot_SwissProt_Accession:P03372],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.208
Response cutoff threshold used to determine hit calls: 0.625
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
202

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	5.993

Neutral control median absolute deviation, by plate: nmad	1.379

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	18.07%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 50.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2003

ATG_M_19_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for M32 Gene Activation (Internal
Marker)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_M_19_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
This assay endpoint, ATG_M_19_XSP2, was analyzed in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the background control at the transcription factor-level as they
relate to the gene. Furthermore, this assay endpoint can be referred to as a secondary readout, because this
assay has produced multiple assay endpoints where this one serves a background control function. To generalize
the intended target to other relatable targets, this assay endpoint is annotated to the background measurement
intended target family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-M_19, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). A trans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NR and modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.0928 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.072
Response cutoff threshold used to determine hit calls: 0.263
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 190

Number of chemicals tested: 188

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.952

Neutral control median absolute deviation, by plate: nmad	0.013

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	1.37%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 7.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2005

ATG_M_32_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for M19 Gene Activation (Internal
Marker)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_M_32_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
This assay endpoint, ATG_M_32_XSP2, was analyzed in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the background control at the transcription factor-level as they
relate to the gene. Furthermore, this assay endpoint can be referred to as a secondary readout, because this
assay has produced multiple assay endpoints where this one serves a background control function. To generalize
the intended target to other relatable targets, this assay endpoint is annotated to the background measurement
intended target family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-M_32, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). A trans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NR and modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.0928 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.06
Response cutoff threshold used to determine hit calls: 0.263
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 190

Number of chemicals tested: 188

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	3.924

Neutral control median absolute deviation, by plate: nmad	0.074

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	1.91%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 10.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2007

ATG_frER2_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for frog estrogen receptor (ER2)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_frER2_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_frER2_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the frog estrogen receptor. [GeneSymbol:esr2.L | GenelD:
100174814| Uniprot_SwissProt_Accession:A0AlL8FA50],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.171
Response cutoff threshold used to determine hit calls: 0.513
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
164

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.237

Neutral control median absolute deviation, by plate: nmad	0.153

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.44%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 40.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2009

ATG_mPPARg_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for mouse peroxisome proliferator
activated receptor gamma (PPARg)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_mPPARg_XSP2 is one
of 29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed
to make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_mPPARg_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene PPARg. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element PPARg, which is responsive to the mouse peroxisome proliferator activated receptor gamma.
[GeneSymbokPparg | GenelD: 190161 Uniprot_SwissProt_Accession:P37238],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.823 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.174

Response cutoff threshold used to determine hit calls: 0.523

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

NA

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
154

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.556

Neutral control median absolute deviation, by plate: nmad	0.285

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	16.88%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 49.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2011

ATG_zfPPARgJ(SP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for zebrafish peroxisome
proliferator activated receptor gamma (PPARg)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_zfPPARg_XSP2 is one
of 29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed
to make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_zfPPARg_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene PPARg. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element PPARg, which is responsive to the zebrafish peroxisome proliferator activated receptor
gamma. [GeneSymbol: pparg | GenelD: 5570371 Uniprot_SwissProt_Accession:A6XMH6],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.823 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.154

Response cutoff threshold used to determine hit calls: 0.461

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
22

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.086

Neutral control median absolute deviation, by plate: nmad	0.126

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	12.46%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 19.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2013

ATG_hPPARg_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for human peroxisome proliferator
activated receptor gamma (PPARg)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_hPPARg_XSP2 is one
of 29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed
to make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_hPPARg_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene PPARg. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element PPARg, which is responsive to the human peroxisome proliferator activated receptor gamma.
[GeneSymbokPPARG | GenelD:5468| Uniprot_SwissProt_Accession:P37231],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.823 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.206

Response cutoff threshold used to determine hit calls: 0.617

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

NA

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
155

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	2.007

Neutral control median absolute deviation, by plate: nmad	0.542

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	23.68%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 37.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2015

ATG_m PXR_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for mouse pregnane X receptor (PXR)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_mPXR_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_mPXR_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene PXR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element PXR, which is responsive to the mouse pregnane X receptor. [GeneSymbol: NR1IA | GenelD:
88561 Uniprot_SwissProt_Accession:075469],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.18
Response cutoff threshold used to determine hit calls: 0.541
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
146

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	3.027

Neutral control median absolute deviation, by plate: nmad	0.445

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.49%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 32.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2017

ATG_trTRa_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for turtle thyroid hormone receptor,
alpha (TRa)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_trTRa_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_trTRa_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene TR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element TRa, which is responsive to the turtle thyroid hormone receptor, alpha. [GeneSymbol: THRA
| GenelD: 1019490571 Uniprot_SwissProt_Accession: NA],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.823 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.253

Response cutoff threshold used to determine hit calls: 0.76

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
22

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.871

Neutral control median absolute deviation, by plate: nmad	0.334

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	36.66%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 21.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2019

ATG_zfTRa_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for zebrafish thyroid hormone
receptor, alpha (TRa)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_zfTRa_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_zfTRa_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene TR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element TRa, which is responsive to the zebrafish thyroid hormone receptor, alpha a. [GeneSymbol:
thraa | GenelD: 30670| Uniprot_SwissProt_Accession: Q98867],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.823 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.213

Response cutoff threshold used to determine hit calls: 0.639

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
18

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.639

Neutral control median absolute deviation, by plate: nmad	0.076

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.61%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 8.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2021

ATG_zfTRb_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP1) HepG2 Assay for zebrafish thyroid hormone
receptor, beta (TRb)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_zfTRb_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_zfTRb_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene TR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element TRa, which is responsive to the zebrafish thyroid hormone receptor, beta. [GeneSymbol: thrb
| GenelD: 306071 Uniprot_SwissProt_Accession: Q9PVE4],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.823 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.262

Response cutoff threshold used to determine hit calls: 0.785

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
15

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.568

Neutral control median absolute deviation, by plate: nmad	0.072

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	12.06%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 17.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2023

ATG_M_61_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for M61 Gene Activation (Internal
Marker)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_M_61_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
This assay endpoint, ATG_M_61_XSP2, was analyzed in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the background control at the transcription factor-level as they
relate to the gene. Furthermore, this assay endpoint can be referred to as a secondary readout, because this
assay has produced multiple assay endpoints where this one serves a background control function. To generalize
the intended target to other relatable targets, this assay endpoint is annotated to the background measurement
intended target family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-M_61, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). A trans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NR and modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

0.0928 nM
Key positive control:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

203 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.035
Response cutoff threshold used to determine hit calls: 0.263
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 190

Number of chemicals tested: 188

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.024

Neutral control median absolute deviation, by plate: nmad	0.006

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0.61%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 7.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2025

ATG_frTRa_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for frog thyroid hormone receptor,
alpha (TRa)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_frTRa_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_frTRa_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene TR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element TRa, which is responsive to the frog thyroid hormone receptor, alpha L homeolog.
[GeneSymbol: thra.L | GenelD: 3979421 Uniprot_SwissProt_Accession: P15204],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.823 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.273

Response cutoff threshold used to determine hit calls: 0.818

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

NA

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
35

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.482

Neutral control median absolute deviation, by plate: nmad	0.49

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	31.8%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 27.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2027

ATG_hTRa_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for human thyroid hormone
receptor, alpha (TRa)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_hTRa_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_hTRa_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene TR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element TRa, which is responsive to the human thyroid hormone receptor, alpha. [GeneSymbol: THRA
| GenelD: 70671 Uniprot_SwissProt_Accession: P10827],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.823 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.286

Response cutoff threshold used to determine hit calls: 0.858

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

NA

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
23

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.934

Neutral control median absolute deviation, by plate: nmad	0.905

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	44.82%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 14.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2029

ATG_hTRb_XSP2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay for human thyroid hormone
receptor, beta (TRb)

1.2	Assay Summary: ATG_XSP2_multi-species_TRANS is a cell-based, multiplexed assay created by modifying the
existing Attagene TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the
following species: human (Homo sapiens), mouse (Mus musculus), frog (Xenopus laevis), zebrafish (Danio rerio),
chicken (Gallus gallus), and turtle (Chrysemys picta). The ECOTOX-FACTORIAL format uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. Xsp2 included 3 nM 6-
alpha-fluorotestosterone and 1.5 nM norgestrel to stimulate the androgen receptor. ATG_hTRb_XSP2 is one of
29 assay components measured or calculated from the ATG_XSP2_multi-species_TRANS assay. It is designed to
make measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity
signals by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology.
The assay endpoint ATG_hTRb_XSP2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-
of-signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to
the gene TR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element TRa, which is responsive to the human thyroid hormone receptor, beta. [GeneSymbol: THRB
| GenelD: 70681 Uniprot_SwissProt_Accession: P10828],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

-------
0.823 nM
Key positive control:
NA

203 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.219

Response cutoff threshold used to determine hit calls: 0.658

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

NA

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 280

Number of chemicals tested: 191

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
33

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.677

Neutral control median absolute deviation, by plate: nmad	0.092

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.57%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 24.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2031

ATG_XTT_Cytotoxicity_XSP2

1.	General Information

1.1	Assay Title: Cytotoxicity Assessment in the Attagene TRANS-FACTORIAL Multi-Species (XSP2) HepG2 Assay

1.2	Assay Summary: ATG_XSP2_XTT_Cytotoxicity_multi-species is a cell-based, single-readout assay that uses
HepG2, a human liver cell line, with measurements taken at 24 hours after chemical dosing in a 24-well plate.
ATG_XTT_Cytotoxicity_XSP2 is an assay component measured from the ATG_XSP2_XTT_Cytotoxicity_multi-
species assay. It is designed to make measurements of cell number, a form of viability reporter, as detected with
fluorescence intensity signals by XTT cytotoxicity assay technology. Data from the assay component
ATG_XTT_Cytotoxicity_XSP2 was analyzed into 1 assay endpoint. This assay endpoint,
ATG_XTT_Cytotoxicity_XSP2, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of viability reporter, loss-of-signal activity can be used to
understand changes in the viability. Furthermore, this assay endpoint can be referred to as a primary readout,
because the performed assay has only produced 1 assay endpoint. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily
is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals produced from an enzymatic reaction involving the key
substrate [XTT reagent] are correlated to the viability of the mitochondria in the system.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that

-------
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.823 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 10.951
Response cutoff threshold used to determine hit calls: 32.853
Detection technology used: XTT cytotoxicity assay (Fluorescence)

-------
2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

17: sublOO (Center data around zero by subtracting the corrected response value (cval) from 100; 100 -
cval. Typically used if data was pre-normalized around 100 with responses decreasing to 0.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.), 6: resp.multnegl (Multiply the
normalized response value (resp) by -1; -l*resp.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 27:
ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis
direction. Typically used for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 90	Number of chemicals tested: 90

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
12

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 11.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Simha A, Bone A, Doering JA, Vliet SMF, LaLone C, Medvedev A, Makarov S. Evaluation
of a multiplexed, multispecies nuclear receptor assay for chemical hazard assessment. Toxicol In Vitro. 2021
Apr;72:105016. doi: 10.1016/j.tiv.2020.105016. Epub 2020 Oct 10. PMID: 33049310., Medvedev AV,
Medvedeva LA, Martsen E, Moeser M, Gorman KL, Lin B, Blackwell B, Villeneuve DL, Houck KA, Crofton KM,
Makarov SS. Harmonized Cross-Species Assessment of Endocrine and Metabolic Disruptors by Ecotox
FACTORIAL Assay. Environ Sci Technol. 2020 Oct 6;54(19):12142-12153. doi: 10.1021/acs.est.0c03375. Epub
2020 Sep 23. PMID: 32901485.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3099

ATG_frG R_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2HepG2 Assay for frog glucocorticoid receptor (GR)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_frGR_EcoTox2 is one
of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_frGR_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene for
glucocorticoid receptor. Furthermore, this assay endpoint can be referred to as a primary readout, because this
assay has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element frGR, also known as african clawed frog (Xenopus laevis) glucocorticoid receptor. [NCBI
Reference Sequence: NP_001081531.1],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
Dexamethasone	DMSO

Baseline median absolute deviation for the assay (bmad): 0.164
Response cutoff threshold used to determine hit calls: 0.493
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
3

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.849

Neutral control median absolute deviation, by plate: nmad	0.078

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.17%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3101

ATG_rtG R_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for rainbow trout glucocorticoid receptor (GR)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_rtGR_EcoTox2 is one
of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_rtGR_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene for
glucocorticoid receptor. Furthermore, this assay endpoint can be referred to as a primary readout, because this
assay has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element rtGR, also known as rainbow trout (Oncorhynchus mykiss) glucocorticoid receptor. [NCBI
Reference Sequence: NP_001118202.1],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
Dexamethasone	DMSO

Baseline median absolute deviation for the assay (bmad): 0.184
Response cutoff threshold used to determine hit calls: 0.552
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.474

Neutral control median absolute deviation, by plate: nmad	0.052

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	10.94%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3103

ATG J mG R_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for Japanese medaka glucocorticoid receptor
(GR)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_jmGR_EcoTox2 is one
of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATGJmGR_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene for
glucocorticoid receptor. Furthermore, this assay endpoint can be referred to as a primary readout, because this
assay has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element mGR, also known as Japanese medaka (Oryzias latipes) glucocorticoid receptor. [NCBI
Reference Sequence: NNP_001292330.1],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
Dexamethasone	DMSO

Baseline median absolute deviation for the assay (bmad): 0.164
Response cutoff threshold used to determine hit calls: 0.492
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.733

Neutral control median absolute deviation, by plate: nmad	0.128

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	17.39%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3105

ATG_zfGR_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for zebrafish glucocorticoid receptor

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_zfGR_EcoTox2 is one
of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_zfGR_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene for
glucocorticoid receptor. Furthermore, this assay endpoint can be referred to as a primary readout, because this
assay has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element zfGR, also known as zebrafish (Danio rerio) glucocorticoid receptor. [NCBI Reference
Sequence: NP_001018547.2],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
Dexamethasone	DMSO

Baseline median absolute deviation for the assay (bmad): 0.198
Response cutoff threshold used to determine hit calls: 0.595
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.599

Neutral control median absolute deviation, by plate: nmad	0.044

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	7.3%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3107

ATG_hPPARa_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for human peroxisome proliferator activated
receptor alpha (PPARa)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_hPPARa_EcoTox2 is
one of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_hPPARa_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene PPARa.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element hPPARg, which is responsive to the human peroxisome proliferator activated receptor alpha.
NCBI Reference Sequence: NP_005027.2]

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
GW7647	DMSO

Baseline median absolute deviation for the assay (bmad): 0.271
Response cutoff threshold used to determine hit calls: 0.813
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
10

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	2.454

Neutral control median absolute deviation, by plate: nmad	0.385

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	15.68%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3109

ATG_frPPARa_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for frog peroxisome proliferator activated
receptor alpha (PPARa)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_frPPARa_EcoTox2 is
one of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_frPPARa_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene PPARa.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element frPPARg, which is responsive to the african clawed frog (Xenopus laevis) peroxisome
proliferator activated receptor alpha. NCBI Reference Sequence: NP_001083282.1]

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
GW7647	DMSO

Baseline median absolute deviation for the assay (bmad): 0.268
Response cutoff threshold used to determine hit calls: 0.803
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
7

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.734

Neutral control median absolute deviation, by plate: nmad	0.068

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.29%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3111

ATG_rtPPARa_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for rainbow trout peroxisome proliferator
activated receptor alpha (PPARa)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_rtPPARa_EcoTox2 is
one of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_frPPARa_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene PPARa.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element rtPPARg, which is responsive to the rainbow trout (Oncorhynchus mykiss) peroxisome
proliferator activated receptor alpha. NCBI Reference Sequence: XP_021473593.1]

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
GW7647	DMSO

Baseline median absolute deviation for the assay (bmad): 0.155
Response cutoff threshold used to determine hit calls: 0.466
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
8

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.079

Neutral control median absolute deviation, by plate: nmad	0.104

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.62%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3113

ATG J m PPARa_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for Japanese medaka peroxisome proliferator
activated receptor alpha (PPARa)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATGJmPPARa_EcoTox2 is
one of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATGJmPPARa_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene PPARa.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element jmPPARg, which is responsive to the Japanese medaka (Oryzias latipes) peroxisome
proliferator activated receptor alpha. NCBI Reference Sequence: NP_001158347.1]

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
GW7647	DMSO

Baseline median absolute deviation for the assay (bmad): 0.19
Response cutoff threshold used to determine hit calls: 0.57
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
7

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.669

Neutral control median absolute deviation, by plate: nmad	0.054

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	8.09%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3115

ATG_zfPPARa_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for zebrafish peroxisome proliferator activated
receptor alpha (PPARa)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_zfPPARa_EcoTox2 is
one of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_zfPPARa_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene PPARa.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element zfPPARg, which is responsive to the zebrafish (Danio rerio) peroxisome proliferator activated
receptor alpha. NCBI Reference Sequence: NP_001154805.1]

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
GW7647	DMSO

Baseline median absolute deviation for the assay (bmad): 0.205
Response cutoff threshold used to determine hit calls: 0.616
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
6

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.632

Neutral control median absolute deviation, by plate: nmad	0.09

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	14.31%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 3.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3117

ATG_hPPARg_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for human peroxisome proliferator activated
receptor gamma (PPARg)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_hPPARg_EcoTox2 is
one of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_hPPARg_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene PPARg.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element hPPARg, which is responsive to the human peroxisome proliferator activated receptor
gamma. [NCBI Reference Sequence: NP_001341595.1]

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
Rosiglitazone	DMSO

Baseline median absolute deviation for the assay (bmad): 0.29
Response cutoff threshold used to determine hit calls: 0.869
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
17

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.274

Neutral control median absolute deviation, by plate: nmad	0.307

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	24.08%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3119

ATG_frPPARg_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for frog peroxisome proliferator activated
receptor gamma (PPARg)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_frPPARg_EcoTox2 is
one of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_frPPARg_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene PPARg.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element frPPARg, which is responsive to the african clawed frog (Xenopus laevis) peroxisome
proliferator activated receptor gamma. [NCBI Reference Sequence: NP_001081312.1]

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.125
Response cutoff threshold used to determine hit calls: 0.374
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.752

Neutral control median absolute deviation, by plate: nmad	0.047

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	6.21%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3121

ATG_rt P PARg_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for rainbow trout peroxisome proliferator
activated receptor gamma (PPARg)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_rtPPARg_EcoTox2 is
one of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_rtPPARg_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene PPARg.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element rtPPARg, which is responsive to the rainbow trout (Oncorhynchus mykiss) peroxisome
proliferator activated receptor gamma. [NCBI Reference Sequence: XP_021470054.1]

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.131
Response cutoff threshold used to determine hit calls: 0.393
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.849

Neutral control median absolute deviation, by plate: nmad	0.062

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	7.33%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3123

ATG J mPPARg_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for Japanese medaka peroxisome proliferator
activated receptor gamma (PPARg)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_jmPPARg_EcoTox2 is
one of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATGJmPPARg_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene PPARg.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element jmPPARg, which is responsive to the Japanese medaka (Oryzias latipes) peroxisome
proliferator activated receptor gamma. [NCBI Reference Sequence: NP_001158348.1]

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.189
Response cutoff threshold used to determine hit calls: 0.566
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
8

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.537

Neutral control median absolute deviation, by plate: nmad	0.076

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	14.22%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3125

ATG_zfPPARgJcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for zebrafish peroxisome proliferator activated
receptor gamma (PPARg)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_zfPPARg_EcoTox2 is
one of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_zfPPARg_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene PPARg.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element PPARg, which is responsive to the zebrafish (Danio rerio) peroxisome proliferator activated
receptor gamma. [GeneSymbol: pparg | GenelD: 5570371 Uniprot_SwissProt_Accession:A6XMH6 | NCBI
Reference Sequence: NP_571542.1],

-------
Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

-------
0.823 nM
Key positive control:

Triphenyl phosphate

200 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.15

Response cutoff threshold used to determine hit calls: 0.451

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
3

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.702

Neutral control median absolute deviation, by plate: nmad	0.042

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.91%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3127

ATG_hRXRb_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for human retinoid X receptor, beta (RXRb)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_hRXRb_EcoTox2 is one
of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_hRXRb_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene RXRb.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element hRXRb, also known as human retinoid X receptor, beta [GeneSymbol:RXRB | GenelD:6257 |
Uniprot_SwissProt_Accession:P28702 |NCBI Reference Sequence: NP_068811.1],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
Bexarotene	DMSO

Baseline median absolute deviation for the assay (bmad): 0.216
Response cutoff threshold used to determine hit calls: 0.647
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
7

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.401

Neutral control median absolute deviation, by plate: nmad	0.191

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.65%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3129

ATG_frRXRb_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for frog retinoid X receptor, beta (RXRb)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_frRXRb_EcoTox2 is
one of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_frRXRb_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene RXRb.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element frRXRb, also known as african clawed frog (Xenopus laevis) retinoid X receptor, beta [NCBI
Reference Sequence: NP_001080936.1],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
Bexarotene	DMSO

Baseline median absolute deviation for the assay (bmad): 0.238
Response cutoff threshold used to determine hit calls: 0.714
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
5

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.162

Neutral control median absolute deviation, by plate: nmad	0.18

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	15.51%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3131

ATG_rtRXRb_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for rainbow trout retinoid X receptor, beta
(RXRb)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_rtRXRb_EcoTox2 is
one of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_rtRXRb_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene RXRb.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element rtRXRb, also known as rainbow trout (Oncorhynchus mykiss) retinoid X receptor, beta [NCBI
Reference Sequence: XP_021427688.1],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
Bexarotene	DMSO

Baseline median absolute deviation for the assay (bmad): 0.166
Response cutoff threshold used to determine hit calls: 0.499
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
2

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.766

Neutral control median absolute deviation, by plate: nmad	0.11

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	14.31%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3133

ATG J m RXRb_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for Japanese medaka retinoid X receptor, beta
(RXRb)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATGJmRXRb_EcoTox2 is
one of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATGJmRXRb_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene RXRb.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element jmRXRb, also known as Japanese medaka (Oryzias latipes) retinoid X receptor, beta [NCBI
Reference Sequence: XP_020562828.1],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
Bexarotene	DMSO

Baseline median absolute deviation for the assay (bmad): 0.209
Response cutoff threshold used to determine hit calls: 0.628
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
7

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.506

Neutral control median absolute deviation, by plate: nmad	0.175

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.62%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3135

ATG_zfRXRb_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for zebrafish retinoid X receptor, beta (RXRb)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_zfRXRb_EcoTox2 is
one of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse
transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_zfRXRb_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene RXRb.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element zfRXRb, also known as zebrafish (Danio rerio) retinoid X receptor, beta [NCBI Reference
Sequence: NP_571350.1 ].

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
Bexarotene	DMSO

Baseline median absolute deviation for the assay (bmad): 0.215
Response cutoff threshold used to determine hit calls: 0.645
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
2

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.661

Neutral control median absolute deviation, by plate: nmad	0.034

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.16%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3137

ATG_h ERa_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for human estrogen receptor

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_hERa_EcoTox2 is one
of 29 assay components measured or calculated from the ATG_EcoTox2 assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with fluorescence intensity signals
by Reverse transcription polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The
assay endpoint ATG_hERa_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-
signal activity can be used to understand the reporter gene at the transcription factor-level as they relate to the
gene ER. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a reporter gene function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element hER, which is responsive to the human estrogen receptor. [GeneSymbol:ESRl | GenelD:
20991 Uniprot_SwissProt_Accession:P03372 | NCBI Reference Sequence: NP_000116.2],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
Estradiol	DMSO

Baseline median absolute deviation for the assay (bmad): 0.177
Response cutoff threshold used to determine hit calls: 0.53
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
8

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.77

Neutral control median absolute deviation, by plate: nmad	0.126

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	16.36%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3139

ATG_zfERl_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for zebrafish estrogen receptor

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_zfERl_EcoTox2 is one
of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_zfERl_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene ER.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element zfER, which is responsive to the zebrafish (Danio rerio) estrogen receptor. [GeneSymbol:esrl
| GenelD: 2592521 Uniprot_SwissProt_Accession:P57717 | NCBI Reference Sequence: NP_694491.1],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
Estradiol	DMSO

Baseline median absolute deviation for the assay (bmad): 0.208
Response cutoff threshold used to determine hit calls: 0.623
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.846

Neutral control median absolute deviation, by plate: nmad	0.128

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	15.16%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3141

ATG_frERl_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for frog estrogen receptor

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_frERl_EcoTox2 is one
of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_frERl_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene ER.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element ER, which is responsive to the african clawed frog (Xenopus laevis) estrogen receptor.
[GeneSymbokesrl.L | GenelD: 398734 | Uniprot_SwissProt_Accession:Q6W5G7 | NCBI Reference Sequence:
N P_001083086.1],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
Estradiol	DMSO

Baseline median absolute deviation for the assay (bmad): 0.139
Response cutoff threshold used to determine hit calls: 0.417
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
2

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.706

Neutral control median absolute deviation, by plate: nmad	0.105

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	14.79%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3143

ATG_hAR_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for human androgen receptor (AR)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_hAR_EcoTox2 is one
of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_hAR_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene AR.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element AR, which is responsive to the human androgen receptor. [GeneSymbol:AR | GenelD:367 |
Uniprot_SwissProt_Accession:P10275 | NCBI Reference Sequence: NP_000035.2],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
11-keto Testosterone	DMSO

Baseline median absolute deviation for the assay (bmad): 0.202
Response cutoff threshold used to determine hit calls: 0.605
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.823

Neutral control median absolute deviation, by plate: nmad	0.081

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.82%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3145

ATG_zfAR_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for zebrafish androgen receptor (AR)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_zfAR_EcoTox2 is one
of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_zfAR_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene AR.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element AR, which is responsive to the zebrafish (Danio rerio) androgen receptor. [GeneSymbokAR |
GeneID: 100005148 | NCBI Reference Sequence: NP_001076592.1],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
11-keto Testosterone	DMSO

Baseline median absolute deviation for the assay (bmad): 0.208
Response cutoff threshold used to determine hit calls: 0.625
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Non-mammalian Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.929

Neutral control median absolute deviation, by plate: nmad	0.11

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.9%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3147

ATG_M_61_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for M61 Gene Activation (Internal Marker)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_M_61_EcoTox2 is one
of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_M_61_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the background control at the transcription factor-level as they relate to the gene.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a background control function. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the background measurement intended target
family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-M_61, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.05

-------
Response cutoff threshold used to determine hit calls: 0.263

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 19

Number of chemicals tested: 19

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.998

Neutral control median absolute deviation, by plate: nmad	0.036

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	3.64%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3149

ATG_M_06_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for M06 Gene Activation (Internal Marker)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_M_06_EcoTox2 is one
of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_M_06_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the background control at the transcription factor-level as they relate to the gene.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a background control function. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the background measurement intended target
family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-M_06, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.05

-------
Response cutoff threshold used to determine hit calls: 0.263

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 19

Number of chemicals tested: 19

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.998

Neutral control median absolute deviation, by plate: nmad	0.036

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	3.64%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3151

ATG_M_32_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for M32 Gene Activation (Internal Marker)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_M_32_EcoTox2 is one
of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_M_32_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the background control at the transcription factor-level as they relate to the gene.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a background control function. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the background measurement intended target
family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-M_32, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.072

-------
Response cutoff threshold used to determine hit calls: 0.263

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 19

Number of chemicals tested: 19

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	4.163

Neutral control median absolute deviation, by plate: nmad	0.408

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.81%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3153

ATG_M_19_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for M19 Gene Activation (Internal Marker)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_M_19_EcoTox2 is one
of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_M_19_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the background control at the transcription factor-level as they relate to the gene.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a background control function. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the background measurement intended target
family, where the subfamily is internal marker.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the transfected trans-acting
reporter gene and exogenous transcription factor GAL4-M_19, which is used as an internal marker.

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

-------
2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

NA

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.088

-------
Response cutoff threshold used to determine hit calls: 0.264

Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 19

Number of chemicals tested: 19

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.004

Neutral control median absolute deviation, by plate: nmad	0.073

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	7.24%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3155

ATG_hGR_EcoTox2

1.	General Information

1.1	Assay Title: Attagene TRANS-FACTORIAL EcoTox-2 HepG2 Assay for human glucocorticoid receptor (GR)

1.2	Assay Summary: ATG_EcoTox2 is a cell-based, multiplexed assay created by modifying the existing Attagene
TRANS-FACTORIAL system to include a panel of nuclear receptors from some or all the following species: human
(Homo sapiens), african clawed frog (Xenopus laevis), rainbow trout (Oncorhynchus mykiss), Japanese medaka
(Oryzias latipes), and zebrafish (Danio rerio). The ECOTOX2-FACTORIAL format uses HepG2, a human liver cell
line, with measurements taken at 24 hours after chemical dosing in a 24-well plate. ATG_hGR_EcoTox2 is one
of 29 assay components calculated in the ATG_EcoTox2 assay. It is designed to make measurements of mRNA
induction, a form of inducible reporter, as detected with fluorescence intensity signals by Reverse transcription
polymerase chain reaction (RT-PCR) and Capillary electrophoresis technology. The assay endpoint
ATG_hGR_EcoTox2 was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, measures of mRNA for gain-of-signal activity
can be used to understand the reporter gene at the transcription factor-level as they relate to the gene for
glucocorticoid receptor. Furthermore, this assay endpoint can be referred to as a primary readout, because this
assay has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Attagene Inc. is a Contract Research Organization (CRO) offering a unique screening service using
its proprietary multiplexed pathway profiling platform, the FACTORIAL.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: FACTORIAL is a novel pathway profiling technology trademarked and patented by
Attagene, Inc.

1.9	Assay Throughput: 24-well plate. Transfected HepG2 cells are aliquoted into 24-well microtiter plates and
incubated with test compounds for 24 hours prior to PCR detection of total RNA transcription using capillary
electrophoresis.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals are indicative of inducible changes in transcription factor
activity. This is quantified by the level of mRNA reporter sequence unique to the trans-acting reporter gene
response element hGR, also known as human glucocorticoid receptor isoform alpha. [NCBI Reference Sequence:
NP_000167.1],

Cellular adaptive response to environmental triggers is often mediated by an intracellular network of regulatory
pathways that modulate gene expression. The signaling pathways interact with DNA by using transcription

-------
factors (TFs), or proteins that bind specific sequences on target genes. Assessing transcription factor activity can
help characterize the functional status and impact of chemical exposures for expression of genes of interest.

2.2	Scientific Principles: Trans-FACTORIAL is an embodiment of the FACTORIAL platform that is designed for
assessing agonist/antagonist properties of compounds across multiple NRs. The trans- FACTORIAL comprises a
library of one-hybrid reporter constructs (trans-RTUs). Atrans-RTU expresses a chimera GAL4-NR protein that
regulates transcription of a reporter sequence. The presence of agonists/antagonists of NR alters the
transactivation function of Gal4-NRand modulates reporter transcription.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice. This cell line has been cloned and transfected with a library of multiple reporter
transcription units.

2.4	Metabolic Competence: The HepG2 cells used in this assay are variant HG19, a cell line selected for enhanced
xenobiotic metabolism. These cells express 2 to 13 times more cytochrome P450 activity than parental HepG2.
The parental HepG2 cell line has been shown by others to retain the potential for Phase I and Phase II metabolic
responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A, 2E1, and 3A4/5 with
CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human hepatocytes
although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have even been
observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2 cells include
SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7). In addition,
HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53 protein (Boehme
et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant response element
(ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a number of ATP-binding
cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound proteins also regulated
in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Human liver HepG2 cells are transiently transfected with multiple reporter transcription units
(MRTUs) in 6-well plates using FuGene 6 reagent (Roche; 3 ml FuGene/1 mg DNA). The MRTU constructs are
regulated by a cis-regulating element (promoter). Each RTU expresses a GAL4-UAS that regulates the
transcription of a nearby target reporter gene sequence. A major difference between the CIS and TRANS system
is that in CIS activities of endogenous TFs are measured, whereas the TRANS assay evaluates changes in activities
of exogenous, chimeric NR-Gal4 proteins. Since the HepG2 cell line does not express some nuclear receptors,
the CIS assay cannot be used to evaluate these targets. The transfected cells are pooled, plated onto a 96-well
plates, and exposed to evaluated compound. At the end of 24 hour incubation, total RNA was isolated using
TriZol reagent (Invitrogen). The isolated RNA is then reverse-transcribed using oligo(dT) primer and Mo-MLV
reverse transcriptase (Invitrogen) with DNAse I (Ambion) treatment for 30 min. The one-tenth of the produced
cDNA was amplified by PCR using Taq DNA polymerase (Invitrogen) and two reporter sequence-specific primers.
The PCR products were fluorescently labeled by primer extension with 6-arboxyfluorescein (6-FAM) 5'-labeled
reporter sequence-specific primer (2 min at 95 C, 20 s at 68 C and 10 min at 72 C) and these products were
digested with 5U of Hpal (New England Biolabs) for 2h at 37 C. The fragments were purified using Qiaquick PCR
columns (Qiagen), analyzed on an ABI 3130xL genetic analyzer (Applied Biosystems) with peak positions
identified by using a set of X-rhodamine (ROX)-labeled MapMarkerlOOO molecular weight standards
(BioVentures). The raw capillary electrophoresis data was processed using Attagraph software (Attagene).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.823 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
Dexamethasone	DMSO

Baseline median absolute deviation for the assay (bmad): 0.128
Response cutoff threshold used to determine hit calls: 0.385
Detection technology used: RT-PCR and Capillary electrophoresis (Fluorescence)

2.6	Response: Increased transcription activity is measured by increased fluorescent intensity, specifically the
increased production of mRNA transcripts production in response to active transcription following transcription
factor (TF) interaction with promoter sequences as measured by reverse transcription-polymerase chain
reaction (RT-PCR) and capillary electrophoretic detection of fluorescently labeled mRNA.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using log2 fold-
induction over DMSO controls which provide a baseline signal. Negative and zero values are removed before
analysis and raw values are log transformed. All statistical analyses were conducted using R programming
language, employing tcpl package to generate model parameters and confidence intervals. Each chemical
concentration series was fit and the model which produces the lowest Akaike Information Criterion (AIC) value
is considered the winning model.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 3: log2_1.2 (Add a cutoff value of log2(1.2). Typically for fold change data.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 19

Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
3

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.994

Neutral control median absolute deviation, by plate: nmad	0.095

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.55%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Blackwell, B. R., Ankley, G. T., Bradley, P. M., Houck, K. A., Makarov, S. S., Medvedev, A. V.,
Swintek, J., & Villeneuve, D. L. (2019). Potential Toxicity of Complex Mixtures in Surface Waters from a
Nationwide Survey of United States Streams: Identifying in Vitro Bioactivities and Causative Chemicals.
Environmental science & technology, 53(2), 973-983. https://doi.org/10.1021/acs.est.8b05304

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 146

BSK_3C_Eselectin

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: 3C Assay for Eselectin Biomarker Activity

1.2	Assay Summary: BSK_3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_3C_Eselectin is an assay component measured in the BSK_3C assay. It measures E-selectin antibody related
to regulation of gene expression using ELISA technology. Data from the assay component BSK_3C_Eselectin was
analyzed into 1 assay endpoint. This assay endpoint, BSK_3C_Eselectin, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand changes in the signaling as they relate to the gene SELE.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell adhesion molecules intended target family, where
the subfamily is selectins.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: E-selectin antibody is used to tag and quantify the level of selectin E protein. Changes in the signals
are indicative of protein expression changes when conditioned to simulate proinflammation from cytokines
[GeneSymbokSELE | GenelD:6401 | Uniprot_SwissProt_Accession:P16581],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.023
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Thl Vasculature (3C) system models vascular inflammation of the Thl type, an environment that
promotes monocyte and T cell adhesion and recruitment and is anti-angiogenic. This system is relevant for
chronic inflammatory diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 150.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 148

BSK_3C_HLADR

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: 3C Assay for Human Leukocyte Antigen - DR isotype (HLADR) Biomarker
Activity

1.2	Assay Summary: BSK_3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_3C_HLADR is an assay component measured in the BSK_3C assay. It measures HLA-DR antibody related to
regulation of gene expression using ELISA technology. Data from the assay component BSK_3C_HLADR was
analyzed into 1 assay endpoint. This assay endpoint, BSK_3C_HLADR, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand changes in the signaling as they relate to the gene HLA-DRA.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell adhesion molecules intended target family, where
the subfamily is MHC Class II.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: HLA-DR antibody is used to tag and quantify the level of major histocompatibility complex, class II,
DR alpha protein. Changes in the signals are indicative of protein expression changes when conditioned to
simulate proinflammation from cytokines [GeneSymbol:HLA-DRA | GenelD:3122 |
Uniprot_SwissProt_Accession:P01903].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.026
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Thl Vasculature (3C) system models vascular inflammation oftheThltype, an environment that
promotes monocyte and T cell adhesion and recruitment and is anti-angiogenic. This system is relevant for
chronic inflammatory diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
661

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 163.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 150

BSK_3C_ICAM1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: 3C Assay for Intercellular Adhesion Molecule 1 (ICAM1) Biomarker Activity

1.2	Assay Summary: BSK_3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_3C_ICAM1 is an assay component measured in the BSK_3C assay. It measures ICAM-1 antibody related to
regulation of gene expression using ELISA technology. Data from the assay component BSK_3C_ICAM1 was
analyzed into 1 assay endpoint. This assay endpoint, BSK_3C_ICAM1, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand changes in the signaling as they relate to the gene ICAM1.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell adhesion molecules intended target family, where
the subfamily is Immunoglobulin CAM.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: ICAM-1 antibody is used to tag and quantify the level of intercellular adhesion molecule 1 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymboklCAMl | GenelD:3383 | Uniprot_SwissProt_Accession:P05362].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.017
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Thl Vasculature (3C) system models vascular inflammation oftheThltype, an environment that
promotes monocyte and T cell adhesion and recruitment and is anti-angiogenic. This system is relevant for
chronic inflammatory diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 110.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 152

BSK_3C_IL8

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: 3C Assay for lnterleukin-8 (IL8) Biomarker Activity

1.2	Assay Summary: BSK_3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_3C_IL8 is an assay component measured in the BSK_3C assay. It measures IL-8 antibody related to
regulation of gene expression using ELISA technology. Data from the assay component BSK_3C_IL8 was analyzed
into 1 assay endpoint. This assay endpoint, BSK_3C_IL8, was analyzed with bidirectional fitting relative to DMSO
as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity
can be used to understand changes in the signaling as they relate to the gene CXCL8. Furthermore, this assay
endpoint can be referred to as a primary readout, because this assay has produced multiple assay endpoints
where this one serves a signaling function. To generalize the intended target to other relatable targets, this
assay endpoint is annotated to the cytokine intended target family, where the subfamily is interleukins.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: IL-8 antibody is used to tag and quantify the level of chemokine (C-X-C motif) ligand 8 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbol:CXCL8 | GenelD:3576 | Uniprot_SwissProt_Accession:P10145].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.033
Response cutoff threshold used to determine hit calls: 0.099
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Thl Vasculature (3C) system models vascular inflammation of the Thl type, an environment that
promotes monocyte and T cell adhesion and recruitment and is anti-angiogenic. This system is relevant for
chronic inflammatory diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 138.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 154

BSK_3C_MCP1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: 3C Assay for Monocyte Chemoattractant Protein-1 (MCP1) Biomarker Activity

1.2	Assay Summary: BSK_3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_3C_MCP1 is an assay component measured in the BSK_3C assay. It measures MCP-1 antibody related to
regulation of gene expression using ELISA technology. Data from the assay component BSK_3C_MCP1 was
analyzed into 1 assay endpoint. This assay endpoint, BSK_3C_MCP1, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand changes in the signaling as they relate to the gene CCL2.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cytokine intended target family, where the subfamily
is chemotactic factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: MCP-1 antibody is used to tag and quantify the level of chemokine (C-C motif) ligand 2 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbol:CCL2 | GenelD:6347 | Uniprot_SwissProt_Accession:P13500].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.028
Response cutoff threshold used to determine hit calls: 0.085
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Thl Vasculature (3C) system models vascular inflammation oftheThltype, an environment that
promotes monocyte and T cell adhesion and recruitment and is anti-angiogenic. This system is relevant for
chronic inflammatory diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 142.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 156

BSK_3C_MIG

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: 3C Assay for Monokine induced gamma interferon (MIG) Biomarker Activity

1.2	Assay Summary: BSK_3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_3C_MIG is an assay component measured in the BSK_3C assay. It measures MIG antibody related to
regulation of gene expression using ELISA technology. Data from the assay component BSK_3C_MIG was
analyzed into 1 assay endpoint. This assay endpoint, BSK_3C_MIG, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand changes in the signaling as they relate to the gene CXCL9.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cytokine intended target family, where the subfamily
is chemotactic factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: MIG antibody is used to tag and quantify the level of chemokine (C-X-C motif) ligand 9 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbol:CXCL9 | GenelD:4283 | Uniprot_SwissProt_Accession:Q07325],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.007
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Thl Vasculature (3C) system models vascular inflammation oftheThltype, an environment that
promotes monocyte and T cell adhesion and recruitment and is anti-angiogenic. This system is relevant for
chronic inflammatory diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 104.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 158

BSK_3C_Proliferation

1. General Information

1.1	Assay Title: Cell Proliferation Assessment in the BioMAP Diversity Plus: 3C Assay

1.2	Assay Summary: BSK_3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_3C_Proliferation is an assay component measured in the BSK_3C assay. It measures 0.1% sulforhodamine
related to cell proliferation using Sulforhodamine staining technology. Data from the assay component
BSK_3C_Proliferation was analyzed into 1 assay endpoint. This assay endpoint, BSK_3C_Proliferation, was
analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type
of viability reporter, measures of total protein for gain or loss-of-signal activity can be used to understand the
viability at the cellular-level. Furthermore, this assay endpoint can be referred to as a secondary readout,
because this assay has produced multiple assay endpoints where this one serves a viability function. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell cycle
intended target family, where the subfamily is proliferation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Proliferation in the 3C system is a measure of endothelial cell profliferation which is important to
the process of wound healing and angiogensis.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.039
Response cutoff threshold used to determine hit calls: 0.117
Detection technology used: Sulforhodamine staining (Spectrophotometry)

2.6	Response: The Thl Vasculature (3C) system models vascular inflammation of the Thl type, an environment that
promotes monocyte and T cell adhesion and recruitment and is anti-angiogenic. This system is relevant for
chronic inflammatory diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 200.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 160

BSK_3C_SRB

1. General Information

1.1	Assay Title: Viability Assessment in the BioMAP Diversity Plus: 3C Assay

1.2	Assay Summary: BSK_3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_3C_SRB is an assay component measured in the BSK_3C assay. It measures 0.1% sulforhodamine related
to cell death using Sulforhodamine staining technology. Data from the assay component BSK_3C_SRB was
analyzed into 1 assay endpoint. This assay endpoint, BSK_3C_SRB, was analyzed with bidirectional fitting relative
to DMSO as the negative control and baseline of activity. Using a type of viability reporter, gain or loss-of-signal
activity can be used to understand changes in the signaling. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
signaling function. To generalize the intended target to other relatable targets, this assay endpoint is annotated
to the cell morphology intended target family, where the subfamily is cell conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: SRB in the 3C system is a measure of the total protein content of venular endothelial cells. Cell
viability of adherent cells is measured by Sulforhodamine B (SRB) staining, a method that determines cell density
by measuring total protein content of test wells.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.022
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: Sulforhodamine staining (Spectrophotometry)

2.6	Response: The Thl Vasculature (3C) system models vascular inflammation of the Thl type, an environment that
promotes monocyte and T cell adhesion and recruitment and is anti-angiogenic. This system is relevant for
chronic inflammatory diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 156.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 162

BSK_3C_Th rombomod ulin

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: 3C Assay for Thrombomodulin Biomarker Activity

1.2	Assay Summary: BSK_3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_3C_Thrombomodulin is an assay component measured in the BSK_3C assay. It measures Thrombomodulin
antibody related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_3C_Thrombomodulin was analyzed into 1 assay endpoint. This assay endpoint, BSK_3C_Thrombomodulin,
was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using
a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as
they relate to the gene THBD. Furthermore, this assay endpoint can be referred to as a primary readout,
because this assay has produced multiple assay endpoints where this one serves a signaling function. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended
target family, where the subfamily is rhodopsin-like receptor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Thrombomodulin antibody is used to tag and quantify the level of thrombomodulin protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbokTHBD | GenelD:7056 | Uniprot_SwissProt_Accession:P07204].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.03
Response cutoff threshold used to determine hit calls: 0.09
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Thl Vasculature (3C) system models vascular inflammation of the Thl type, an environment that
promotes monocyte and T cell adhesion and recruitment and is anti-angiogenic. This system is relevant for
chronic inflammatory diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 115.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 164

BSK_3C_TissueFactor

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: 3C Assay for Tissue Factor Biomarker Activity

1.2	Assay Summary: BSK_3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_3C_TissueFactor is an assay component measured in the BSK_3C assay. It measures Tissue Factor antibody
related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_3C_TissueFactor was analyzed into 1 assay endpoint. This assay endpoint, BSK_3C_TissueFactor, was
analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type
of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they
relate to the gene F3. Furthermore, this assay endpoint can be referred to as a primary readout, because this
assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the cytokine intended target
family, where the subfamily is coagulation factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Tissue Factor antibody is used to tag and quantify the level of coagulation factor III
(thromboplastin, tissue factor) protein. Changes in the signals are indicative of protein expression changes when
conditioned to simulate proinflammation from cytokines [GeneSymbol:F3 | GenelD:2152 |
Uniprot_SwissProt_Accession:P13726],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.043
Response cutoff threshold used to determine hit calls: 0.13
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Thl Vasculature (3C) system models vascular inflammation oftheThltype, an environment that
promotes monocyte and T cell adhesion and recruitment and is anti-angiogenic. This system is relevant for
chronic inflammatory diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 167.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 166

BSK_3C_uPAR

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: 3C Assay for Urokinase-type plasminogen activator receptor (uPAR)
Biomarker Activity

1.2	Assay Summary: BSK_3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_3CAR is an assay component measured in the BSK_3C assay. It measures uPAR antibody related to
regulation of gene expression using ELISA technology. Data from the assay component BSK_3CAR was analyzed
into 1 assay endpoint. This assay endpoint, BSK_3CAR, was analyzed with bidirectional fitting relative to DMSO
as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity
can be used to understand changes in the signaling as they relate to the gene PLAUR. Furthermore, this assay
endpoint can be referred to as a primary readout, because this assay has produced multiple assay endpoints
where this one serves a signaling function. To generalize the intended target to other relatable targets, this
assay endpoint is annotated to the cytokine intended target family, where the subfamily is plasmogen activator.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: uPAR antibody is used to tag and quantify the level of plasminogen activator, urokinase receptor
protein. Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbokPLAUR | GenelD:5329 | Uniprot_SwissProt_Accession:Q03405],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.039
Response cutoff threshold used to determine hit calls: 0.117
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Thl Vasculature (3C) system models vascular inflammation oftheThltype, an environment that
promotes monocyte and T cell adhesion and recruitment and is anti-angiogenic. This system is relevant for
chronic inflammatory diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 155.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 168

BSK_3C_VCAM1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: 3C Assay for Vascular cell adhesion molecule 1 (VCAM1) Biomarker Activity

1.2	Assay Summary: BSK_3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_3C_VCAM1 is an assay component measured in the BSK_3C assay. It measures VCAM-1 antibody related
to regulation of gene expression using ELISA technology. Data from the assay component BSK_3C_VCAM1 was
analyzed into 1 assay endpoint. This assay endpoint, BSK_3C_VCAM1, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand changes in the signaling as they relate to the gene VCAM1.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell adhesion molecules intended target family, where
the subfamily is Immunoglobulin CAM.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: VCAM-1 antibody is used to tag and quantify the level of vascular cell adhesion molecule 1 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbokVCAMl | GenelD:7412 | Uniprot_SwissProt_Accession:P19320].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.019
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Thl Vasculature (3C) system models vascular inflammation oftheThltype, an environment that
promotes monocyte and T cell adhesion and recruitment and is anti-angiogenic. This system is relevant for
chronic inflammatory diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 130.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 170

BSK_3C_Vis

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: 3C Assay for Morphology Visualization

1.2	Assay Summary: BSK_3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_3C_Vis is an assay component measured in the BSK_3C assay. It measures NA related to cell morphology
using light microscopy technology. Data from the assay component BSK_3C_Vis was analyzed into 1 assay
endpoint. This assay endpoint, BSK_3C_Vis, was analyzed with bidirectional fitting relative to DMSO as the
negative control and baseline of activity. Using a type of morphology reporter, gain or loss-of-signal activity can
be used to understand changes in the background control. Furthermore, this assay endpoint can be referred to
as a secondary readout, because this assay has produced multiple assay endpoints where this one serves a
background control function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the cell morphology intended target family, where the subfamily is cell conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Visual microscropy is used to quantify changes to the morphology of the cells.

-------
The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

-------
Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

59.4 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: light microscopy (Microscopy)

2.6	Response: The Thl Vasculature (3C) system models vascular inflammation oftheThltype, an environment that
promotes monocyte and T cell adhesion and recruitment and is anti-angiogenic. This system is relevant for
chronic inflammatory diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

NA

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1540

Number of chemicals tested: 1484

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
482

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1198.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 172

BSK_4H_Eotaxin3

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: 4H Assay for Eotaxin 3 Biomarker Activity

1.2	Assay Summary: BSK_4H is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_4H_Eotaxin3 is an assay component measured in the BSK_4H assay. It measures Eotaxin-3 antibody related
to regulation of gene expression using ELISA technology. Data from the assay component BSK_4H_Eotaxin3 was
analyzed into 1 assay endpoint. This assay endpoint, BSK_4H_Eotaxin3, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand changes in the signaling as they relate to the gene CCL26.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cytokine intended target family, where the subfamily
is chemotactic factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Eotaxin-3 antibody is used to tag and quantify the level of chemokine (C-C motif) ligand 26 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from histamine and IL4 [GeneSymbol:CCL26 | GenelD:10344 |
Uniprot_SwissProt_Accession:Q9Y258],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.027
Response cutoff threshold used to determine hit calls: 0.081
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Th2 Vasculature (4H) system models vascular inflammation of the Th2 type, an environment
that promotes mast cell, basophil, eosinophil, T and B cell recruitment and is proangiogenic. This system is
relevant for diseases where Th2-type inflammatory conditions play a role such as allergy, asthma, and ulcerative
colitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
537

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 142.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 174

BSK_4H_MCP1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: 4H Assay for Monocyte Chemoattractant Protein-1 (MCP1) Biomarker
Activity

1.2	Assay Summary: BSK_4H is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_4H_MCP1 is an assay component measured in the BSK_4H assay. It measures MCP-1 antibody related to
regulation of gene expression using ELISA technology. Data from the assay component BSK_4H_MCP1 was
analyzed into 1 assay endpoint. This assay endpoint, BSK_4H_MCP1, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand changes in the signaling as they relate to the gene CCL2.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cytokine intended target family, where the subfamily
is chemotactic factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: MCP-1 antibody is used to tag and quantify the level of chemokine (C-C motif) ligand 2 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from histamine and IL4 [GeneSymbol:CCL2 | GenelD:6347 |
Uniprot_SwissProt_Accession:P13500].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.054
Response cutoff threshold used to determine hit calls: 0.161
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Th2 Vasculature (4H) system models vascular inflammation of the Th2 type, an environment
that promotes mast cell, basophil, eosinophil, T and B cell recruitment and is proangiogenic. This system is
relevant for diseases where Th2-type inflammatory conditions play a role such as allergy, asthma, and ulcerative
colitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and

-------
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 152.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 176

BSK_4H_Pselectin

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: 4H Assay for Pselectin Biomarker Activity

1.2	Assay Summary: BSK_4H is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_4H_Pselectin is an assay component measured in the BSK_4H assay. It measures P-Selectin antibody related
to regulation of gene expression using ELISA technology. Data from the assay component BSK_4H_Pselectin was
analyzed into 1 assay endpoint. This assay endpoint, BSK_4H_Pselectin, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand changes in the signaling as they relate to the gene SELP.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell adhesion molecules intended target family, where
the subfamily is selectins.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: P-Selectin antibody is used to tag and quantify the level of selectin P (granule membrane protein
140kDa, antigen CD62) protein. Changes in the signals are indicative of protein expression changes when
conditioned to simulate proinflammation from histamine and IL4 [GeneSymbokSELP | GenelD:6403 |
Uniprot_SwissProt_Accession:P16109],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.041
Response cutoff threshold used to determine hit calls: 0.123
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Th2 Vasculature (4H) system models vascular inflammation of the Th2 type, an environment
that promotes mast cell, basophil, eosinophil, T and B cell recruitment and is proangiogenic. This system is
relevant for diseases where Th2-type inflammatory conditions play a role such as allergy, asthma, and ulcerative
colitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
490

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 181.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 178

BSK_4H_SRB

1. General Information

1.1	Assay Title: Viability Assessment in the BioMAP Diversity Plus: 4H Assay

1.2	Assay Summary: BSK_4H is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_4H_SRB is an assay component measured in the BSK_4H assay. It measures 0.1% sulforhodamine related
to cell death using Sulforhodamine staining technology. Data from the assay component BSK_4H_SRB was
analyzed into 1 assay endpoint. This assay endpoint, BSK_4H_SRB, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of viability reporter, gain or loss-
of-signal activity can be used to understand changes in the signaling. Furthermore, this assay endpoint can be
referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves a signaling function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cell morphology intended target family, where the subfamily is cell conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: SRB in the 4H system is a measure of the total protein content of venular endothelial cells. Cell
viability of adherent cells is measured by Sulforhodamine B (SRB) staining, a method that determines cell density
by measuring total protein content of test wells.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.02
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: Sulforhodamine staining (Spectrophotometry)

2.6	Response: The Th2 Vasculature (4H) system models vascular inflammation of the Th2 type, an environment
that promotes mast cell, basophil, eosinophil, T and B cell recruitment and is proangiogenic. This system is
relevant for diseases where Th2-type inflammatory conditions play a role such as allergy, asthma, and ulcerative
colitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 150.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 180

BSK_4H_uPAR

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: 4H Assay for Urokinase-type plasminogen activator receptor (uPAR)
Biomarker Activity

1.2	Assay Summary: BSK_4H is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_4HAR is an assay component measured in the BSK_4H assay. It measures uPAR antibody related to
regulation of gene expression using ELISA technology. Data from the assay component BSK_4HAR was analyzed
into 1 assay endpoint. This assay endpoint, BSK_4HAR, was analyzed with bidirectional fitting relative to DMSO
as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity
can be used to understand changes in the signaling as they relate to the gene PLAUR. Furthermore, this assay
endpoint can be referred to as a primary readout, because this assay has produced multiple assay endpoints
where this one serves a signaling function. To generalize the intended target to other relatable targets, this
assay endpoint is annotated to the cytokine intended target family, where the subfamily is plasmogen activator.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: uPAR antibody is used to tag and quantify the level of plasminogen activator, urokinase receptor
protein. Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from histamine and IL4 [GeneSymbokPLAUR | GenelD:5329 |
Uniprot_SwissProt_Accession:Q03405],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.037
Response cutoff threshold used to determine hit calls: 0.111
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Th2 Vasculature (4H) system models vascular inflammation of the Th2 type, an environment
that promotes mast cell, basophil, eosinophil, T and B cell recruitment and is proangiogenic. This system is
relevant for diseases where Th2-type inflammatory conditions play a role such as allergy, asthma, and ulcerative
colitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
415

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 122.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 182

BSK_4H_VCAM 1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: 4H Assay for Vascular cell adhesion molecule 1 (VCAM1) Biomarker Activity

1.2	Assay Summary: BSK_4H is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_4H_VCAM1 is an assay component measured in the BSK_4H assay. It measures VCAM-1 antibody related
to regulation of gene expression using ELISA technology. Data from the assay component BSK_4H_VCAM1 was
analyzed into 1 assay endpoint. This assay endpoint, BSK_4H_VCAM1, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand changes in the signaling as they relate to the gene VCAM1.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell adhesion molecules intended target family, where
the subfamily is Immunoglobulin CAM.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: VCAM-1 antibody is used to tag and quantify the level of vascular cell adhesion molecule 1 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from histamine and IL4 [GeneSymbol:VCAMl | GenelD:7412 |
Uniprot_SwissProt_Accession:P19320].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.023
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Th2 Vasculature (4H) system models vascular inflammation of the Th2 type, an environment
that promotes mast cell, basophil, eosinophil, T and B cell recruitment and is proangiogenic. This system is
relevant for diseases where Th2-type inflammatory conditions play a role such as allergy, asthma, and ulcerative
colitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
508

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 140.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 184

BSK_4H_VEGFRII

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: 4H Assay for Vascular endothelial growth factor III (VEGFRII) Biomarker
Activity

1.2	Assay Summary: BSK_4H is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium, a
human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_4H_VEGFRII is an assay component measured in the BSK_4H assay. It measures VEGFRII antibody related
to regulation of gene expression using ELISA technology. Data from the assay component BSK_4H_VEGFRII was
analyzed into 1 assay endpoint. This assay endpoint, BSK_4H_VEGFRII, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand changes in the signaling as they relate to the gene KDR.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the kinase intended target family, where the subfamily is
receptor tyrosine kinase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: VEGFRII antibody is used to tag and quantify the level of kinase insert domain receptor (a type III
receptor tyrosine kinase) protein. Changes in the signals are indicative of protein expression changes when
conditioned to simulate proinflammation from histamine and IL4 [GeneSymbokKDR | GenelD:3791 |
Uniprot_SwissProt_Accession:P35968],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium primary cell used. Primary human cell types used
in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.035
Response cutoff threshold used to determine hit calls: 0.105
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Th2 Vasculature (4H) system models vascular inflammation of the Th2 type, an environment
that promotes mast cell, basophil, eosinophil, T and B cell recruitment and is proangiogenic. This system is
relevant for diseases where Th2-type inflammatory conditions play a role such as allergy, asthma, and ulcerative
colitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of kinase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and

-------
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 84.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 186

BSK_BE3C_HLADR

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BE3C Assay for Human Leukocyte Antigen - DR isotype (HLADR) Biomarker
Activity

1.2	Assay Summary: BSK_BE3C is a cell-based, multiplexed-readout assay that uses bronchial epithelial cell, a human
lung primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_BE3C_HLADR is an assay component measured in the BSK_BE3C assay. It measures HLA-DR antibody
related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_BE3C_HLADR was analyzed into 1 assay endpoint. This assay endpoint, BSK_BE3C_HLADR, was analyzed
with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of
binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they relate
to the gene HLA-DRA. Furthermore, this assay endpoint can be referred to as a primary readout, because this
assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the cell adhesion molecules
intended target family, where the subfamily is MHC Class II.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: HLA-DR antibody is used to tag and quantify the level of major histocompatibility complex, class II,
DR alpha protein. Changes in the signals are indicative of protein expression changes when conditioned to
simulate proinflammation from cytokines [GeneSymbol:HLA-DRA | GenelD:3122 |
Uniprot_SwissProt_Accession:P01903].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells primary cell used. Primary human cell types used in
BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.025
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Inflammation (BE3C) system models lung inflammation of the Thl type, an environment
that promotes monocyte and T cell adhesion and recruitment. This system is relevant for sarcoidosis and
pulmonary responses to respiratory infections.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
494

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 153.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 188

BSK_BE3C_ILla

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BE3C Assay for lnterleukin-1 alpha (ILla) Biomarker Activity

1.2	Assay Summary: BSK_BE3C is a cell-based, multiplexed-readout assay that uses bronchial epithelial cell, a human
lung primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. BSK_BE3C_ILla
is an assay component measured in the BSK_BE3C assay. It measures IL-la antibody related to regulation of
gene expression using ELISA technology. Data from the assay component BSK_BE3C_ILla was analyzed into 1
assay endpoint. This assay endpoint, BSK_BE3C_ILla, was analyzed with bidirectional fitting relative to DMSO
as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity
can be used to understand changes in the signaling as they relate to the gene ILIA. Furthermore, this assay
endpoint can be referred to as a primary readout, because this assay has produced multiple assay endpoints
where this one serves a signaling function. To generalize the intended target to other relatable targets, this
assay endpoint is annotated to the cytokine intended target family, where the subfamily is interleukins.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: IL-la antibody is used to tag and quantify the level of interleukin 1, alpha protein. Changes in the
signals are indicative of protein expression changes when conditioned to simulate proinflammation from
cytokines [GeneSymboklLlA | GenelD:3552 | Uniprot_SwissProt_Accession:P01583].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells primary cell used. Primary human cell types used in
BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.027
Response cutoff threshold used to determine hit calls: 0.08
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Inflammation (BE3C) system models lung inflammation of the Thl type, an environment
that promotes monocyte and T cell adhesion and recruitment. This system is relevant for sarcoidosis and
pulmonary responses to respiratory infections.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 112.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 190

BSK_BE3C_IP10

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BE3C Assay for Interferon-gamma inducible Protein 10 (IP10) Biomarker
Activity

1.2	Assay Summary: BSK_BE3C is a cell-based, multiplexed-readout assay that uses bronchial epithelial cell, a human
lung primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. BSK_BE3C_IP10
is an assay component measured in the BSK_BE3C assay. It measures IP-10 antibody related to regulation of
gene expression using ELISA technology. Data from the assay component BSK_BE3C_IP10 was analyzed into 1
assay endpoint. This assay endpoint, BSK_BE3C_IP10, was analyzed with bidirectional fitting relative to DMSO
as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity
can be used to understand changes in the signaling as they relate to the gene CXCL10. Furthermore, this assay
endpoint can be referred to as a primary readout, because this assay has produced multiple assay endpoints
where this one serves a signaling function. To generalize the intended target to other relatable targets, this
assay endpoint is annotated to the cytokine intended target family, where the subfamily is chemotactic factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: IP-10 antibody is used to tag and quantify the level of chemokine (C-X-C motif) ligand 10 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbol:CXCL10 | GenelD:3627 | Uniprot_SwissProt_Accession:P02778],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells primary cell used. Primary human cell types used in
BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.028
Response cutoff threshold used to determine hit calls: 0.084
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Inflammation (BE3C) system models lung inflammation of the Thl type, an environment
that promotes monocyte and T cell adhesion and recruitment. This system is relevant for sarcoidosis and
pulmonary responses to respiratory infections.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 170.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 192

BSK_BE3C_MIG

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BE3C Assay for Monokine induced gamma interferon (MIG) Biomarker
Activity

1.2	Assay Summary: BSK_BE3C is a cell-based, multiplexed-readout assay that uses bronchial epithelial cell, a human
lung primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. BSK_BE3C_MIG
is an assay component measured in the BSK_BE3C assay. It measures MIG antibody related to regulation of gene
expression using ELISA technology. Data from the assay component BSK_BE3C_MIG was analyzed into 1 assay
endpoint. This assay endpoint, BSK_BE3C_MIG, was analyzed with bidirectional fitting relative to DMSO as the
negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be
used to understand changes in the signaling as they relate to the gene CXCL9. Furthermore, this assay endpoint
can be referred to as a primary readout, because this assay has produced multiple assay endpoints where this
one serves a signaling function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the cytokine intended target family, where the subfamily is chemotactic factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: MIG antibody is used to tag and quantify the level of chemokine (C-X-C motif) ligand 9 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbol:CXCL9 | GenelD:4283 | Uniprot_SwissProt_Accession:Q07325],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells primary cell used. Primary human cell types used in
BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.008
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Inflammation (BE3C) system models lung inflammation of the Thl type, an environment
that promotes monocyte and T cell adhesion and recruitment. This system is relevant for sarcoidosis and
pulmonary responses to respiratory infections.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 94.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 194

BSK_BE3C_MMP1

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: BE3C Assay for Matrix Metallopeptidase 1 (MMP1) Biomarker Activity

1.2	Assay Summary: BSK_BE3C is a cell-based, multiplexed-readout assay that uses bronchial epithelial cell, a human
lung primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_BE3C_MMP1 is an assay component measured in the BSK_BE3C assay. It measures MMP-1 antibody related
to regulation of gene expression using ELISA technology. Data from the assay component BSK_BE3C_MMP1 was
analyzed into 1 assay endpoint. This assay endpoint, BSK_BE3C_MMP1, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand changes in the signaling as they relate to the gene MMP1.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the protease intended target family, where the subfamily
is matrix metalloproteinase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

-------
2.1	Purpose: MMP-1 antibody is used to tag and quantify the level of matrix metallopeptidase 1 (interstitial
collagenase) protein. Changes in the signals are indicative of protein expression changes when conditioned to
simulate proinflammation from cytokines [GeneSymbol:MMPl | GenelD:4312 |
Uniprot_SwissProt_Accession:P03956],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells primary cell used. Primary human cell types used in
BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.031
Response cutoff threshold used to determine hit calls: 0.092
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Inflammation (BE3C) system models lung inflammation of the Thl type, an environment
that promotes monocyte and T cell adhesion and recruitment. This system is relevant for sarcoidosis and
pulmonary responses to respiratory infections.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 165.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 196

BSK_BE3C_PAI1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BE3C Assay for Plasminogen activator inhibitor-1 (PAI1) Biomarker Activity

1.2	Assay Summary: BSK_BE3C is a cell-based, multiplexed-readout assay that uses bronchial epithelial cell, a human
lung primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. BSK_BE3C_PAI1
is an assay component measured in the BSK_BE3C assay. It measures PAI-1 antibody related to regulation of
gene expression using ELISA technology. Data from the assay component BSK_BE3C_PAI1 was analyzed into 1
assay endpoint. This assay endpoint, BSK_BE3C_PAI1, was analyzed with bidirectional fitting relative to DMSO
as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity
can be used to understand changes in the signaling as they relate to the gene SERPINE1. Furthermore, this assay
endpoint can be referred to as a primary readout, because this assay has produced multiple assay endpoints
where this one serves a signaling function. To generalize the intended target to other relatable targets, this
assay endpoint is annotated to the cytokine intended target family, where the subfamily is plasmogen activator
inhibitor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: PAI-1 antibody is used to tag and quantify the level of serpin peptidase inhibitor, clade E (nexin,
plasminogen activator inhibitor type 1), member 1 protein. Changes in the signals are indicative of protein
expression changes when conditioned to simulate proinflammation from cytokines [GeneSymbol:SERPINEl |
GenelD:5054 | Uniprot_SwissProt_Accession:P05121],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells primary cell used. Primary human cell types used in
BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.037
Response cutoff threshold used to determine hit calls: 0.112
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Inflammation (BE3C) system models lung inflammation of the Thl type, an environment
that promotes monocyte and T cell adhesion and recruitment. This system is relevant for sarcoidosis and
pulmonary responses to respiratory infections.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 146.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 198

BSK_BE3C_SRB

1. General Information

1.1	Assay Title: Viability Assessment in the BioMAP Diversity Plus: BE3C Assay

1.2	Assay Summary: BSK_BE3C is a cell-based, multiplexed-readout assay that uses bronchial epithelial cell, a human
lung primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. BSK_BE3C_SRB
is an assay component measured in the BSK_BE3C assay. It measures 0.1% sulforhodamine related to cell death
using Sulforhodamine staining technology. Data from the assay component BSK_BE3C_SRB was analyzed into 1
assay endpoint. This assay endpoint, BSK_BE3C_SRB, was analyzed with bidirectional fitting relative to DMSO as
the negative control and baseline of activity. Using a type of viability reporter, gain or loss-of-signal activity can
be used to understand changes in the signaling. Furthermore, this assay endpoint can be referred to as a primary
readout, because this assay has produced multiple assay endpoints where this one serves a signaling function.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell
morphology intended target family, where the subfamily is cell conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: SRB in the BE3C system is a measure of the total protein content of bronchial epithelial cells. Cell
viability of adherent cells is measured by Sulforhodamine B (SRB) staining, a method that determines cell density
by measuring total protein content of test wells.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells primary cell used. Primary human cell types used in
BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.006
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: Sulforhodamine staining (Spectrophotometry)

2.6	Response: The Lung Inflammation (BE3C) system models lung inflammation of the Thl type, an environment
that promotes monocyte and T cell adhesion and recruitment. This system is relevant for sarcoidosis and
pulmonary responses to respiratory infections.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 116.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 200

BSK_BE3C_TGFbl

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: BE3C Assay for Transforming growth factor beta 1 (TGFbl) Biomarker Activity

1.2	Assay Summary: BSK_BE3C is a cell-based, multiplexed-readout assay that uses bronchial epithelial cell, a human
lung primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_BE3C_TGFbl is an assay component measured in the BSK_BE3C assay. It measures TGF-bl antibody related
to regulation of gene expression using ELISA technology. Data from the assay component BSK_BE3C_TGFbl was
analyzed into 1 assay endpoint. This assay endpoint, BSK_BE3C_TGFbl, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand changes in the signaling as they relate to the gene TGFB1.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the growth factor intended target family, where the
subfamily is transforming growth factor beta.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

-------
2.1	Purpose: TGF-bl antibody is used to tag and quantify the level of transforming growth factor, beta 1 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbokTGFBl | GenelD:7040 | Uniprot_SwissProt_Accession:P01137].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells primary cell used. Primary human cell types used in
BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

59.4 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.035
Response cutoff threshold used to determine hit calls: 0.106
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Inflammation (BE3C) system models lung inflammation of the Thl type, an environment
that promotes monocyte and T cell adhesion and recruitment. This system is relevant for sarcoidosis and
pulmonary responses to respiratory infections.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of growth factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1540	Number of chemicals tested: 1484

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 72.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 202

BSK_BE3C_tPA

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BE3C Assay for Tissue plasminogen activator (tPA) Biomarker Activity

1.2	Assay Summary: BSK_BE3C is a cell-based, multiplexed-readout assay that uses bronchial epithelial cell, a human
lung primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. BSK_BE3C_tPA
is an assay component measured in the BSK_BE3C assay. It measures tPA antibody related to regulation of gene
expression using ELISA technology. Data from the assay component BSK_BE3C_tPA was analyzed into 1 assay
endpoint. This assay endpoint, BSK_BE3C_tPA, was analyzed with bidirectional fitting relative to DMSO as the
negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be
used to understand changes in the signaling as they relate to the gene PLAT. Furthermore, this assay endpoint
can be referred to as a primary readout, because this assay has produced multiple assay endpoints where this
one serves a signaling function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the protease intended target family, where the subfamily is serine protease.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: tPA antibody is used to tag and quantify the level of plasminogen activator, tissue protein. Changes
in the signals are indicative of protein expression changes when conditioned to simulate proinflammation from
cytokines [GeneSymbokPLAT | GenelD:5327 | Uniprot_SwissProt_Accession:P00750],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells primary cell used. Primary human cell types used in
BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.045
Response cutoff threshold used to determine hit calls: 0.134
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Inflammation (BE3C) system models lung inflammation of the Thl type, an environment
that promotes monocyte and T cell adhesion and recruitment. This system is relevant for sarcoidosis and
pulmonary responses to respiratory infections.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 155.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 204

BSK_BE3C_uPA

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BE3C Assay for urokinase plasminogen activator (uPA) Biomarker Activity

1.2	Assay Summary: BSK_BE3C is a cell-based, multiplexed-readout assay that uses bronchial epithelial cell, a human
lung primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. BSK_BE3CA is
an assay component measured in the BSK_BE3C assay. It measures uPA antibody related to regulation of gene
expression using ELISA technology. Data from the assay component BSK_BE3CA was analyzed into 1 assay
endpoint. This assay endpoint, BSK_BE3CA, was analyzed with bidirectional fitting relative to DMSO as the
negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be
used to understand changes in the signaling as they relate to the gene PLAU. Furthermore, this assay endpoint
can be referred to as a primary readout, because this assay has produced multiple assay endpoints where this
one serves a signaling function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the protease intended target family, where the subfamily is serine protease.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: uPA antibody is used to tag and quantify the level of plasminogen activator, urokinase protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbokPLAU | GenelD:5328 | Uniprot_SwissProt_Accession:P00749],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells primary cell used. Primary human cell types used in
BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.027
Response cutoff threshold used to determine hit calls: 0.082
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Inflammation (BE3C) system models lung inflammation of the Thl type, an environment
that promotes monocyte and T cell adhesion and recruitment. This system is relevant for sarcoidosis and
pulmonary responses to respiratory infections.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 119.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 206

BSK_BE3C_uPAR

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BE3C Assay for Urokinase-type plasminogen activator receptor (uPAR)
Biomarker Activity

1.2	Assay Summary: BSK_BE3C is a cell-based, multiplexed-readout assay that uses bronchial epithelial cell, a human
lung primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. BSK_BE3CAR
is an assay component measured in the BSK_BE3C assay. It measures uPAR antibody related to regulation of
gene expression using ELISA technology. Data from the assay component BSK_BE3CAR was analyzed into 1 assay
endpoint. This assay endpoint, BSK_BE3CAR, was analyzed with bidirectional fitting relative to DMSO as the
negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be
used to understand changes in the signaling as they relate to the gene PLAUR. Furthermore, this assay endpoint
can be referred to as a primary readout, because this assay has produced multiple assay endpoints where this
one serves a signaling function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the cytokine intended target family, where the subfamily is plasmogen activator.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: uPAR antibody is used to tag and quantify the level of plasminogen activator, urokinase receptor
protein. Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbokPLAUR | GenelD:5329 | Uniprot_SwissProt_Accession:Q03405],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells primary cell used. Primary human cell types used in
BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.038
Response cutoff threshold used to determine hit calls: 0.114
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Inflammation (BE3C) system models lung inflammation of the Thl type, an environment
that promotes monocyte and T cell adhesion and recruitment. This system is relevant for sarcoidosis and
pulmonary responses to respiratory infections.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 142.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 208

BSK_CASM3C_HLADR

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: CASM3C Assay for Human Leukocyte Antigen - DR isotype (HLADR)
Biomarker Activity

1.2	Assay Summary: BSK_CASM3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium
and coronary artery smooth muscle cells, a human vascular primary cell co-culture, with measurements taken
at 24 hours after chemical dosing in a 96-well plate. BSK_CASM3C_HLADR is an assay component measured in
the BSK_CASM3C assay. It measures HLA-DR antibody related to regulation of gene expression using ELISA
technology. Data from the assay component BSK_CASM3C_HLADR was analyzed into 1 assay endpoint. This
assay endpoint, BSK_CASM3C_HLADR, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be used to
understand changes in the signaling as they relate to the gene HLA-DRA. Furthermore, this assay endpoint can
be referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves a signaling function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cell adhesion molecules intended target family, where the subfamily is MHC Class II.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: HLA-DR antibody is used to tag and quantify the level of major histocompatibility complex, class II,
DR alpha protein. Changes in the signals are indicative of protein expression changes when conditioned to
simulate proinflammation from cytokines [GeneSymbol:HLA-DRA | GenelD:3122 |
Uniprot_SwissProt_Accession:P01903].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent coronary artery smooth muscle cells primary cell used. Primary human cell
types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha
and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System
(PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System
(bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human
dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha,
IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta,
TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.035
Response cutoff threshold used to determine hit calls: 0.105
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Cardiovascular Disease (CASM3C) system models vascular inflammation of the Thl type, an
environment that promotes monocyte and T cell recruitment. This system is relevant for chronic inflammatory
diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
351

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 126.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 210

BSK_CASM3C_I L6

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: CASM3C Assay for lnterleukin-6 (IL6) Biomarker Activity for

1.2	Assay Summary: BSK_CASM3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium
and coronary artery smooth muscle cells, a human vascular primary cell co-culture, with measurements taken
at 24 hours after chemical dosing in a 96-well plate. BSK_CASM3C_IL6 is an assay component measured in the
BSK_CASM3C assay. It measures IL-6 antibody related to regulation of gene expression using ELISA technology.
Data from the assay component BSK_CASM3C_IL6 was analyzed into 1 assay endpoint. This assay endpoint,
BSK_CASM3C_IL6, was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline
of activity. Using a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in
the signaling as they relate to the gene IL6. Furthermore, this assay endpoint can be referred to as a primary
readout, because this assay has produced multiple assay endpoints where this one serves a signaling function.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the cytokine
intended target family, where the subfamily is interleukins.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: IL-6 antibody is used to tag and quantify the level of interleukin 6 protein. Changes in the signals
are indicative of protein expression changes when conditioned to simulate proinflammation from cytokines
[GeneSymbol:IL6 | GenelD:3569 | Uniprot_SwissProt_Accession:P05231].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent coronary artery smooth muscle cells primary cell used. Primary human cell
types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha
and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System
(PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System
(bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human
dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha,
IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta,
TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.04
Response cutoff threshold used to determine hit calls: 0.121
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Cardiovascular Disease (CASM3C) system models vascular inflammation of the Thl type, an
environment that promotes monocyte and T cell recruitment. This system is relevant for chronic inflammatory
diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 79.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 212

BSK_CASM3C_I L8

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: CASM3C Assay for lnterleukin-8 (IL8) Biomarker Activity

1.2	Assay Summary: BSK_CASM3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium
and coronary artery smooth muscle cells, a human vascular primary cell co-culture, with measurements taken
at 24 hours after chemical dosing in a 96-well plate. BSK_CASM3C_IL8 is an assay component measured in the
BSK_CASM3C assay. It measures IL-8 antibody related to regulation of gene expression using ELISA technology.
Data from the assay component BSK_CASM3C_IL8 was analyzed into 1 assay endpoint. This assay endpoint,
BSK_CASM3C_IL8, was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline
of activity. Using a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in
the signaling as they relate to the gene CXCL8. Furthermore, this assay endpoint can be referred to as a primary
readout, because this assay has produced multiple assay endpoints where this one serves a signaling function.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the cytokine
intended target family, where the subfamily is interleukins.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: IL-8 antibody is used to tag and quantify the level of chemokine (C-X-C motif) ligand 8 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbol:CXCL8 | GenelD:3576 | Uniprot_SwissProt_Accession:P10145].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent coronary artery smooth muscle cells primary cell used. Primary human cell
types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha
and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System
(PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System
(bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human
dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha,
IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta,
TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.017
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Cardiovascular Disease (CASM3C) system models vascular inflammation of the Thl type, an
environment that promotes monocyte and T cell recruitment. This system is relevant for chronic inflammatory
diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 109.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 214

BSK_CASM3C_LDLR

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: CASM3C Assay for Low-density lipoprotein receptor (LDLR) Biomarker
Activity

1.2	Assay Summary: BSK_CASM3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium
and coronary artery smooth muscle cells, a human vascular primary cell co-culture, with measurements taken
at 24 hours after chemical dosing in a 96-well plate. BSK_CASM3C_LDLR is an assay component measured in the
BSK_CASM3C assay. It measures LDLR antibody related to regulation of gene expression using ELISA technology.
Data from the assay component BSK_CASM3C_LDLR was analyzed into 1 assay endpoint. This assay endpoint,
BSK_CASM3C_LDLR, was analyzed with bidirectional fitting relative to DMSO as the negative control and
baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be used to understand
changes in the signaling as they relate to the gene LDLR. Furthermore, this assay endpoint can be referred to
as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
signaling function. To generalize the intended target to other relatable targets, this assay endpoint is annotated
to the misc protein intended target family, where the subfamily is LDL receptor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: LDLR antibody is used to tag and quantify the level of low density lipoprotein receptor protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbokLDLR | GenelD:3949 | Uniprot_SwissProt_Accession:P01130].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent coronary artery smooth muscle cells primary cell used. Primary human cell
types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha
and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System
(PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System
(bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human
dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha,
IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta,
TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.025
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Cardiovascular Disease (CASM3C) system models vascular inflammation of the Thl type, an
environment that promotes monocyte and T cell recruitment. This system is relevant for chronic inflammatory
diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of misc protein.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 106.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 216

BSK_CASM3C_MCP1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: CASM3C Assay for Monocyte Chemoattractant Protein-1 (MCP1) Biomarker
Activity

1.2	Assay Summary: BSK_CASM3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium
and coronary artery smooth muscle cells, a human vascular primary cell co-culture, with measurements taken
at 24 hours after chemical dosing in a 96-well plate. BSK_CASM3C_MCP1 is an assay component measured in
the BSK_CASM3C assay. It measures MCP-1 antibody related to regulation of gene expression using ELISA
technology. Data from the assay component BSK_CASM3C_MCP1 was analyzed into 1 assay endpoint. This assay
endpoint, BSK_CASM3C_MCP1, was analyzed with bidirectional fitting relative to DMSO as the negative control
and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be used to understand
changes in the signaling as they relate to the gene CCL2. Furthermore, this assay endpoint can be referred to
as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
signaling function. To generalize the intended target to other relatable targets, this assay endpoint is annotated
to the cytokine intended target family, where the subfamily is chemotactic factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: MCP-1 antibody is used to tag and quantify the level of chemokine (C-C motif) ligand 2 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbol:CCL2 | GenelD:6347 | Uniprot_SwissProt_Accession:P13500].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent coronary artery smooth muscle cells primary cell used. Primary human cell
types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha
and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System
(PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System
(bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human
dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha,
IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta,
TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.029
Response cutoff threshold used to determine hit calls: 0.087
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Cardiovascular Disease (CASM3C) system models vascular inflammation of the Thl type, an
environment that promotes monocyte and T cell recruitment. This system is relevant for chronic inflammatory
diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 91.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 218

BSK_CASM3C_MCSF

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: CASM3C Assay for Macrophage colony-stimulating factor (MCSF) Biomarker
Activity

1.2	Assay Summary: BSK_CASM3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium
and coronary artery smooth muscle cells, a human vascular primary cell co-culture, with measurements taken
at 24 hours after chemical dosing in a 96-well plate. BSK_CASM3C_MCSF is an assay component measured in
the BSK_CASM3C assay. It measures M-CSF antibody related to regulation of gene expression using ELISA
technology. Data from the assay component BSK_CASM3C_MCSF was analyzed into 1 assay endpoint. This assay
endpoint, BSK_CASM3C_MCSF, was analyzed with bidirectional fitting relative to DMSO as the negative control
and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be used to understand
changes in the signaling as they relate to the gene CSF1. Furthermore, this assay endpoint can be referred to as
a primary readout, because this assay has produced multiple assay endpoints where this one serves a signaling
function. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
cytokine intended target family, where the subfamily is colony stimulating factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: M-CSF antibody is used to tag and quantify the level of colony stimulating factor 1 (macrophage)
protein. Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbokCSFl | GenelD:1435 | Uniprot_SwissProt_Accession:P09603].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent coronary artery smooth muscle cells primary cell used. Primary human cell
types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha
and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System
(PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System
(bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human
dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha,
IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta,
TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.032
Response cutoff threshold used to determine hit calls: 0.097
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Cardiovascular Disease (CASM3C) system models vascular inflammation of the Thl type, an
environment that promotes monocyte and T cell recruitment. This system is relevant for chronic inflammatory
diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 112.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 220

BSK_CASM3C_M IG

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: CASM3C Assay for Monokine induced gamma interferon (MIG) Biomarker
Activity

1.2	Assay Summary: BSK_CASM3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium
and coronary artery smooth muscle cells, a human vascular primary cell co-culture, with measurements taken
at 24 hours after chemical dosing in a 96-well plate. BSK_CASM3C_MIG is an assay component measured in the
BSK_CASM3C assay. It measures MIG antibody related to regulation of gene expression using ELISA technology.
Data from the assay component BSK_CASM3C_MIG was analyzed into 1 assay endpoint. This assay endpoint,
BSK_CASM3C_MIG, was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline
of activity. Using a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in
the signaling as they relate to the gene CXCL9. Furthermore, this assay endpoint can be referred to as a primary
readout, because this assay has produced multiple assay endpoints where this one serves a signaling function.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the cytokine
intended target family, where the subfamily is chemotactic factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: MIG antibody is used to tag and quantify the level of chemokine (C-X-C motif) ligand 9 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbol:CXCL9 | GenelD:4283 | Uniprot_SwissProt_Accession:Q07325],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent coronary artery smooth muscle cells primary cell used. Primary human cell
types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha
and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System
(PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System
(bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human
dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha,
IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta,
TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.016
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Cardiovascular Disease (CASM3C) system models vascular inflammation of the Thl type, an
environment that promotes monocyte and T cell recruitment. This system is relevant for chronic inflammatory
diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 108.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 222

BSK_CASM3C_Proliferation

1. General Information

1.1	Assay Title: Cell Proliferation Assessment in the BioMAP Diversity Plus: CASM3C Assay

1.2	Assay Summary: BSK_CASM3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium
and coronary artery smooth muscle cells, a human vascular primary cell co-culture, with measurements taken
at 24 hours after chemical dosing in a 96-well plate. BSK_CASM3C_Proliferation is an assay component
measured in the BSK_CASM3C assay. It measures 0.1% sulforhodamine related to cell proliferation using
Sulforhodamine staining technology. Data from the assay component BSK_CASM3C_Proliferation was analyzed
into 1 assay endpoint. This assay endpoint, BSK_CASM3C_Proliferation, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of viability reporter, measures of
total protein for gain or loss-of-signal activity can be used to understand the viability at the cellular-level.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a viability function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily
is proliferation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: Proliferation in the CASM3C system is a measure of coronary artery smooth muscle cell
proliferation which is important to the process of vasacular biology and restenosis.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent coronary artery smooth muscle cells primary cell used. Primary human cell
types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha
and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System
(PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System
(bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human
dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha,
IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta,
TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.032
Response cutoff threshold used to determine hit calls: 0.097
Detection technology used: Sulforhodamine staining (Spectrophotometry)

2.6	Response: The Cardiovascular Disease (CASM3C) system models vascular inflammation of the Thl type, an
environment that promotes monocyte and T cell recruitment. This system is relevant for chronic inflammatory
diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 152.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 224

BSK_CASM3C_SAA

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: CASM3C Assay for serum amyloid A (SAA) Biomarker Activity

1.2	Assay Summary: BSK_CASM3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium
and coronary artery smooth muscle cells, a human vascular primary cell co-culture, with measurements taken
at 24 hours after chemical dosing in a 96-well plate. BSK_CASM3C_SAA is an assay component measured in the
BSK_CASM3C assay. It measures SAA antibody related to regulation of gene expression using ELISA technology.
Data from the assay component BSK_CASM3C_SAA was analyzed into 1 assay endpoint. This assay endpoint,
BSK_CASM3C_SAA, was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline
of activity. Using a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in
the signaling as they relate to the gene SAA1. Furthermore, this assay endpoint can be referred to as a primary
readout, because this assay has produced multiple assay endpoints where this one serves a signaling function.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell adhesion
molecules intended target family, where the subfamily is apolipoproteins.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Serum Amyloid A (SAA) is a member of the apolipoprotein family that is an acute phase reactant.
SAA is categorized as an inflammation-related activity in the CASM3C system modeling Thl vascular smooth
muscle inflammation. Changes in the signals are indicative of protein expression changes when conditioned to
simulate proinflammation from cytokines [GeneSymbokSAAl | GenelD:6288 |
Uniprot_SwissProt_Accession:P0DJI8].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent coronary artery smooth muscle cells primary cell used. Primary human cell
types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha
and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System
(PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System
(bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human
dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha,
IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta,
TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.034
Response cutoff threshold used to determine hit calls: 0.101
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Cardiovascular Disease (CASM3C) system models vascular inflammation of the Thl type, an
environment that promotes monocyte and T cell recruitment. This system is relevant for chronic inflammatory
diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
249

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 95.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 226

BSK_CASM3C_SRB

1. General Information

1.1	Assay Title: Viability Assessment in the BioMAP Diversity Plus: CASM3C Assay

1.2	Assay Summary: BSK_CASM3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium
and coronary artery smooth muscle cells, a human vascular primary cell co-culture, with measurements taken
at 24 hours after chemical dosing in a 96-well plate. BSK_CASM3C_SRB is an assay component measured in the
BSK_CASM3C assay. It measures 0.1% sulforhodamine related to cell death using Sulforhodamine staining
technology. Data from the assay component BSK_CASM3C_SRB was analyzed into 1 assay endpoint. This assay
endpoint, BSK_CASM3C_SRB, was analyzed with bidirectional fitting relative to DMSO as the negative control
and baseline of activity. Using a type of viability reporter, gain or loss-of-signal activity can be used to
understand changes in the signaling. Furthermore, this assay endpoint can be referred to as a primary readout,
because this assay has produced multiple assay endpoints where this one serves a signaling function. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell morphology
intended target family, where the subfamily is cell conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: SRB in the CASM3C system is a measure of the total protein content of coronary artery smooth
muscle cells. Cell viability of adherent cells is measured by Sulforhodamine B (SRB) staining, a method that
determines cell density by measuring total protein content of test wells.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent coronary artery smooth muscle cells primary cell used. Primary human cell
types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha
and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System
(PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System
(bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human
dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha,
IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta,
TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.015
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: Sulforhodamine staining (Spectrophotometry)

2.6	Response: The Cardiovascular Disease (CASM3C) system models vascular inflammation of the Thl type, an
environment that promotes monocyte and T cell recruitment. This system is relevant for chronic inflammatory
diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 110.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 228

BSK_CASM3C_Th rom bomod ulin

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: CASM3C Assay for Thrombomodulin Biomarker Activity

1.2	Assay Summary: BSK_CASM3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium
and coronary artery smooth muscle cells, a human vascular primary cell co-culture, with measurements taken
at 24 hours after chemical dosing in a 96-well plate. BSK_CASM3C_Thrombomodulin is an assay component
measured in the BSK_CASM3C assay. It measures Thrombomodulin antibody related to regulation of gene
expression using ELISA technology. Data from the assay component BSK_CASM3C_Thrombomodulin was
analyzed into 1 assay endpoint. This assay endpoint, BSK_CASM3C_Thrombomodulin, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of binding
reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they relate to the
gene THBD. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a signaling function. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the gpcr intended target family, where the
subfamily is rhodopsin-like receptor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: Thrombomodulin antibody is used to tag and quantify the level of thrombomodulin protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbokTHBD | GenelD:7056 | Uniprot_SwissProt_Accession:P07204].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent coronary artery smooth muscle cells primary cell used. Primary human cell
types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha
and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System
(PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System
(bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human
dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha,
IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta,
TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.033
Response cutoff threshold used to determine hit calls: 0.1
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Cardiovascular Disease (CASM3C) system models vascular inflammation of the Thl type, an
environment that promotes monocyte and T cell recruitment. This system is relevant for chronic inflammatory
diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 103.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 230

BSK_CASM3C_TissueFactor

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: CASM3C Assay for Tissue Factor Biomarker Activity

1.2	Assay Summary: BSK_CASM3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium
and coronary artery smooth muscle cells, a human vascular primary cell co-culture, with measurements taken
at 24 hours after chemical dosing in a 96-well plate. BSK_CASM3C_TissueFactor is an assay component
measured in the BSK_CASM3C assay. It measures Tissue Factor antibody related to regulation of gene expression
using ELISA technology. Data from the assay component BSK_CASM3C_TissueFactor was analyzed into 1 assay
endpoint. This assay endpoint, BSK_CASM3C_TissueFactor, was analyzed with bidirectional fitting relative to
DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal
activity can be used to understand changes in the signaling as they relate to the gene F3. Furthermore, this
assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a signaling function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the cytokine intended target family, where the subfamily is
coagulation factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: Tissue Factor antibody is used to tag and quantify the level of coagulation factor III
(thromboplastin, tissue factor) protein. Changes in the signals are indicative of protein expression changes when
conditioned to simulate proinflammation from cytokines [GeneSymbol:F3 | GenelD:2152 |
Uniprot_SwissProt_Accession:P13726],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent coronary artery smooth muscle cells primary cell used. Primary human cell
types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha
and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System
(PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System
(bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human
dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha,
IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta,
TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.038
Response cutoff threshold used to determine hit calls: 0.114
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Cardiovascular Disease (CASM3C) system models vascular inflammation of the Thl type, an
environment that promotes monocyte and T cell recruitment. This system is relevant for chronic inflammatory
diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
233

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 104.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 232

BSK_CASM3C_u PAR

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: CASM3C Assay for Urokinase-type plasminogen activator receptor (uPAR)
Biomarker Activity

1.2	Assay Summary: BSK_CASM3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium
and coronary artery smooth muscle cells, a human vascular primary cell co-culture, with measurements taken
at 24 hours after chemical dosing in a 96-well plate. BSK_CASM3CAR is an assay component measured in the
BSK_CASM3C assay. It measures uPAR antibody related to regulation of gene expression using ELISA technology.
Data from the assay component BSK_CASM3CAR was analyzed into 1 assay endpoint. This assay endpoint,
BSK_CASM3CAR, was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline
of activity. Using a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in
the signaling as they relate to the gene PLAUR. Furthermore, this assay endpoint can be referred to as a primary
readout, because this assay has produced multiple assay endpoints where this one serves a signaling function.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the cytokine
intended target family, where the subfamily is plasmogen activator.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: uPAR antibody is used to tag and quantify the level of plasminogen activator, urokinase receptor
protein. Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbokPLAUR | GenelD:5329 | Uniprot_SwissProt_Accession:Q03405],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent coronary artery smooth muscle cells primary cell used. Primary human cell
types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha
and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System
(PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System
(bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human
dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha,
IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta,
TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.036
Response cutoff threshold used to determine hit calls: 0.107
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Cardiovascular Disease (CASM3C) system models vascular inflammation of the Thl type, an
environment that promotes monocyte and T cell recruitment. This system is relevant for chronic inflammatory
diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 120.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 234

BSK_CASM3C_VCAM 1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: CASM3C Assay for Vascular cell adhesion molecule 1 (VCAM1) Biomarker
Activity

1.2	Assay Summary: BSK_CASM3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium
and coronary artery smooth muscle cells, a human vascular primary cell co-culture, with measurements taken
at 24 hours after chemical dosing in a 96-well plate. BSK_CASM3C_VCAM1 is an assay component measured in
the BSK_CASM3C assay. It measures VCAM-1 antibody related to regulation of gene expression using ELISA
technology. Data from the assay component BSK_CASM3C_VCAM1 was analyzed into 1 assay endpoint. This
assay endpoint, BSK_CASM3C_VCAM1, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be used to
understand changes in the signaling as they relate to the gene VCAM1. Furthermore, this assay endpoint can
be referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves a signaling function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cell adhesion molecules intended target family, where the subfamily is Immunoglobulin CAM.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: VCAM-1 antibody is used to tag and quantify the level of vascular cell adhesion molecule 1 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines [GeneSymbokVCAMl | GenelD:7412 | Uniprot_SwissProt_Accession:P19320].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent coronary artery smooth muscle cells primary cell used. Primary human cell
types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha
and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System
(PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System
(bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human
dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha,
IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta,
TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.029
Response cutoff threshold used to determine hit calls: 0.086
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Cardiovascular Disease (CASM3C) system models vascular inflammation of the Thl type, an
environment that promotes monocyte and T cell recruitment. This system is relevant for chronic inflammatory
diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 127.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 236

BSK_h DFCG F_Col lagen 111

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: hDFCGF Assay for Collagen III Biomarker Activity

1.2	Assay Summary: BSK_hDFCGF is a cell-based, multiplexed-readout assay that uses foreskin fibroblast, a human
skin primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_hDFCGF_Collagenlll is an assay component measured in the BSK_hDFCGF assay. It measures Collagen
antibody related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_hDFCGF_Collagenlll was analyzed into 1 assay endpoint. This assay endpoint, BSK_hDFCGF_Collagenlll, was
analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type
of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they
relate to the gene COL3A1. Furthermore, this assay endpoint can be referred to as a primary readout, because
this assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the cell adhesion molecules
intended target family, where the subfamily is collagen.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Collagen antibody is used to tag and quantify the level of collagen, type III, alpha 1 protein. Changes
in the signals are indicative of protein expression changes when conditioned to simulate proinflammation from
cytokines and growth factors [GeneSymbol:COL3Al | GenelD:1281 | Uniprot_SwissProt_Accession:P02461].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent foreskin fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.047
Response cutoff threshold used to determine hit calls: 0.142
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Wound Healing, Fibrosis and Inflammation (HDF3CGF) system models wound healing and
matrix/tissue remodeling in the context of Thl-type inflammation. This system is relevant for various diseases
including fibrosis, rheumatoid arthritis, psoriasis, as well as stromal biology in tumors.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 185.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
BSK_h DFCG F_EG FR

Assay Endpoint ID: 238

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: hDFCGF Assay for Epidermal growth factor receptor (EGFR) Biomarker
Activity

1.2	Assay Summary: BSK_hDFCGF is a cell-based, multiplexed-readout assay that uses foreskin fibroblast, a human
skin primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_hDFCGF_EGFR is an assay component measured in the BSK_hDFCGF assay. It measures EGFR antibody
related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_hDFCGF_EGFR was analyzed into 1 assay endpoint. This assay endpoint, BSK_hDFCGF_EGFR, was analyzed
with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of
binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they relate
to the gene EGFR. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a signaling function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the kinase intended target family, where
the subfamily is receptor tyrosine kinase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: EGFR antibody is used to tag and quantify the level of epidermal growth factor receptor protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines and growth factors [GeneSymbokEGFR | GenelD:1956 |
Uniprot_SwissProt_Accession:P00533].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent foreskin fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.041
Response cutoff threshold used to determine hit calls: 0.123
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Wound Healing, Fibrosis and Inflammation (HDF3CGF) system models wound healing and
matrix/tissue remodeling in the context of Thl-type inflammation. This system is relevant for various diseases
including fibrosis, rheumatoid arthritis, psoriasis, as well as stromal biology in tumors.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of kinase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
164

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 118.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 240

BSK_hDFCGF_IL8

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: hDFCGF Assay for lnterleukin-8 (IL8) Biomarker Activity

1.2	Assay Summary: BSK_hDFCGF is a cell-based, multiplexed-readout assay that uses foreskin fibroblast, a human
skin primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_hDFCGF_IL8 is an assay component measured in the BSK_hDFCGF assay. It measures IL-8 antibody related
to regulation of gene expression using ELISA technology. Data from the assay component BSK_hDFCGF_IL8 was
analyzed into 1 assay endpoint. This assay endpoint, BSK_hDFCGF_IL8, was analyzed with bidirectional fitting
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand changes in the signaling as they relate to the gene CXCL8.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cytokine intended target family, where the subfamily
is interleukins.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: IL-8 antibody is used to tag and quantify the level of chemokine (C-X-C motif) ligand 8 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines and growth factors [GeneSymbol:CXCL8 | GenelD:3576 |
Uniprot_SwissProt_Accession:P10145].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent foreskin fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.008
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Wound Healing, Fibrosis and Inflammation (HDF3CGF) system models wound healing and
matrix/tissue remodeling in the context of Thl-type inflammation. This system is relevant for various diseases
including fibrosis, rheumatoid arthritis, psoriasis, as well as stromal biology in tumors.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 162.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 242

BSK_hDFCGF_IP10

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: hDFCGF Assay for Interferon-gamma inducible Protein 10 (IP10) Biomarker
Activity

1.2	Assay Summary: BSK_hDFCGF is a cell-based, multiplexed-readout assay that uses foreskin fibroblast, a human
skin primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_hDFCGF_IP10 is an assay component measured in the BSK_hDFCGF assay. It measures IP-10 antibody
related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_hDFCGF_IP10 was analyzed into 1 assay endpoint. This assay endpoint, BSK_hDFCGF_IP10, was analyzed
with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of
binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they relate
to the gene CXCL10. Furthermore, this assay endpoint can be referred to as a primary readout, because this
assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the cytokine intended target
family, where the subfamily is chemotactic factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: IP-10 antibody is used to tag and quantify the level of chemokine (C-X-C motif) ligand 10 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines and growth factors [GeneSymbol:CXCL10 | GenelD:3627 |
Uniprot_SwissProt_Accession:P02778],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent foreskin fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.027
Response cutoff threshold used to determine hit calls: 0.081
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Wound Healing, Fibrosis and Inflammation (HDF3CGF) system models wound healing and
matrix/tissue remodeling in the context of Thl-type inflammation. This system is relevant for various diseases
including fibrosis, rheumatoid arthritis, psoriasis, as well as stromal biology in tumors.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
550

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 184.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 244

BSKJl DFCG F_MCSF

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: hDFCGF Assay for Macrophage colony-stimulating factor (MCSF) Biomarker
Activity

1.2	Assay Summary: BSK_hDFCGF is a cell-based, multiplexed-readout assay that uses foreskin fibroblast, a human
skin primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_hDFCGF_MCSF is an assay component measured in the BSK_hDFCGF assay. It measures M-CSF antibody
related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_hDFCGF_MCSF was analyzed into 1 assay endpoint. This assay endpoint, BSK_hDFCGF_MCSF, was analyzed
with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of
binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they relate
to the gene CSF1. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a signaling function. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the cytokine intended target family, where
the subfamily is colony stimulating factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: M-CSF antibody is used to tag and quantify the level of colony stimulating factor 1 (macrophage)
protein. Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines and growth factors [GeneSymbokCSFl | GenelD:1435 |
Uniprot_SwissProt_Accession:P09603].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent foreskin fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.036
Response cutoff threshold used to determine hit calls: 0.107
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Wound Healing, Fibrosis and Inflammation (HDF3CGF) system models wound healing and
matrix/tissue remodeling in the context of Thl-type inflammation. This system is relevant for various diseases
including fibrosis, rheumatoid arthritis, psoriasis, as well as stromal biology in tumors.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
534

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 181.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 246

BSK_hDFCGF_MIG

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: hDFCGF Assay for Monokine induced gamma interferon (MIG) Biomarker
Activity

1.2	Assay Summary: BSK_hDFCGF is a cell-based, multiplexed-readout assay that uses foreskin fibroblast, a human
skin primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_hDFCGF_MIG is an assay component measured in the BSK_hDFCGF assay. It measures MIG antibody related
to regulation of gene expression using ELISA technology. Data from the assay component BSK_hDFCGF_MIG
was analyzed into 1 assay endpoint. This assay endpoint, BSK_hDFCGF_MIG, was analyzed with bidirectional
fitting relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand changes in the signaling as they relate to the gene CXCL9.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cytokine intended target family, where the subfamily
is chemotactic factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: MIG antibody is used to tag and quantify the level of chemokine (C-X-C motif) ligand 9 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines and growth factors [GeneSymbol:CXCL9 | GenelD:4283 |
Uniprot_SwissProt_Accession:Q07325],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent foreskin fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.016
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Wound Healing, Fibrosis and Inflammation (HDF3CGF) system models wound healing and
matrix/tissue remodeling in the context of Thl-type inflammation. This system is relevant for various diseases
including fibrosis, rheumatoid arthritis, psoriasis, as well as stromal biology in tumors.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
422

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 193.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 248

BSK_hDFCGF_MMPl

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: hDFCGF Assay for Matrix Metallopeptidase 1 (MMP1) Biomarker Activity

1.2	Assay Summary: BSK_hDFCGF is a cell-based, multiplexed-readout assay that uses foreskin fibroblast, a human
skin primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_hDFCGF_MMPl is an assay component measured in the BSK_hDFCGF assay. It measures MMP-1 antibody
related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_hDFCGF_MMPl was analyzed into 1 assay endpoint. This assay endpoint, BSK_hDFCGF_MMPl, was
analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type
of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they
relate to the gene MMP1. Furthermore, this assay endpoint can be referred to as a primary readout, because
this assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the protease intended target
family, where the subfamily is matrix metalloproteinase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: MMP-1 antibody is used to tag and quantify the level of matrix metallopeptidase 1 (interstitial
collagenase) protein. Changes in the signals are indicative of protein expression changes when conditioned to
simulate proinflammation from cytokines and growth factors [GeneSymbokMMPl | GenelD:4312 |
Uniprot_SwissProt_Accession:P03956],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent foreskin fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.026
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Wound Healing, Fibrosis and Inflammation (HDF3CGF) system models wound healing and
matrix/tissue remodeling in the context of Thl-type inflammation. This system is relevant for various diseases
including fibrosis, rheumatoid arthritis, psoriasis, as well as stromal biology in tumors.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 193.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 250

BSK_hDFCGF_PAIl

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: hDFCGF Assay for Plasminogen activator inhibitor-1 (PAI1) Biomarker Activity

1.2	Assay Summary: BSK_hDFCGF is a cell-based, multiplexed-readout assay that uses foreskin fibroblast, a human
skin primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_hDFCGF_PAIl is an assay component measured in the BSK_hDFCGF assay. It measures PAI-1 antibody
related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_hDFCGF_PAIl was analyzed into 1 assay endpoint. This assay endpoint, BSK_hDFCGF_PAIl, was analyzed
with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of
binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they relate
to the gene SERPINE1. Furthermore, this assay endpoint can be referred to as a primary readout, because this
assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the cytokine intended target
family, where the subfamily is plasmogen activator inhibitor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: PAI-1 antibody is used to tag and quantify the level of serpin peptidase inhibitor, clade E (nexin,
plasminogen activator inhibitor type 1), member 1 protein. Changes in the signals are indicative of protein
expression changes when conditioned to simulate proinflammation from cytokines and growth factors
[GeneSymbokSERPINEl | GenelD:5054 | Uniprot_SwissProt_Accession:P05121],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent foreskin fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.055
Response cutoff threshold used to determine hit calls: 0.166
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Wound Healing, Fibrosis and Inflammation (HDF3CGF) system models wound healing and
matrix/tissue remodeling in the context of Thl-type inflammation. This system is relevant for various diseases
including fibrosis, rheumatoid arthritis, psoriasis, as well as stromal biology in tumors.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 184.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 252

BSK_h DFCG F_Proliferation

1. General Information

1.1	Assay Title: Cell Proliferation Assessment in the BioMAP Diversity Plus: hDFCGF Assay

1.2	Assay Summary: BSK_hDFCGF is a cell-based, multiplexed-readout assay that uses foreskin fibroblast, a human
skin primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_hDFCGF_Proliferation is an assay component measured in the BSK_hDFCGF assay. It measures 0.1%
sulforhodamine related to cell proliferation using Sulforhodamine staining technology. Data from the assay
component BSK_hDFCGF_Proliferation was analyzed into 1 assay endpoint. This assay endpoint,
BSK_hDFCGF_Proliferation, was analyzed with bidirectional fitting relative to DMSO as the negative control and
baseline of activity. Using a type of viability reporter, measures of total protein for gain or loss-of-signal activity
can be used to understand the viability at the cellular-level. Furthermore, this assay endpoint can be referred
to as a secondary readout, because this assay has produced multiple assay endpoints where this one serves a
viability function. To generalize the intended target to other relatable targets, this assay endpoint is annotated
to the cell cycle intended target family, where the subfamily is proliferation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: 0.1% sulforhodamine is used to tag and quantify the total protein levels in the system. Changes to
these absorbance signals can be indicative of the viability in the system when conditioned to simulate
proinflammation from cytokines and growth factors.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent foreskin fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.045
Response cutoff threshold used to determine hit calls: 0.136
Detection technology used: Sulforhodamine staining (Spectrophotometry)

2.6	Response: The Wound Healing, Fibrosis and Inflammation (HDF3CGF) system models wound healing and
matrix/tissue remodeling in the context of Thl-type inflammation. This system is relevant for various diseases
including fibrosis, rheumatoid arthritis, psoriasis, as well as stromal biology in tumors.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 260.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 254

BSK_hDFCGF_SRB

1. General Information

1.1	Assay Title: Viability Assessment in the BioMAP Diversity Plus: hDFCGF Assay

1.2	Assay Summary: BSK_hDFCGF is a cell-based, multiplexed-readout assay that uses foreskin fibroblast, a human
skin primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_hDFCGF_SRB is an assay component measured in the BSK_hDFCGF assay. It measures 0.1% sulforhodamine
related to cell death using Sulforhodamine staining technology. Data from the assay component
BSK_hDFCGF_SRB was analyzed into 1 assay endpoint. This assay endpoint, BSK_hDFCGF_SRB, was analyzed
with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of
viability reporter, gain or loss-of-signal activity can be used to understand changes in the signaling. Furthermore,
this assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a signaling function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the cell morphology intended target family, where the subfamily is
cell conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: 0.1% sulforhodamine is used to tag and quantify the total protein levels in the system. Changes to
these absorbance signals can be indicative of the viability in the system when conditioned to simulate
proinflammation from cytokines and growth factors.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent foreskin fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.024
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: Sulforhodamine staining (Spectrophotometry)

2.6	Response: The Wound Healing, Fibrosis and Inflammation (HDF3CGF) system models wound healing and
matrix/tissue remodeling in the context of Thl-type inflammation. This system is relevant for various diseases
including fibrosis, rheumatoid arthritis, psoriasis, as well as stromal biology in tumors.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 146.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 256

BSK_hDFCGF_TIMPl

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: hDFCGF Assay for TIMP metallopeptidase inhibitor 1 (TIMP1) Biomarker
Activity

1.2	Assay Summary: BSK_hDFCGF is a cell-based, multiplexed-readout assay that uses foreskin fibroblast, a human
skin primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_hDFCGF_TIMPl is an assay component measured in the BSK_hDFCGF assay. It measures TIMP-1 antibody
related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_hDFCGF_TIMPl was analyzed into 1 assay endpoint. This assay endpoint, BSK_hDFCGF_TIMPl, was
analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type
of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they
relate to the gene TIMP1. Furthermore, this assay endpoint can be referred to as a primary readout, because
this assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the protease inhibitor intended
target family, where the subfamily is metalloproteinase inhibitor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: TIMP-1 antibody is used to tag and quantify the level of TIMP metallopeptidase inhibitor 1 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines and growth factors [GeneSymbol:TIMPl | GenelD:7076 |
Uniprot_SwissProt_Accession:P01033],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent foreskin fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.045
Response cutoff threshold used to determine hit calls: 0.134
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Wound Healing, Fibrosis and Inflammation (HDF3CGF) system models wound healing and
matrix/tissue remodeling in the context of Thl-type inflammation. This system is relevant for various diseases
including fibrosis, rheumatoid arthritis, psoriasis, as well as stromal biology in tumors.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease inhibitor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
356

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 167.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 258

BSK_h DFCG F_VCAM 1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: hDFCGF Assay for Vascular cell adhesion molecule 1 (VCAM1) Biomarker
Activity

1.2	Assay Summary: BSK_hDFCGF is a cell-based, multiplexed-readout assay that uses foreskin fibroblast, a human
skin primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_hDFCGF_VCAMl is an assay component measured in the BSK_hDFCGF assay. It measures VCAM-1 antibody
related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_hDFCGF_VCAMl was analyzed into 1 assay endpoint. This assay endpoint, BSK_hDFCGF_VCAMl, was
analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type
of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they
relate to the gene VCAM1. Furthermore, this assay endpoint can be referred to as a primary readout, because
this assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the cell adhesion molecules
intended target family, where the subfamily is Immunoglobulin CAM.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: VCAM-1 antibody is used to tag and quantify the level of vascular cell adhesion molecule 1 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines and growth factors [GeneSymbol:VCAMl | GenelD:7412 |
Uniprot_SwissProt_Accession:P19320].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent foreskin fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.047
Response cutoff threshold used to determine hit calls: 0.141
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Wound Healing, Fibrosis and Inflammation (HDF3CGF) system models wound healing and
matrix/tissue remodeling in the context of Thl-type inflammation. This system is relevant for various diseases
including fibrosis, rheumatoid arthritis, psoriasis, as well as stromal biology in tumors.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
531

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 188.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 260

BSK_KF3CT_ICAM 1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: KF3CT Assay for Intercellular Adhesion Molecule 1 (ICAM1) Biomarker
Activity

1.2	Assay Summary: BSK_KF3CT is a cell-based, multiplexed-readout assay that uses keratinocytes and foreskin
fibroblasts, a human skin primary cell co-culture, with measurements taken at 24 hours after chemical dosing
in a 96-well plate. BSK_KF3CT_ICAM1 is an assay component measured in the BSK_KF3CT assay. It measures
ICAM-1 antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_KF3CT_ICAM1 was analyzed into 1 assay endpoint. This assay endpoint, BSK_KF3CT_ICAM1,
was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using
a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as
they relate to the gene ICAM1. Furthermore, this assay endpoint can be referred to as a primary readout,
because this assay has produced multiple assay endpoints where this one serves a signaling function. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell adhesion
molecules intended target family, where the subfamily is Immunoglobulin CAM.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: ICAM-1 antibody is used to tag and quantify the level of intercellular adhesion molecule 1 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines and TGFb [GeneSymboUCAMl | GenelD:3383 |
Uniprot_SwissProt_Accession:P05362].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent keratinocytes and foreskin fibroblasts primary cell co-culture used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.021
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Psoriasis and Dermatitis (KF3CT) system models cutaneous inflammation of the Thl type, an
environment that promotes monocyte and T cell adhesion and recruitment. This system is relevant for
cutaneous responses to tissue damage caused by mechanical, chemical, or infectious agents, as well as certain
states of psoriasis and dermatitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and

-------
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 144.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 262

BSK_KF3CT_ILla

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: KF3CT Assay for lnterleukin-1 alpha (ILla) Biomarker Activity

1.2	Assay Summary: BSK_KF3CT is a cell-based, multiplexed-readout assay that uses keratinocytes and foreskin
fibroblasts, a human skin primary cell co-culture, with measurements taken at 24 hours after chemical dosing
in a 96-well plate. BSK_KF3CT_ILla is an assay component measured in the BSK_KF3CT assay. It measures IL-la
antibody related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_KF3CT_ILla was analyzed into 1 assay endpoint. This assay endpoint, BSK_KF3CT_ILla, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of binding
reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they relate to the
gene ILIA. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a signaling function. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the cytokine intended target family, where the
subfamily is interleukins.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: IL-la antibody is used to tag and quantify the level of interleukin 1, alpha protein. Changes in the
signals are indicative of protein expression changes when conditioned to simulate proinflammation from
cytokines and TGFb [GeneSymboULlA | GenelD:3552 | Uniprot_SwissProt_Accession:P01583].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent keratinocytes and foreskin fibroblasts primary cell co-culture used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.029
Response cutoff threshold used to determine hit calls: 0.086
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Psoriasis and Dermatitis (KF3CT) system models cutaneous inflammation of the Thl type, an
environment that promotes monocyte and T cell adhesion and recruitment. This system is relevant for
cutaneous responses to tissue damage caused by mechanical, chemical, or infectious agents, as well as certain
states of psoriasis and dermatitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 140.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 264

BSK_KF3CT_IP10

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: KF3CT Assay for Interferon-gamma inducible Protein 10 (IP10) Biomarker
Activity

1.2	Assay Summary: BSK_KF3CT is a cell-based, multiplexed-readout assay that uses keratinocytes and foreskin
fibroblasts, a human skin primary cell co-culture, with measurements taken at 24 hours after chemical dosing
in a 96-well plate. BSK_KF3CT_IP10 is an assay component measured in the BSK_KF3CT assay. It measures IP-10
antibody related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_KF3CT_IP10 was analyzed into 1 assay endpoint. This assay endpoint, BSK_KF3CT_IP10, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of binding
reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they relate to the
gene CXCL10. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a signaling function. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the cytokine intended target family, where the
subfamily is chemotactic factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: IP-10 antibody is used to tag and quantify the level of chemokine (C-X-C motif) ligand 10 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines and TGFb [GeneSymbol:CXCL10 | GenelD:3627 |
Uniprot_SwissProt_Accession:P02778],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent keratinocytes and foreskin fibroblasts primary cell co-culture used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.014
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Psoriasis and Dermatitis (KF3CT) system models cutaneous inflammation of the Thl type, an
environment that promotes monocyte and T cell adhesion and recruitment. This system is relevant for
cutaneous responses to tissue damage caused by mechanical, chemical, or infectious agents, as well as certain
states of psoriasis and dermatitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and

-------
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 171.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 266

BSK_KF3CT_MCP1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: KF3CT Assay for Matrix Metallopeptidase 1 (MMP1) Biomarker Activity

1.2	Assay Summary: BSK_KF3CT is a cell-based, multiplexed-readout assay that uses keratinocytes and foreskin
fibroblasts, a human skin primary cell co-culture, with measurements taken at 24 hours after chemical dosing
in a 96-well plate. BSK_KF3CT_MCP1 is an assay component measured in the BSK_KF3CT assay. It measures
MCP-1 antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_KF3CT_MCP1 was analyzed into 1 assay endpoint. This assay endpoint, BSK_KF3CT_MCP1, was
analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type
of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they
relate to the gene CCL2. Furthermore, this assay endpoint can be referred to as a primary readout, because this
assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the cytokine intended target
family, where the subfamily is chemotactic factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: MCP-1 antibody is used to tag and quantify the level of chemokine (C-C motif) ligand 2 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines and TGFb [GeneSymbol:CCL2 | GenelD:6347 |
Uniprot_SwissProt_Accession:P13500].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent keratinocytes and foreskin fibroblasts primary cell co-culture used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.038
Response cutoff threshold used to determine hit calls: 0.115
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Psoriasis and Dermatitis (KF3CT) system models cutaneous inflammation of the Thl type, an
environment that promotes monocyte and T cell adhesion and recruitment. This system is relevant for
cutaneous responses to tissue damage caused by mechanical, chemical, or infectious agents, as well as certain
states of psoriasis and dermatitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
341

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 165.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 268

BSK_KF3CT_M M P9

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: KF3CT Assay for Matrix Metallopeptidase 9 (MMP9) Biomarker Activity

1.2	Assay Summary: BSK_KF3CT is a cell-based, multiplexed-readout assay that uses keratinocytes and foreskin
fibroblasts, a human skin primary cell co-culture, with measurements taken at 24 hours after chemical dosing
in a 96-well plate. BSK_KF3CT_MMP9 is an assay component measured in the BSK_KF3CT assay. It measures
MMP-9 antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_KF3CT_MMP9 was analyzed into 1 assay endpoint. This assay endpoint, BSK_KF3CT_MMP9,
was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using
a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as
they relate to the gene MMP9. Furthermore, this assay endpoint can be referred to as a primary readout,
because this assay has produced multiple assay endpoints where this one serves a signaling function. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the protease
intended target family, where the subfamily is matrix metalloproteinase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: MMP-9 antibody is used to tag and quantify the level of matrix metallopeptidase 9 (gelatinase B,
92kDa gelatinase, 92kDa type IV collagenase) protein. Changes in the signals are indicative of protein expression
changes when conditioned to simulate proinflammation from cytokines and TGFb [GeneSymbol:MMP9 |
GenelD:4318 | Uniprot_SwissProt_Accession:P14780],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent keratinocytes and foreskin fibroblasts primary cell co-culture used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.028
Response cutoff threshold used to determine hit calls: 0.084
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Psoriasis and Dermatitis (KF3CT) system models cutaneous inflammation of the Thl type, an
environment that promotes monocyte and T cell adhesion and recruitment. This system is relevant for
cutaneous responses to tissue damage caused by mechanical, chemical, or infectious agents, as well as certain
states of psoriasis and dermatitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
490

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 152.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 270

BSK_KF3CT_SRB

1. General Information

1.1	Assay Title: Viability Assessment in the BioMAP Diversity Plus: KF3CT Assay

1.2	Assay Summary: BSK_KF3CT is a cell-based, multiplexed-readout assay that uses keratinocytes and foreskin
fibroblasts, a human skin primary cell co-culture, with measurements taken at 24 hours after chemical dosing
in a 96-well plate. BSK_KF3CT_SRB is an assay component measured in the BSK_KF3CT assay. It measures 0.1%
sulforhodamine related to cell death using Sulforhodamine staining technology. Data from the assay component
BSK_KF3CT_SRB was analyzed into 1 assay endpoint. This assay endpoint, BSK_KF3CT_SRB, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of viability
reporter, gain or loss-of-signal activity can be used to understand changes in the signaling. Furthermore, this
assay endpoint can be referred to as a primary readout, because this assay has produced multiple assay
endpoints where this one serves a signaling function. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the cell morphology intended target family, where the subfamily is
cell conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: 0.1% sulforhodamine is used to tag and quantify the total protein levels in the system. Changes to
these absorbance signals can be indicative of the viability in the system when conditioned to simulate
proinflammation from cytokines and TGFb.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent keratinocytes and foreskin fibroblasts primary cell co-culture used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.005
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: Sulforhodamine staining (Spectrophotometry)

2.6	Response: The Psoriasis and Dermatitis (KF3CT) system models cutaneous inflammation of the Thl type, an
environment that promotes monocyte and T cell adhesion and recruitment. This system is relevant for
cutaneous responses to tissue damage caused by mechanical, chemical, or infectious agents, as well as certain
states of psoriasis and dermatitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 135.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 272

BSK_KF3CT_TGFbl

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: KF3CT Assay for Transforming growth factor beta 1 (TGFbl) Biomarker
Activity

1.2	Assay Summary: BSK_KF3CT is a cell-based, multiplexed-readout assay that uses keratinocytes and foreskin
fibroblasts, a human skin primary cell co-culture, with measurements taken at 24 hours after chemical dosing
in a 96-well plate. BSK_KF3CT_TGFbl is an assay component measured in the BSK_KF3CT assay. It measures
TGF-bl antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_KF3CT_TGFbl was analyzed into 1 assay endpoint. This assay endpoint, BSK_KF3CT_TGFbl,
was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using
a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as
they relate to the gene TGFB1. Furthermore, this assay endpoint can be referred to as a primary readout,
because this assay has produced multiple assay endpoints where this one serves a signaling function. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the growth factor
intended target family, where the subfamily is transforming growth factor beta.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: TGF-bl antibody is used to tag and quantify the level of transforming growth factor, beta 1 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines and TGFb [GeneSymbol:TGFBl | GenelD:7040 |
Uniprot_SwissProt_Accession:P01137].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent keratinocytes and foreskin fibroblasts primary cell co-culture used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.036
Response cutoff threshold used to determine hit calls: 0.107
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Psoriasis and Dermatitis (KF3CT) system models cutaneous inflammation of the Thl type, an
environment that promotes monocyte and T cell adhesion and recruitment. This system is relevant for
cutaneous responses to tissue damage caused by mechanical, chemical, or infectious agents, as well as certain
states of psoriasis and dermatitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of growth factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

2

Standard maximum concentration tested:

59.4 nM
Neutral vehicle control:

DMSO

NA

-------
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and

-------
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1540	Number of chemicals tested: 1484

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 92.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 274

BSK_KF3CT_TI M P2

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: KF3CT Assay for TIMP metallopeptidase inhibitor 2 (TIMP2) Biomarker
Activity

1.2	Assay Summary: BSK_KF3CT is a cell-based, multiplexed-readout assay that uses keratinocytes and foreskin
fibroblasts, a human skin primary cell co-culture, with measurements taken at 24 hours after chemical dosing
in a 96-well plate. BSK_KF3CT_TIMP2 is an assay component measured in the BSK_KF3CT assay. It measures
TIMP-2 antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_KF3CT_TIMP2 was analyzed into 1 assay endpoint. This assay endpoint, BSK_KF3CT_TIMP2,
was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using
a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as
they relate to the gene TIMP2. Furthermore, this assay endpoint can be referred to as a primary readout,
because this assay has produced multiple assay endpoints where this one serves a signaling function. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the protease
inhibitor intended target family, where the subfamily is metalloproteinase inhibitor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: TIMP-2 antibody is used to tag and quantify the level of TIMP metallopeptidase inhibitor 2 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines and TGFb [GeneSymbol:TIMP2 | GenelD:7077 |
Uniprot_SwissProt_Accession:Q96MC4],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent keratinocytes and foreskin fibroblasts primary cell co-culture used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.031
Response cutoff threshold used to determine hit calls: 0.092
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Psoriasis and Dermatitis (KF3CT) system models cutaneous inflammation of the Thl type, an
environment that promotes monocyte and T cell adhesion and recruitment. This system is relevant for
cutaneous responses to tissue damage caused by mechanical, chemical, or infectious agents, as well as certain
states of psoriasis and dermatitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease inhibitor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and

-------
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 126.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 276

BSK_KF3CT_uPA

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: KF3CT Assay for urokinase plasminogen activator (uPA) Biomarker Activity

1.2	Assay Summary: BSK_KF3CT is a cell-based, multiplexed-readout assay that uses keratinocytes and foreskin
fibroblasts, a human skin primary cell co-culture, with measurements taken at 24 hours after chemical dosing
in a 96-well plate. BSK_KF3CTA is an assay component measured in the BSK_KF3CT assay. It measures uPA
antibody related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_KF3CTA was analyzed into 1 assay endpoint. This assay endpoint, BSK_KF3CTA, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of binding
reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they relate to the
gene PLAU. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints where this one serves a signaling function. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the protease intended target family, where the
subfamily is serine protease.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: uPA antibody is used to tag and quantify the level of plasminogen activator, urokinase protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from cytokines and TGFb [GeneSymbokPLAU | GenelD:5328 |
Uniprot_SwissProt_Accession:P00749],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent keratinocytes and foreskin fibroblasts primary cell co-culture used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.02
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Psoriasis and Dermatitis (KF3CT) system models cutaneous inflammation of the Thl type, an
environment that promotes monocyte and T cell adhesion and recruitment. This system is relevant for
cutaneous responses to tissue damage caused by mechanical, chemical, or infectious agents, as well as certain
states of psoriasis and dermatitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
336

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 142.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 278

BSK_LPS_CD40

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: LPS Assay for CD40 Biomarker Activity

1.2	Assay Summary: BSK_LPS is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_LPS_CD40 is an assay component measured in the BSK_LPS
assay. It measures CD40 antibody related to regulation of gene expression using ELISA technology. Data from
the assay component BSK_LPS_CD40 was analyzed into 1 assay endpoint. This assay endpoint, BSK_LPS_CD40,
was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using
a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as
they relate to the gene CD40. Furthermore, this assay endpoint can be referred to as a primary readout, because
this assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the cytokine intended target
family, where the subfamily is inflammatory factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: CD40 antibody is used to tag and quantify the level of CD40 molecule, TNF receptor superfamily
member 5 protein. Changes in the signals are indicative of protein expression changes when conditioned to
simulate proinflammation from Toll-like receptor (TLR4) activator [GeneSymbol:CD40 | GenelD:958 |
Uniprot_SwissProt_Accession:P25942],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.046
Response cutoff threshold used to determine hit calls: 0.138
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Monocyte Activation (LPS) system models chronic monocyte activation and vascular
inflammation. This system is relevant to inflammatory conditions where monocytes play a key role including
atherosclerosis, restenosis, rheumatoid arthritis, other chronic inflammatory conditions and metabolic diseases.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 146.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 280

BSK_LPS_Eselectin

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: LPS Assay for Eselectin Biomarker Activity

1.2	Assay Summary: BSK_LPS is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_LPS_Eselectin is an assay component measured in the
BSK_LPS assay. It measures E-selectin antibody related to regulation of gene expression using ELISA technology.
Data from the assay component BSK_LPS_Eselectin was analyzed into 1 assay endpoint. This assay endpoint,
BSK_LPS_Eselectin, was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline
of activity. Using a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in
the signaling as they relate to the gene SELE. Furthermore, this assay endpoint can be referred to as a primary
readout, because this assay has produced multiple assay endpoints where this one serves a signaling function.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell adhesion
molecules intended target family, where the subfamily is selectins.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: E-selectin antibody is used to tag and quantify the level of selectin E protein. Changes in the signals
are indicative of protein expression changes when conditioned to simulate proinflammation from Toll-like
receptor (TLR4) activator [GeneSymbol:SELE | GenelD:6401 | Uniprot_SwissProt_Accession:P16581],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.025
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Monocyte Activation (LPS) system models chronic monocyte activation and vascular
inflammation. This system is relevant to inflammatory conditions where monocytes play a key role including
atherosclerosis, restenosis, rheumatoid arthritis, other chronic inflammatory conditions and metabolic diseases.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 132.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 282

BSK_LPS_ILla

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: LPS Assay for lnterleukin-1 alpha (ILla) Biomarker Activity

1.2	Assay Summary: BSK_LPS is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_LPS_ILla is an assay component measured in the BSK_LPS
assay. It measures IL-a antibody related to regulation of gene expression using ELISA technology. Data from the
assay component BSK_LPS_ILla was analyzed into 1 assay endpoint. This assay endpoint, BSK_LPS_ILla, was
analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type
of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they
relate to the gene ILIA. Furthermore, this assay endpoint can be referred to as a primary readout, because this
assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the cytokine intended target
family, where the subfamily is interleukins.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: IL-a antibody is used to tag and quantify the level of interleukin 1, alpha protein. Changes in the
signals are indicative of protein expression changes when conditioned to simulate proinflammation from Toll-
like receptor (TLR4) activator [GeneSymboklLlA | GenelD:3552 | Uniprot_SwissProt_Accession:P01583].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.041
Response cutoff threshold used to determine hit calls: 0.123
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Monocyte Activation (LPS) system models chronic monocyte activation and vascular
inflammation. This system is relevant to inflammatory conditions where monocytes play a key role including
atherosclerosis, restenosis, rheumatoid arthritis, other chronic inflammatory conditions and metabolic diseases.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 120.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 284

BSK_LPS_IL8

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: LPS Assay for lnterleukin-8 (IL8) Biomarker Activity

1.2	Assay Summary: BSK_LPS is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_LPS_IL8 is an assay component measured in the BSK_LPS
assay. It measures IL-8 antibody related to regulation of gene expression using ELISA technology. Data from the
assay component BSK_LPS_IL8 was analyzed into 1 assay endpoint. This assay endpoint, BSK_LPS_IL8, was
analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type
of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they
relate to the gene CXCL8. Furthermore, this assay endpoint can be referred to as a primary readout, because
this assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the cytokine intended target
family, where the subfamily is interleukins.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: IL-8 antibody is used to tag and quantify the level of chemokine (C-X-C motif) ligand 8 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from Toll-like receptor (TLR4) activator [GeneSymbol:CXCL8 | GenelD:3576 |
Uniprot_SwissProt_Accession:P10145].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.03
Response cutoff threshold used to determine hit calls: 0.089
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Monocyte Activation (LPS) system models chronic monocyte activation and vascular
inflammation. This system is relevant to inflammatory conditions where monocytes play a key role including
atherosclerosis, restenosis, rheumatoid arthritis, other chronic inflammatory conditions and metabolic diseases.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 134.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 286

BSK_LPS_MCP1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: LPS Assay for Monocyte Chemoattractant Protein-1 (MCP1) Biomarker
Activity

1.2	Assay Summary: BSK_LPS is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_LPS_MCP1 is an assay component measured in the BSK_LPS
assay. It measures MCP-1 antibody related to regulation of gene expression using ELISA technology. Data from
the assay component BSK_LPS_MCP1 was analyzed into 1 assay endpoint. This assay endpoint, BSK_LPS_MCP1,
was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using
a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as
they relate to the gene CCL2. Furthermore, this assay endpoint can be referred to as a primary readout, because
this assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the cytokine intended target
family, where the subfamily is chemotactic factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: MCP-1 antibody is used to tag and quantify the level of chemokine (C-C motif) ligand 2 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from Toll-like receptor (TLR4) activator [GeneSymbol:CCL2 | GenelD:6347 |
Uniprot_SwissProt_Accession:P13500].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.03
Response cutoff threshold used to determine hit calls: 0.09
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Monocyte Activation (LPS) system models chronic monocyte activation and vascular
inflammation. This system is relevant to inflammatory conditions where monocytes play a key role including
atherosclerosis, restenosis, rheumatoid arthritis, other chronic inflammatory conditions and metabolic diseases.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
403

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 121.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 288

BSK_LPS_MCSF

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: LPS Assay for Macrophage colony-stimulating factor (MCSF) Biomarker
Activity

1.2	Assay Summary: BSK_LPS is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_LPS_MCSF is an assay component measured in the BSK_LPS
assay. It measures M-CSF antibody related to regulation of gene expression using ELISA technology. Data from
the assay component BSK_LPS_MCSF was analyzed into 1 assay endpoint. This assay endpoint, BSK_LPS_MCSF,
was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using
a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as
they relate to the gene CSF1. Furthermore, this assay endpoint can be referred to as a primary readout, because
this assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the cytokine intended target
family, where the subfamily is colony stimulating factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: M-CSF antibody is used to tag and quantify the level of colony stimulating factor 1 (macrophage)
protein. Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from Toll-like receptor (TLR4) activator [GeneSymbol:CSFl | GenelD:1435 |
Uniprot_SwissProt_Accession:P09603].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.035
Response cutoff threshold used to determine hit calls: 0.104
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Monocyte Activation (LPS) system models chronic monocyte activation and vascular
inflammation. This system is relevant to inflammatory conditions where monocytes play a key role including
atherosclerosis, restenosis, rheumatoid arthritis, other chronic inflammatory conditions and metabolic diseases.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
463

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 139.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 290

BSK_LPS_PGE2

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: LPS Assay for Prostaglandin E2 (PGE2) Biomarker Activity

1.2	Assay Summary: BSK_LPS is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_LPS_PGE2 is an assay component measured in the BSK_LPS
assay. It measures PGE2 antibody related to regulation of gene expression using ELISA technology. Data from
the assay component BSK_LPS_PGE2 was analyzed into 1 assay endpoint. This assay endpoint, BSK_LPS_PGE2,
was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using
a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as
they relate to the gene PTGER2. Furthermore, this assay endpoint can be referred to as a primary readout,
because this assay has produced multiple assay endpoints where this one serves a signaling function. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the gpcr intended
target family, where the subfamily is rhodopsin-like receptor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: PGE2 antibody is used to tag and quantify the level of prostaglandin E receptor 2 (subtype EP2),
53kDa protein. Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from Toll-like receptor (TLR4) activator [GeneSymbol:PTGER2 | GenelD:5732 |
Uniprot_SwissProt_Accession:P43116],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.047
Response cutoff threshold used to determine hit calls: 0.141
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Monocyte Activation (LPS) system models chronic monocyte activation and vascular
inflammation. This system is relevant to inflammatory conditions where monocytes play a key role including
atherosclerosis, restenosis, rheumatoid arthritis, other chronic inflammatory conditions and metabolic diseases.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 163.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 292

BSK_LPS_SRB

1. General Information

1.1	Assay Title: Viability Assessment in the BioMAP Diversity Plus: LPS Assay

1.2	Assay Summary: BSK_LPS is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_LPS_SRB is an assay component measured in the BSK_LPS
assay. It measures 0.1% sulforhodamine related to cell death using Sulforhodamine staining technology. Data
from the assay component BSK_LPS_SRB was analyzed into 1 assay endpoint. This assay endpoint, BSK_LPS_SRB,
was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using
a type of viability reporter, gain or loss-of-signal activity can be used to understand changes in the signaling.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a signaling function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell morphology intended target family, where the
subfamily is cell conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: 0.1% sulforhodamine is used to tag and quantify the total protein levels in the system. Changes to
these absorbance signals can be indicative of the viability in the system when conditioned to simulate
proinflammation from Toll-like receptor (TLR4) activator.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.018
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: Sulforhodamine staining (Spectrophotometry)

2.6	Response: The Monocyte Activation (LPS) system models chronic monocyte activation and vascular
inflammation. This system is relevant to inflammatory conditions where monocytes play a key role including
atherosclerosis, restenosis, rheumatoid arthritis, other chronic inflammatory conditions and metabolic diseases.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 125.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 294

BSK_LPS_TissueFactor

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: LPS Assay for Tissue Factor Biomarker Activity

1.2	Assay Summary: BSK_LPS is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_LPS_TissueFactor is an assay component measured in the
BSK_LPS assay. It measures CD142 Tissue Factor antibody related to regulation of gene expression using ELISA
technology. Data from the assay component BSK_LPS_TissueFactor was analyzed into 1 assay endpoint. This
assay endpoint, BSK_LPS_TissueFactor, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be used to
understand changes in the signaling as they relate to the gene F3. Furthermore, this assay endpoint can be
referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves a signaling function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cytokine intended target family, where the subfamily is coagulation factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: CD142 Tissue Factor antibody is used to tag and quantify the level of coagulation factor III
(thromboplastin, tissue factor) protein. Changes in the signals are indicative of protein expression changes when
conditioned to simulate proinflammation from Toll-like receptor (TLR4) activator [GeneSymbol:F3 |
GenelD:2152 | Uniprot_SwissProt_Accession:P13726],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.04
Response cutoff threshold used to determine hit calls: 0.121
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Monocyte Activation (LPS) system models chronic monocyte activation and vascular
inflammation. This system is relevant to inflammatory conditions where monocytes play a key role including
atherosclerosis, restenosis, rheumatoid arthritis, other chronic inflammatory conditions and metabolic diseases.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 108.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 296

BSK_LPS_TNFa

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: LPS Assay for Tumour Necrosis Factor alpha (TNFa) Biomarker Activity

1.2	Assay Summary: BSK_LPS is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_LPS_TNFa is an assay component measured in the BSK_LPS
assay. It measures TNF-a antibody related to regulation of gene expression using ELISA technology. Data from
the assay component BSK_LPS_TNFa was analyzed into 1 assay endpoint. This assay endpoint, BSK_LPS_TNFa,
was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using
a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as
they relate to the gene TNF. Furthermore, this assay endpoint can be referred to as a primary readout, because
this assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the cytokine intended target
family, where the subfamily is inflammatory factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: TNF-a antibody is used to tag and quantify the level of tumor necrosis factor protein. Changes in
the signals are indicative of protein expression changes when conditioned to simulate proinflammation from
Toll-like receptor (TLR4) activator [GeneSymbokTNF | GenelD:7124 | Uniprot_SwissProt_Accession:P01375].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.045
Response cutoff threshold used to determine hit calls: 0.136
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Monocyte Activation (LPS) system models chronic monocyte activation and vascular
inflammation. This system is relevant to inflammatory conditions where monocytes play a key role including
atherosclerosis, restenosis, rheumatoid arthritis, other chronic inflammatory conditions and metabolic diseases.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 160.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 298

BSK_LPS_VCAM 1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: LPS Assay for Vascular cell adhesion molecule 1 (VCAM1) Biomarker Activity

1.2	Assay Summary: BSK_LPS is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_LPS_VCAM1 is an assay component measured in the BSK_LPS
assay. It measures VCAM-1 antibody related to regulation of gene expression using ELISA technology. Data from
the assay component BSK_LPS_VCAM1 was analyzed into 1 assay endpoint. This assay endpoint,
BSK_LPS_VCAM1, was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline
of activity. Using a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in
the signaling as they relate to the gene VCAM1. Furthermore, this assay endpoint can be referred to as a primary
readout, because this assay has produced multiple assay endpoints where this one serves a signaling function.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell adhesion
molecules intended target family, where the subfamily is Immunoglobulin CAM.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: VCAM-1 antibody is used to tag and quantify the level of vascular cell adhesion molecule 1 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from Toll-like receptor (TLR4) activator [GeneSymbokVCAMl | GenelD:7412 |
Uniprot_SwissProt_Accession:P19320].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.022
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Monocyte Activation (LPS) system models chronic monocyte activation and vascular
inflammation. This system is relevant to inflammatory conditions where monocytes play a key role including
atherosclerosis, restenosis, rheumatoid arthritis, other chronic inflammatory conditions and metabolic diseases.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 153.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 300

BSK_SAg_CD38

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: SAg Assay for CD38 Biomarker Activity

1.2	Assay Summary: BSK_SAg is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_SAg_CD38 is one of 10 assay component(s) measured or
calculated from the BSK_SAg assay. It is designed to make measurements of enzyme-linked immunosorbent
assay, a form of binding reporter, as detected with fluorescence intensity signals by Enzyme-linked
immunosorbent assay technology. Data from the assay component BSK_SAg_CD38 was analyzed into 1 assay
endpoint. This assay endpoint, BSK_SAg_CD38, was analyzed with bidirectional fitting relative to DMSO as the
negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be
used to understand changes in the signaling as they relate to the gene CD38. Furthermore, this assay endpoint
can be referred to as a primary readout, because this assay has produced multiple assay endpoints where this
one serves a signaling function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the cytokine intended target family, where the subfamily is other cytokine.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: CD38 antibody is used to tag and quantify the level of CD38 molecule protein. Changes in the
signals are indicative of protein expression changes when conditioned to simulate proinflammation from T-cell
Receptor (TCR) activation [GeneSymbol:CD38 | GenelD:952 | Uniprot_SwissProt_Accession:P28907].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.024
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The T Cell Activation (SAg) system models vascular inflammation and T cell activation. This system is
relevant to inflammatory conditions where T cells play a key role including organ transplantation, rheumatoid
arthritis, psoriasis, Crohn's disease and multiple sclerosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 126.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 302

BSK_SAg_CD40

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: SAg Assay for CD40 Biomarker Activity

1.2	Assay Summary: BSK_SAg is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_SAg_CD40 is one of 10 assay component(s) measured or
calculated from the BSK_SAg assay. It is designed to make measurements of enzyme-linked immunosorbent
assay, a form of binding reporter, as detected with fluorescence intensity signals by Enzyme-linked
immunosorbent assay technology. Data from the assay component BSK_SAg_CD40 was analyzed into 1 assay
endpoint. This assay endpoint, BSK_SAg_CD40, was analyzed with bidirectional fitting relative to DMSO as the
negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be
used to understand changes in the signaling as they relate to the gene CD40. Furthermore, this assay endpoint
can be referred to as a primary readout, because this assay has produced multiple assay endpoints where this
one serves a signaling function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the cytokine intended target family, where the subfamily is inflammatory factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: CD40 antibody is used to tag and quantify the level of CD40 molecule, TNF receptor superfamily
member 5 protein. Changes in the signals are indicative of protein expression changes when conditioned to
simulate proinflammation from T-cell Receptor (TCR) activation [GeneSymbol:CD40 | GenelD:958 |
Uniprot_SwissProt_Accession:P25942],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.033
Response cutoff threshold used to determine hit calls: 0.1
Detection technology used: ELISA (Fluorescence)

2.6	Response: The T Cell Activation (SAg) system models vascular inflammation and T cell activation. This system is
relevant to inflammatory conditions where T cells play a key role including organ transplantation, rheumatoid
arthritis, psoriasis, Crohn's disease and multiple sclerosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
534

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 145.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 304

BSK_SAg_CD69

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: SAg Assay for CD69 Biomarker Activity

1.2	Assay Summary: BSK_SAg is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_SAg_CD69 is one of 10 assay component(s) measured or
calculated from the BSK_SAg assay. It is designed to make measurements of enzyme-linked immunosorbent
assay, a form of binding reporter, as detected with fluorescence intensity signals by Enzyme-linked
immunosorbent assay technology. Data from the assay component BSK_SAg_CD69 was analyzed into 1 assay
endpoint. This assay endpoint, BSK_SAg_CD69, was analyzed with bidirectional fitting relative to DMSO as the
negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be
used to understand changes in the signaling as they relate to the gene CD69. Furthermore, this assay endpoint
can be referred to as a primary readout, because this assay has produced multiple assay endpoints where this
one serves a signaling function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the cytokine intended target family, where the subfamily is inflammatory factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: CD69 antibody is used to tag and quantify the level of CD69 molecule protein. Changes in the
signals are indicative of protein expression changes when conditioned to simulate proinflammation from T-cell
Receptor (TCR) activation [GeneSymbol:CD69 | GenelD:969 | Uniprot_SwissProt_Accession:Q07108].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.035
Response cutoff threshold used to determine hit calls: 0.106
Detection technology used: ELISA (Fluorescence)

2.6	Response: The T Cell Activation (SAg) system models vascular inflammation and T cell activation. This system is
relevant to inflammatory conditions where T cells play a key role including organ transplantation, rheumatoid
arthritis, psoriasis, Crohn's disease and multiple sclerosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 161.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 306

BSK_SAg_Eselectin

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: SAg Assay for Eselectin Biomarker Activity

1.2	Assay Summary: BSK_SAg is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_SAg_Eselectin is an assay component measured in the
BSK_SAg assay. It measures E-selectin antibody related to regulation of gene expression using ELISA technology.
Data from the assay component BSK_SAg_Eselectin was analyzed into 1 assay endpoint. This assay endpoint,
BSK_SAg_Eselectin, was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline
of activity. Using a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in
the signaling as they relate to the gene SELE. Furthermore, this assay endpoint can be referred to as a primary
readout, because this assay has produced multiple assay endpoints where this one serves a signaling function.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell adhesion
molecules intended target family, where the subfamily is selectins.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: E-selectin antibody is used to tag and quantify the level of selectin E protein. Changes in the signals
are indicative of protein expression changes when conditioned to simulate proinflammation from T-cell
Receptor (TCR) activation [GeneSymbol:SELE | GenelD:6401 | Uniprot_SwissProt_Accession:P16581],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.038
Response cutoff threshold used to determine hit calls: 0.115
Detection technology used: ELISA (Fluorescence)

2.6	Response: The T Cell Activation (SAg) system models vascular inflammation and T cell activation. This system is
relevant to inflammatory conditions where T cells play a key role including organ transplantation, rheumatoid
arthritis, psoriasis, Crohn's disease and multiple sclerosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 165.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 308

BSK_SAg_IL8

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: SAg Assay for lnterleukin-8 (IL8) Biomarker Activity

1.2	Assay Summary: BSK_SAg is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_SAg_IL8 is an assay component measured in the BSK_SAg
assay. It measures IL-8 antibody related to regulation of gene expression using ELISA technology. Data from the
assay component BSK_SAg_IL8 was analyzed into 1 assay endpoint. This assay endpoint, BSK_SAg_IL8, was
analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type
of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they
relate to the gene CXCL8. Furthermore, this assay endpoint can be referred to as a primary readout, because
this assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the cytokine intended target
family, where the subfamily is interleukins.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: IL-8 antibody is used to tag and quantify the level of chemokine (C-X-C motif) ligand 8 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from T-cell Receptor (TCR) activation [GeneSymbol:CXCL8 | GenelD:3576 |
Uniprot_SwissProt_Accession:P10145].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.05
Response cutoff threshold used to determine hit calls: 0.15
Detection technology used: ELISA (Fluorescence)

2.6	Response: The T Cell Activation (SAg) system models vascular inflammation and T cell activation. This system is
relevant to inflammatory conditions where T cells play a key role including organ transplantation, rheumatoid
arthritis, psoriasis, Crohn's disease and multiple sclerosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 135.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 310

BSK_SAgJVICPl

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: SAg Assay for Monocyte Chemoattractant Protein-1 (MCP1) Biomarker
Activity

1.2	Assay Summary: BSK_SAg is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_SAg_MCPl is an assay component measured in the BSK_SAg
assay. It measures MCP-1 antibody related to regulation of gene expression using ELISA technology. Data from
the assay component BSK_SAg_MCPl was analyzed into 1 assay endpoint. This assay endpoint, BSK_SAg_MCPl,
was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using
a type of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as
they relate to the gene CCL2. Furthermore, this assay endpoint can be referred to as a primary readout, because
this assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the cytokine intended target
family, where the subfamily is chemotactic factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: MCP-1 antibody is used to tag and quantify the level of chemokine (C-C motif) ligand 2 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from T-cell Receptor (TCR) activation [GeneSymbol:CCL2 | GenelD:6347 |
Uniprot_SwissProt_Accession:P13500].

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,

-------
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

Baseline median absolute deviation for the assay (bmad): 0.032
Response cutoff threshold used to determine hit calls: 0.095
Detection technology used: ELISA (Fluorescence)

2.6	Response: The T Cell Activation (SAg) system models vascular inflammation and T cell activation. This system is
relevant to inflammatory conditions where T cells play a key role including organ transplantation, rheumatoid
arthritis, psoriasis, Crohn's disease and multiple sclerosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

I arget (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 2035

Active hit count: hitc>0.9
443

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 126.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 312

BSK_SAgJVIIG

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: SAg Assay for Monokine induced gamma interferon (MIG) Biomarker Activity

1.2	Assay Summary: BSK_SAg is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_SAg_MIG is an assay component measured in the BSK_SAg
assay. It measures MIG antibody related to regulation of gene expression using ELISA technology. Data from the
assay component BSK_SAg_MIG was analyzed into 1 assay endpoint. This assay endpoint, BSK_SAg_MIG, was
analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type
of binding reporter, gain or loss-of-signal activity can be used to understand changes in the signaling as they
relate to the gene CXCL9. Furthermore, this assay endpoint can be referred to as a primary readout, because
this assay has produced multiple assay endpoints where this one serves a signaling function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the cytokine intended target
family, where the subfamily is chemotactic factor.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: MIG antibody is used to tag and quantify the level of chemokine (C-X-C motif) ligand 9 protein.
Changes in the signals are indicative of protein expression changes when conditioned to simulate
proinflammation from T-cell Receptor (TCR) activation [GeneSymbol:CXCL9 | GenelD:4283 |
Uniprot_SwissProt_Accession:Q07325],

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.009
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The T Cell Activation (SAg) system models vascular inflammation and T cell activation. This system is
relevant to inflammatory conditions where T cells play a key role including organ transplantation, rheumatoid
arthritis, psoriasis, Crohn's disease and multiple sclerosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 110.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 314

BSK_SAg_PBMCCytotoxicity

1. General Information

1.1	Assay Title: PBMC Cytotoxicity Assessment in the BioMAP Diversity Plus: SAg Assay

1.2	Assay Summary: BSK_SAg is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_SAg_PBMCCytotoxicity is an assay component measured in
the BSK_SAg assay. It measures Alamar blue related to cell death using Alamar Blue Reduction technology. Data
from the assay component BSK_SAg_PBMCCytotoxicity was analyzed into 1 assay endpoint. This assay endpoint,
BSK_SAg_PBMCCytotoxicity, was analyzed with bidirectional fitting relative to DMSO as the negative control
and baseline of activity. Using a type of viability reporter, gain or loss-of-signal activity can be used to
understand changes in the viability. Furthermore, this assay endpoint can be referred to as a secondary readout,
because this assay has produced multiple assay endpoints where this one serves a viability function. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell cycle
intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Changes to fluorescence intensity signals produced from an enzymatic reaction involving the key
substrate [Alamar blue] are correlated to the viability of the cells in the system.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.022
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: Alamar Blue Reduction (Fluorescence)

2.6	Response: The T Cell Activation (SAg) system models vascular inflammation and T cell activation. This system is
relevant to inflammatory conditions where T cells play a key role including organ transplantation, rheumatoid
arthritis, psoriasis, Crohn's disease and multiple sclerosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Immunosuppression: Assays associated with markers of immunosuppression, see 10.14573/altex.2203041

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 153.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen

-------
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 316

BSK_SAg_Proliferation

1. General Information

1.1	Assay Title: Cell Proliferation Assessment in the BioMAP Diversity Plus: SAg Assay

1.2	Assay Summary: BSK_SAg is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_SAg_Proliferation is an assay component in the BSK_SAg
assay. It measures protein content, a form of viability reporter, as detected with absorbance signals by
Sulforhodamine staining technology. Data from the assay component BSK_SAg_Proliferation was analyzed into
1 assay endpoint. This assay endpoint, BSK_SAg_Proliferation, was analyzed with bidirectional fitting relative
to DMSO as the negative control and baseline of activity. Using a type of viability reporter, measures of total
protein for gain or loss-of-signal activity can be used to understand the viability at the cellular-level.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a viability function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily
is proliferation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: 0.1% sulforhodamine is used to tag and quantify the total protein levels in the system. Changes to
these absorbance signals can be indicative of the viability in the system when conditioned to simulate
proinflammation from T-cell Receptor (TCR) activation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.04

Response cutoff threshold used to determine hit calls: 0.12

Detection technology used: Sulforhodamine staining (Spectrophotometry)

2.6	Response: The T Cell Activation (SAg) system models vascular inflammation and T cell activation. This system is
relevant to inflammatory conditions where T cells play a key role including organ transplantation, rheumatoid
arthritis, psoriasis, Crohn's disease and multiple sclerosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Immunosuppression: Assays associated with markers of immunosuppression, see 10.14573/altex.2203041

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 200.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 318

BSK_SAg_SRB

1. General Information

1.1	Assay Title: Viability Assessment in the BioMAP Diversity Plus: SAg Assay

1.2	Assay Summary: BSK_SAg is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_SAg_SRB is an assay component measured in the BSK_SAg
assay. It measures protein content, a form of viability reporter, as detected with absorbance signals by
Sulforhodamine staining technology. Data from the assay component BSK_SAg_SRB was analyzed into 1 assay
endpoint. This assay endpoint, BSK_SAg_SRB, was analyzed with bidirectional fitting relative to DMSO as the
negative control and baseline of activity. Using a type of viability reporter, gain or loss-of-signal activity can be
used to understand changes in the signaling. Furthermore, this assay endpoint can be referred to as a primary
readout, because this assay has produced multiple assay endpoints where this one serves a signaling function.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell
morphology intended target family, where the subfamily is cell conformation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: 0.1% sulforhodamine is used to tag and quantify the total protein levels in the system. Changes to
these absorbance signals can be indicative of the viability in the system when conditioned to simulate
proinflammation from T-cell Receptor (TCR) activation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.017
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: Sulforhodamine staining (Spectrophotometry)

2.6	Response: The T Cell Activation (SAg) system models vascular inflammation and T cell activation. This system is
relevant to inflammatory conditions where T cells play a key role including organ transplantation, rheumatoid
arthritis, psoriasis, Crohn's disease and multiple sclerosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2035	Number of chemicals tested: 1705

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 118.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2809

BSK_BT_Bcel l_Prol iferation

1. General Information

1.1	Assay Title: Viability Assessment in the BioMAP Diversity Plus: BT Assay for B cell Proliferation

1.2	Assay Summary: BSK_BT is a cell-based, multiplexed-readout assay that uses
B cell and peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements
taken at 24 hours after chemical dosing in a microplate: 96-well plate. BSK_BT_Bcell_Proliferation is an assay
component measured in the BSK_BT assay. It measures protein content, a form of viability reporter, as detected
with absorbance signals by Sulforhodamine staining technology. Data from the assay component
BSK_BT_Bcell_Proliferation was analyzed at the endpoint, BSK_BT_Bcell_Proliferation, in the positive analysis
fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of viability
reporter, gain or loss-of-signal activity can be used to understand cellular changes in viability. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the 'cell cycle1 intended target
family, where the subfamily is 'proliferation1.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: B cell proliferation is a critical event driving both adaptive immunity (antibody production) as well
as auto-immune diseases where B cells are key disease players (Lupus, MS, RA etc). Inhibition of B cell
proliferation is considered an immune supressive effect.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: suspension B and peripheral blood mononuclear cells primary cell co-culture used.
Primary human cell types used in BioMAP systems and their stimuli included the following: 3C System
(HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and
HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR
ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial
epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.025
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: Sulforhodamine staining (Spectrophotometry)

2.6	Response: The B and T Cell Autoimmunity (BT) system models T cell dependent B cell activation and class
switching as would occur in a germinal center. This system is relevant for diseases and conditions where B cell
activation and antibody production are relevant. These include autoimmune disease, oncology, asthma and
allergy.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Immunosuppression: Assays associated with markers of immunosuppression, see 10.14573/altex.2203041

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767	Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 63.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2811

BSK_BT_PBMCCytotoxicity

1. General Information

1.1	Assay Title: PBMC Cytotoxicity Assessment in the BioMAP Diversity Plus: BT Assay

1.2	Assay Summary: BSK_BT is a cell-based, multiplexed-readout assay that uses
B cell and peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements
taken at 24 hours after chemical dosing in a microplate: 96-well plate. BSK_BT_PBMCCytotoxicity is an assay
component measured in the BSK_BT assay. It measures dehydrogenase activity, a form of viability reporter, as
detected with fluorescence intensity signals by Alamar Blue Reduction technology. Data from the assay
component BSK_BT_PBMCCytotoxicity was analyzed at the endpoint, BSK_BT_PBMCCytotoxicity, in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of viability reporter, gain or loss-of-signal activity can be used to understand cellular changes in viability. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the 'cell cycle1
intended target family, where the subfamily is 'cytotoxicity1.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: PBMC Cytotoxicity in the BT system is a measure of the cell death of PBMC. Cell viability of non-
adherent cells is measured by alamarBlue® staining, a method based on a cell permeable compound that emitts
fluorescence after entering cells. The number of living cells is proportional to the amount of fluorescence
produced

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: suspension B and peripheral blood mononuclear cells primary cell co-culture used.
Primary human cell types used in BioMAP systems and their stimuli included the following: 3C System
(HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and
HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR
ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial
epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.02
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: Alamar Blue Reduction (Fluorescence)

2.6	Response: The B and T Cell Autoimmunity (BT) system models T cell dependent B cell activation and class
switching as would occur in a germinal center. This system is relevant for diseases and conditions where B cell
activation and antibody production are relevant. These include autoimmune disease, oncology, asthma and
allergy.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Immunosuppression: Assays associated with markers of immunosuppression, see 10.14573/altex.2203041

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,

-------
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767

Number of chemicals tested: 577

Active hit count: hitc>0.9
150

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 59.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2813

BSK_BT_slgG

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BT Assay for Secreted IgG (slgG) Biomarker Activity

1.2	Assay Summary: BSK_BT is a cell-based, multiplexed-readout assay that uses
B cell and peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements
taken at 24 hours after chemical dosing in a microplate: 96-well plate. BSK_BT_slgG is an assay component
measured in the BSK_BT assay. It measures secreted IgG related to regulation of gene expression using ELISA
technology. Data from the assay component BSK_BT_slgG was analyzed at the endpoint, BSK_BT_slgG, in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of cytokine quantitation reporter, gain or loss-of-signal activity can be used to understand protein changes. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the
'immunoglobulin' intended target family, where the subfamily is 'IgG'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Secreted IgG (slgG) is produced by B cells and is the main type of antibody found in blood and
extracellular fluid that mediates the immune response against pathogens. slgG is categorized as an
immunomodulatory-related activity in the BT system modeling T cell depdendent B cell activation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: suspension B and peripheral blood mononuclear cells primary cell co-culture used.
Primary human cell types used in BioMAP systems and their stimuli included the following: 3C System
(HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and
HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR
ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial
epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.062
Response cutoff threshold used to determine hit calls: 0.185
Detection technology used: ELISA (Fluorescence)

2.6	Response: The B and T Cell Autoimmunity (BT) system models T cell dependent B cell activation and class
switching as would occur in a germinal center. This system is relevant for diseases and conditions where B cell
activation and antibody production are relevant. These include autoimmune disease, oncology, asthma and
allergy.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Immunosuppression: Assays associated with markers of immunosuppression, see 10.14573/altex.2203041

Additionally, this assay was annotated to the intended target family of immunoglobulin.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767	Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 77.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2815

BSK_BT_xlL17A

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: BT Assay for Interleukin 17A (IL17A) Biomarker Activity

1.2	Assay Summary: BSK_BT is a cell-based, multiplexed-readout assay that uses
B cell and peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements
taken at 24 hours after chemical dosing in a microplate: 96-well plate. BSK_BT_xlL17A is an assay component
measured in the BSK_BT assay. It measures secreted IL-17A related to regulation of gene expression using
LumineX xMAP technology. Data from the assay component BSK_BT_xlL17A was analyzed at the endpoint,
BSK_BT_xlL17A, in the positive analysis fitting direction relative to DMSO as the negative control and baseline
of activity. Using a type of cytokine quantitation reporter, gain or loss-of-signal activity can be used to
understand protein changes. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the 'cytokine' intended target family, where the subfamily is 'interleukins'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Interleukin 17A (IL-17A) is a proinflammatory cytokine produced by T cells that induces cytokine
production and mediates monocyte and neutrophil recruitment to sites of inflammation. Secreted IL-17A (sIL-

-------
17A) is categorized as an immunomodulatory-related activity in the BT system modeling T cell depdendent B
cell activation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: suspension B and peripheral blood mononuclear cells primary cell co-culture used.
Primary human cell types used in BioMAP systems and their stimuli included the following: 3C System
(HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and
HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR
ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial
epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.143
Response cutoff threshold used to determine hit calls: 0.428
Detection technology used: LumineXxMAP (Fluorescence)

2.6	Response: The B and T Cell Autoimmunity (BT) system models T cell dependent B cell activation and class
switching as would occur in a germinal center. This system is relevant for diseases and conditions where B cell
activation and antibody production are relevant. These include autoimmune disease, oncology, asthma and
allergy.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767	Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 62.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2817

BSK_BT_xlL17F

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: BT Assay for Interleukin 17F (IL17F) Biomarker Activity

1.2	Assay Summary: BSK_BT is a cell-based, multiplexed-readout assay that uses
B cell and peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements
taken at 24 hours after chemical dosing in a microplate: 96-well plate. BSK_BT_xlL17F is an assay component
measured in the BSK_BT assay. It measures secreted IL-17F related to regulation of gene expression using
LumineX xMAP technology. Data from the assay component BSK_BT_xlL17F was analyzed at the endpoint,
BSK_BT_xlL17F, in the positive analysis fitting direction relative to DMSO as the negative control and baseline
of activity. Using a type of cytokine quantitation reporter, gain or loss-of-signal activity can be used to
understand protein changes. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the 'cytokine' intended target family, where the subfamily is 'interleukins'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Interleukin 17F (IL-17F) is a proinflammatory cytokine produced by T cells that induces cytokine,
chemokine and adhesion molecule production and mediates neutrophil recruitment to sites of inflammation.

-------
Secreted IL-17F (slL-17F) is categorized as an immunomodulatory-related activity in the BT system modeling T
cell depdendent B cell activation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: suspension B and peripheral blood mononuclear cells primary cell co-culture used.
Primary human cell types used in BioMAP systems and their stimuli included the following: 3C System
(HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and
HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR
ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial
epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.074
Response cutoff threshold used to determine hit calls: 0.221
Detection technology used: LumineXxMAP (Fluorescence)

2.6	Response: The B and T Cell Autoimmunity (BT) system models T cell dependent B cell activation and class
switching as would occur in a germinal center. This system is relevant for diseases and conditions where B cell
activation and antibody production are relevant. These include autoimmune disease, oncology, asthma and
allergy.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767	Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 58.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2819

BSK_BT_xll_2

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BT Assay for Interleukin 2 (IL2) Biomarker Activity

1.2	Assay Summary: BSK_BT is a cell-based, multiplexed-readout assay that uses
B cell and peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements
taken at 24 hours after chemical dosing in a microplate: 96-well plate. BSK_BT_xlL2 is an assay component
measured in the BSK_BT assay. It measures secreted IL-2 related to regulation of gene expression using LumineX
xMAP technology. Data from the assay component BSK_BT_xlL2 was analyzed at the endpoint, BSK_BT_xlL2, in
the positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a
type of cytokine quantitation reporter, gain or loss-of-signal activity can be used to understand protein changes.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the 'cytokine'
intended target family, where the subfamily is 'interleukins'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Interleukin 2 (IL-2) is a secreted proinflammatory cytokine produced by T cells that regulates
lymphocyte proliferation and promotes T cell differentiation. Secreted IL-2 (IL-2) is categorized as an
immunomodulatory-related activity in the BT system modeling T cell depdendent B cell activation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: suspension B and peripheral blood mononuclear cells primary cell co-culture used.
Primary human cell types used in BioMAP systems and their stimuli included the following: 3C System
(HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and
HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR
ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial
epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.072
Response cutoff threshold used to determine hit calls: 0.216
Detection technology used: LumineXxMAP (Fluorescence)

2.6	Response: The B and T Cell Autoimmunity (BT) system models T cell dependent B cell activation and class
switching as would occur in a germinal center. This system is relevant for diseases and conditions where B cell
activation and antibody production are relevant. These include autoimmune disease, oncology, asthma and
allergy.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767	Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 66.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2821

BSK_BT_xlL6

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BT Assay for Interleukin 6 (IL6) Biomarker Activity

1.2	Assay Summary: BSK_BT is a cell-based, multiplexed-readout assay that uses
B cell and peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements
taken at 24 hours after chemical dosing in a microplate: 96-well plate. BSK_BT_xlL6 is an assay component
measured in the BSK_BT assay. It measures secreted IL-6 related to regulation of gene expression using LumineX
xMAP technology. Data from the assay component BSK_BT_xlL6 was analyzed at the endpoint, BSK_BT_xlL6, in
the positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a
type of cytokine quantitation reporter, gain or loss-of-signal activity can be used to understand protein changes.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the 'cytokine'
intended target family, where the subfamily is 'interleukins'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Interleukin 6 (IL-6) is a secreted proinflammatory cytokine and acute phase reactant. Secreted IL-
6 (slL-6) is categorized as an immunomodulatory-related activity in the BT system modeling T cell depdendent
B cell activation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: suspension B and peripheral blood mononuclear cells primary cell co-culture used.
Primary human cell types used in BioMAP systems and their stimuli included the following: 3C System
(HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and
HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR
ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial
epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.09
Response cutoff threshold used to determine hit calls: 0.269
Detection technology used: LumineXxMAP (Fluorescence)

2.6	Response: The B and T Cell Autoimmunity (BT) system models T cell dependent B cell activation and class
switching as would occur in a germinal center. This system is relevant for diseases and conditions where B cell
activation and antibody production are relevant. These include autoimmune disease, oncology, asthma and
allergy.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767	Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 68.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2823

BSK_BT_xTNFa

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BT Assay for Tumor necrosis factor alpha (TNF-alpha) Biomarker Activity

1.2	Assay Summary: BSK_BT is a cell-based, multiplexed-readout assay that uses
B cell and peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements
taken at 24 hours after chemical dosing in a microplate: 96-well plate. BSK_BT_xTNFa is an assay component of
the BSK_BT assay. It measures secreted TNF-alpha related to regulation of gene expression using LumineXxMAP
technology. Data from the assay component BSK_BT_xTNFa was analyzed at the endpoint, BSK_BT_xTNFa, in
the positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a
type of cytokine quantitation reporter, gain or loss-of-signal activity can be used to understand protein changes.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the 'cytokine'
intended target family, where the subfamily is 'inflammatory factor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Tumor necrosis factor alpha (TNF-alpha) is a secreted proinflammatory cytokine involved in Thl
inflammation. Secreted TNF-alpha is categorized as an inflammation-related activity in the BT system modeling
T cell dependent B cell activation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: suspension B and peripheral blood mononuclear cells primary cell co-culture used.
Primary human cell types used in BioMAP systems and their stimuli included the following: 3C System
(HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and
HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR
ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial
epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.057
Response cutoff threshold used to determine hit calls: 0.171
Detection technology used: LumineXxMAP (Fluorescence)

2.6	Response: The B and T Cell Autoimmunity (BT) system models T cell dependent B cell activation and class
switching as would occur in a germinal center. This system is relevant for diseases and conditions where B cell
activation and antibody production are relevant. These include autoimmune disease, oncology, asthma and
allergy.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767	Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 39.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2825

BSK_Myo F_ACTA1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: MyoF Assay for Actin alpha 1 (ACTA1) Biomarker Activity

1.2	Assay Summary: BSK_MyoF is a cell-based, multiplexed-readout assay that uses lung fibroblasts, a human
vascular primary cell, with measurements taken at 24 hours after chemical dosing in a microplate: 96-well plate.
BSK_MyoF_ACTAl is an assay component measured in the BSK_MyoF assay. It measures a-SMA antibody
related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_MyoF_ACTAl was analyzed at the endpoint, BSK_MyoF_ACTAl, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand protein changes. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the 'cell adhesion molecules' intended target family, where the
subfamily is 'actin'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: alpha-Smooth muscle actin (a-SMA) is a protein involved in muscle contraction, cell motility,
structure and integrity and is a marker for activated myofibroblast phenotype. a-SMA is categorized as a tissue
remodeling-related activity in the MyoF system modeling pulmonary myofibroblast development.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent lung fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.033
Response cutoff threshold used to determine hit calls: 0.1
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Fibrosis (MyoF) system models the development of pulmonary myofibroblasts, and is relevant
to respiratory disease settings as well as other chronic inflammatory settings where fibrosis occurs, such as
rheumatoid arthritis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767	Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 80.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2827

BSK_MyoF_bFGF

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: MyoF Assay for Basic fibroblast growth factor (bFGF) Biomarker Activity

1.2	Assay Summary: BSK_MyoF is a cell-based, multiplexed-readout assay that uses lung fibroblasts, a human
vascular primary cell, with measurements taken at 24 hours after chemical dosing in a microplate: 96-well plate.
BSK_MyoF_bFGF is an assay component measured in the BSK_MyoF assay. It measures bFGF antibody related
to regulation of gene expression using ELISA technology. Data from the assay component BSK_MyoF_bFGF was
analyzed at the endpoint, BSK_MyoF_bFGF, in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be
used to understand protein changes. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the 'growth factor1 intended target family, where the subfamily is 'basic fibroblast
growth factor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Basic fibroblast growth factor (bFGF) is a pro-fibrotic growth factor that drives fibroblast
proliferation, migration and fibronectin synthesis. bFGF is categorized as a tissue remodeling-related activity in
the MyoF system modeling pulmonary myofibroblast development.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent lung fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.021
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Fibrosis (MyoF) system models the development of pulmonary myofibroblasts, and is relevant
to respiratory disease settings as well as other chronic inflammatory settings where fibrosis occurs, such as
rheumatoid arthritis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of growth factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767	Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 73.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2829

BSK_Myo F_VCAM 1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: MyoF Assay for Vascular cell adhesion molecule 1 (VCAM1) Biomarker
Activity

1.2	Assay Summary: BSK_MyoF is a cell-based, multiplexed-readout assay that uses lung fibroblasts, a human
vascular primary cell, with measurements taken at 24 hours after chemical dosing in a microplate: 96-well plate.
BSK_MyoF_VCAMl is an assay component measured in the BSK_MyoF assay. It measures VCAM-1 antibody
related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_MyoF_VCAMl was analyzed at the endpoint, BSK_MyoF_VCAMl, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand protein changes. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the 'cell adhesion molecules' intended target family, where the
subfamily is 'Immunoglobulin CAM'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Vascular Cell Adhesion Molecule 1 (VCAM-1/CD106) is a cell adhesion molecule that mediates
adhesion of monocytes and T cells to endothelial cells. VCAM-1 is categorized as an inflammation-related
activity in the MyoF system modeling pulmonary myofibroblast development.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent lung fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.03
Response cutoff threshold used to determine hit calls: 0.09
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Fibrosis (MyoF) system models the development of pulmonary myofibroblasts, and is relevant
to respiratory disease settings as well as other chronic inflammatory settings where fibrosis occurs, such as
rheumatoid arthritis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767	Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 64.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2831

BSK_MyoF_Collagen I

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: MyoF Assay for Collagen I Biomarker Activity

1.2	Assay Summary: BSK_MyoF is a cell-based, multiplexed-readout assay that uses lung fibroblasts, a human
vascular primary cell, with measurements taken at 24 hours after chemical dosing in a microplate: 96-well plate.
BSK_MyoF_Collagenl is an assay component measured in the BSK_MyoF assay. It measures Collagen I antibody
related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_MyoF_Collagenl was analyzed at the endpoint, BSK_MyoF_Collagenl, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'cell adhesion molecules' intended target family, where
the subfamily is 'collagen'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Collagen I is involved in tissue remodeling and fibrosis, and is the most common fibrillar collagen
that is found in skin, bone, tendons and other connective tissues. Collagen I is categorized a tissue remodeling-
related activity in the MyoF system modeling pulmonary myofibroblast development.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent lung fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.03
Response cutoff threshold used to determine hit calls: 0.091
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Fibrosis (MyoF) system models the development of pulmonary myofibroblasts, and is relevant
to respiratory disease settings as well as other chronic inflammatory settings where fibrosis occurs, such as
rheumatoid arthritis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767	Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 61.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2833

BSK_Myo F_Col lagen 111

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: MyoF Assay for Collagen III Biomarker Activity

1.2	Assay Summary: BSK_MyoF is a cell-based, multiplexed-readout assay that uses lung fibroblasts, a human
vascular primary cell, with measurements taken at 24 hours after chemical dosing in a microplate: 96-well plate.
BSK_MyoF_Collagenlll is an assay component measured in the BSK_MyoF assay. It measures Collagen III
antibody related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_MyoF_Collagenlll was analyzed at the endpoint, BSK_MyoF_Collagenlll, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'cell adhesion molecules' intended target family, where
the subfamily is 'collagen'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Collagen III is an extracellular matrix protein and fibrillar collagen found in extensible connective
tissues (skin, lung and vascular system) and is involved in cell adhesion, cell migration, tissue remodeling.

-------
Collagen III is categorized a tissue remodeling-related activity in the MyoF system modeling pulmonary
myofibroblast development.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent lung fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.03
Response cutoff threshold used to determine hit calls: 0.091
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Fibrosis (MyoF) system models the development of pulmonary myofibroblasts, and is relevant
to respiratory disease settings as well as other chronic inflammatory settings where fibrosis occurs, such as
rheumatoid arthritis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767	Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 66.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2835

BSK_Myo F_Col lagen IV

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: MyoF Assay for Collagen IV Biomarker Activity

1.2	Assay Summary: BSK_MyoF is a cell-based, multiplexed-readout assay that uses lung fibroblasts, a human
vascular primary cell, with measurements taken at 24 hours after chemical dosing in a microplate: 96-well plate.
BSK_MyoF_CollagenlV is an assay component measured in the BSK_MyoF assay. It measures Collagen IV
antibody related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_MyoF_CollagenlV was analyzed at the endpoint, BSK_MyoF_CollagenlV, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'cell adhesion molecules' intended target family, where
the subfamily is 'collagen'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Collagen IV is the major structural component of the basal lamina. Collagen IV is categorized a
tissue remodeling-related activity in the MyoF system modeling pulmonary myofibroblast development.

-------
The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent lung fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

-------
Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.048
Response cutoff threshold used to determine hit calls: 0.143
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Fibrosis (MyoF) system models the development of pulmonary myofibroblasts, and is relevant
to respiratory disease settings as well as other chronic inflammatory settings where fibrosis occurs, such as
rheumatoid arthritis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

NA

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767

Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
129

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 61.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2837

BSK_MyoF_IL8

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: MyoF Assay for lnterleukin-8 (IL8) Biomarker Activity

1.2	Assay Summary: BSK_MyoF is a cell-based, multiplexed-readout assay that uses lung fibroblasts, a human
vascular primary cell, with measurements taken at 24 hours after chemical dosing in a microplate: 96-well plate.
BSK_MyoF_IL8 is an assay component measured in the BSK_MyoF assay. It measures IL-8 antibody related to
regulation of gene expression using ELISA technology. Data from the assay component BSK_MyoF_IL8 was
analyzed at the endpoint, BSK_MyoF_IL8, in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be
used to understand protein changes. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the 'cytokine' intended target family, where the subfamily is 'interleukins'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Interleukin 8 (IL-8/CXCL8) is a chemokine that mediates neutrophil recruitment into acute
inflammatory sites. IL-8 is categorized as an inflammation-related activity in the MyoF system modeling
pulmonary myofibroblast development.

-------
The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent lung fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

-------
Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.052
Response cutoff threshold used to determine hit calls: 0.157
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Fibrosis (MyoF) system models the development of pulmonary myofibroblasts, and is relevant
to respiratory disease settings as well as other chronic inflammatory settings where fibrosis occurs, such as
rheumatoid arthritis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

NA

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767

Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
97

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 62.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2839

BSK_Myo F_Deco ri n

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: MyoF Assay for Decorin Biomarker Activity

1.2	Assay Summary: BSK_MyoF is a cell-based, multiplexed-readout assay that uses lung fibroblasts, a human
vascular primary cell, with measurements taken at 24 hours after chemical dosing in a microplate: 96-well plate.
BSK_MyoF_Decorin is an assay component measured in the BSK_MyoF assay. It measures Decorin antibody
related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_MyoF_Decorin was analyzed at the endpoint, BSK_MyoF_Decorin, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand protein changes. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the 'cell adhesion molecules' intended target family, where the
subfamily is 'decorin'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Decorin is a proteoglycan that is a component of connective tissue and is involved in collagen and
matrix assembly. Decorin is categorized as a tissue remodeling-related activity in the MyoF system modeling
pulmonary myofibroblast development.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent lung fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.031
Response cutoff threshold used to determine hit calls: 0.093
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Fibrosis (MyoF) system models the development of pulmonary myofibroblasts, and is relevant
to respiratory disease settings as well as other chronic inflammatory settings where fibrosis occurs, such as
rheumatoid arthritis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767	Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 55.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2841

BSK_MyoF_MMPl

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: MyoF Assay for Matrix Metallopeptidase 1 (MMP1) Biomarker Activity

1.2	Assay Summary: BSK_MyoF is a cell-based, multiplexed-readout assay that uses lung fibroblasts, a human
vascular primary cell, with measurements taken at 24 hours after chemical dosing in a microplate: 96-well plate.
BSK_MyoF_MMPl is an assay component measured in the BSK_MyoF assay. It measures PAI-1 antibody related
to regulation of gene expression using ELISA technology. Data from the assay component BSK_MyoF_MMPl
was analyzed at the endpoint, BSK_MyoF_MMPl, in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can
be used to understand protein changes. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the 'protease' intended target family, where the subfamily is 'matrix
metalloproteinase'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Matrix metalloproteinase-1 (MMP-1) is an interstitial collagenase that degrades collagens I, II and
III and is involved in the process of tissue remodeling. MMP-1 is categorized as a tissue remodeling-related
activity in the MyoF system modeling pulmonary myofibroblast development.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent lung fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.054
Response cutoff threshold used to determine hit calls: 0.163
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Fibrosis (MyoF) system models the development of pulmonary myofibroblasts, and is relevant
to respiratory disease settings as well as other chronic inflammatory settings where fibrosis occurs, such as
rheumatoid arthritis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767	Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 47.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2843

BSK_MyoF_RAIl

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: MyoF Assay for Plasminogen activator inhibitor-1 (PAI1) Biomarker Activity

1.2	Assay Summary: BSK_MyoF is a cell-based, multiplexed-readout assay that uses lung fibroblasts, a human
vascular primary cell, with measurements taken at 24 hours after chemical dosing in a microplate: 96-well plate.
BSK_MyoF_PAIl is an assay component measured in the BSK_MyoF assay. It measures PAI-1 antibody related
to regulation of gene expression using ELISA technology. Data from the assay component BSK_MyoF_PAIl was
analyzed at the endpoint, BSK_MyoF_PAIl, in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be
used to understand protein changes. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the 'cytokine' intended target family, where the subfamily is 'plasmogen activator
inhibitor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Plasminogen activator inhibitor-1 (PAI-I) is a serine proteinase inhibitor and inhibitor of tissue
plasminogen activator (tPA) and urokinase (uPA) and is involved in tissue remodeling and fibrinolysis. PAI-I is

-------
categorized as a tissue remodeling-related activity in the MyoF system modeling pulmonary myofibroblast
development.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent lung fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.025
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Fibrosis (MyoF) system models the development of pulmonary myofibroblasts, and is relevant
to respiratory disease settings as well as other chronic inflammatory settings where fibrosis occurs, such as
rheumatoid arthritis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767	Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 58.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2845

BSK_MyoF_SRB

1. General Information

1.1	Assay Title: Viability Assessment in the BioMAP Diversity Plus: MyoF Assay

1.2	Assay Summary: BSK_MyoF is a cell-based, multiplexed-readout assay that uses lung fibroblasts, a human
vascular primary cell, with measurements taken at 24 hours after chemical dosing in a microplate: 96-well plate.
BSK_MyoF_SRB is an assay component measured in the BSK_MyoF assay. It measures 0.1% sulforhodamine
related to cell death using Sulforhodamine staining technology. Data from the assay component BSK_MyoF_SRB
was analyzed at the endpoint, BSK_MyoF_SRB, in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of viability reporter, gain or loss-of-signal activity can be
used to understand protein changes. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the 'cell morphology1 intended target family, where the subfamily is 'cell
conformation'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: SRB in the MyoF system is a measure of the total protein content of lung fibroblasts. Cell viability
of adherent cells is measured by Sulforhodamine B (SRB) staining, a method that determines cell density by
measuring total protein content of test wells.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent lung fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.009
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: Sulforhodamine staining (Spectrophotometry)

2.6	Response: The Fibrosis (MyoF) system models the development of pulmonary myofibroblasts, and is relevant
to respiratory disease settings as well as other chronic inflammatory settings where fibrosis occurs, such as
rheumatoid arthritis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767	Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 56.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2847

BSK_MyoF_TI M PI

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: MyoFAssayforTIMP metallopeptidase inhibitor 1 (TIMP1) Biomarker Activity

1.2	Assay Summary: BSK_MyoF is a cell-based, multiplexed-readout assay that uses lung fibroblasts, a human
vascular primary cell, with measurements taken at 24 hours after chemical dosing in a microplate: 96-well plate.
BSK_MyoF_TIMPl is an assay component measured in the BSK_MyoF assay. It measures TIMP-1 antibody
related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_MyoF_TIMPl was analyzed at the endpoint, BSK_MyoF_TIMPl, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand protein changes. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the 'protease' intended target family, where the subfamily is 'matrix
metalloproteinase'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: TIMP-1 is a tissue inhibitor of matrix metalloprotease-7 (MMP-7) and other MMPs, and is involved
in tissue remodeling, angiogenesis and fibrosis. TIMP-1 is categorized as a tissue remodeling-related activity in
the MyoF system modeling pulmonary myofibroblast development.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent lung fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.048
Response cutoff threshold used to determine hit calls: 0.145
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Fibrosis (MyoF) system models the development of pulmonary myofibroblasts, and is relevant
to respiratory disease settings as well as other chronic inflammatory settings where fibrosis occurs, such as
rheumatoid arthritis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 767	Number of chemicals tested: 577

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 60.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2849

BSK_BF4T_MCP1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BF4T Assay for Monocyte Chemoattractant Protein-1 (MCP1) Biomarker
Activity

1.2	Assay Summary: BSK_BF4T is a cell-based, multiplexed-readout assay that uses bronchial epithelial cells and
dermal fibroblasts, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing
in a microplate: 96-well plate. BSK_BF4T_MCP1 is an assay component measured in the BSK_BF4T assay. It
measures MCP-1 antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_BF4T_MCP1 was analyzed at the endpoint, BSK_BF4T_MCP1, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'cytokine' intended target family, where the subfamily
is 'chemotactic factor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Monocyte chemoattractant protein-1 (MCP-1/CCL2) is a chemoattractant cytokine (chemokine)
that regulates the recruitment of monocytes and T cells into sites of inflammation. MCP-1 is categorized as an
inflammation-related activity in the BF4T system modeling Th2 airway inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells and dermal fibroblasts primary cell used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.032
Response cutoff threshold used to determine hit calls: 0.097
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Disease (BF4T) system models lung inflammation of the Th2 type, an environment that
promotes the recruitment of eosinophils, mast cells and basophils as well as effector memory T cells. This system
is relevant for allergy and asthma, pulmonary fibrosis, as well as COPD exacerbations.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 26.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2851

BSK_B F4T_Eotaxi n 3

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BF4T Assay for Eotaxin 3 Biomarker Activity

1.2	Assay Summary: BSK_BF4T is a cell-based, multiplexed-readout assay that uses bronchial epithelial cells and
dermal fibroblasts, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing
in a microplate: 96-well plate. BSK_BF4T_Eotaxin3 is an assay component measured in the BSK_BF4T assay. It
measures Eotaxin-3 antibody related to regulation of gene expression using ELISA technology. Data from the
assay component BSK_BF4T_Eotaxin3 was analyzed at the endpoint, BSK_BF4T_Eotaxin3, in the positive
analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of binding
reporter, gain or loss-of-signal activity can be used to understand protein changes. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the 'cytokine' intended target family, where
the subfamily is 'chemotactic factor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Eotaxin-3/CCL26 is a chemokine that mediates recruitment of eosinophils and basophils into tissue
sites. Eotaxin-3 is categorized as an inflammation-related activity in the BF4T system modeling Th2 airway
inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells and dermal fibroblasts primary cell used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.051
Response cutoff threshold used to determine hit calls: 0.153
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Disease (BF4T) system models lung inflammation of the Th2 type, an environment that
promotes the recruitment of eosinophils, mast cells and basophils as well as effector memory T cells. This system
is relevant for allergy and asthma, pulmonary fibrosis, as well as COPD exacerbations.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 26.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2853

BSK_B F4T_VCAM 1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BF4T Assay for Vascular cell adhesion molecule 1 (VCAM1) Biomarker Activity

1.2	Assay Summary: BSK_BF4T is a cell-based, multiplexed-readout assay that uses bronchial epithelial cells and
dermal fibroblasts, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing
in a microplate: 96-well plate. BSK_BF4T_VCAM1 is an assay component measured in the BSK_BF4T assay. It
measures VCAM-1 antibody related to regulation of gene expression using ELISA technology. Data from the
assay component BSK_BF4T_VCAM1 was analyzed at the endpoint, BSK_BF4T_VCAM1, in the positive analysis
fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of binding
reporter, gain or loss-of-signal activity can be used to understand protein changes. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the 'cell adhesion molecules' intended
target family, where the subfamily is 'Immunoglobulin CAM'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Vascular Cell Adhesion Molecule 1 (VCAM-1/CD106) is a cell adhesion molecule that mediates
adhesion of monocytes and T cells to endothelial cells. VCAM-1 is categorized as an inflammation-related
activity in the BF4T system modeling Th2 airway inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells and dermal fibroblasts primary cell used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.037
Response cutoff threshold used to determine hit calls: 0.112
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Disease (BF4T) system models lung inflammation of the Th2 type, an environment that
promotes the recruitment of eosinophils, mast cells and basophils as well as effector memory T cells. This system
is relevant for allergy and asthma, pulmonary fibrosis, as well as COPD exacerbations.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 347

Active hit count: hitc>0.9
49

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 30.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2855

BSK_BF4T_ICAM 1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BF4T Assay for Intercellular Adhesion Molecule 1 (ICAM1) Biomarker Activity

1.2	Assay Summary: BSK_BF4T is a cell-based, multiplexed-readout assay that uses bronchial epithelial cells and
dermal fibroblasts, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing
in a microplate: 96-well plate. BSK_BF4T_ICAM1 is an assay component measured in the BSK_BF4T assay. It
measures ICAM-1 antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_BF4T_ICAM1 was analyzed at the endpoint, BSK_BF4T_ICAM1, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'cell adhesion molecules' intended target family, where
the subfamily is 'Immunoglobulin CAM'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Intercellular Adhesion Molecule 1 (ICAM-1/CD54) is a cell adhesion molecule that mediates
leukocyte-endothelial cell adhesion and leukocyte recruitment. ICAM-1 is categorized as an inflammation-
related activity in the BF4T system modeling Th2 airway inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells and dermal fibroblasts primary cell used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.018
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Disease (BF4T) system models lung inflammation of the Th2 type, an environment that
promotes the recruitment of eosinophils, mast cells and basophils as well as effector memory T cells. This system
is relevant for allergy and asthma, pulmonary fibrosis, as well as COPD exacerbations.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 24.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2857

BSK_BF4T_CD90

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: BF4T Assay for CD90 Biomarker Activity

1.2	Assay Summary: BSK_BF4T is a cell-based, multiplexed-readout assay that uses bronchial epithelial cells and
dermal fibroblasts, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing
in a microplate: 96-well plate. BSK_BF4T_CD90 is an assay component measured in the BSK_BF4T assay. It
measures CD90 antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_BF4T_CD90 was analyzed at the endpoint, BSK_BF4T_CD90, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'cytokine' intended target family, where the subfamily
is 'inflammatory factor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: CD90 is a cell surface glycoprotein that mediates cell-cell and cell-matrix interactions. CD90 is
categorized as a tissue remodeling-related activity in the BF4T system modeling Th2 airway inflammation.

-------
The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells and dermal fibroblasts primary cell used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

-------
Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.013
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Disease (BF4T) system models lung inflammation of the Th2 type, an environment that
promotes the recruitment of eosinophils, mast cells and basophils as well as effector memory T cells. This system
is relevant for allergy and asthma, pulmonary fibrosis, as well as COPD exacerbations.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

NA

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347

Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
16

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 24.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2859

BSK_BF4T_IL8

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BF4T Assay for lnterleukin-8 (IL8) Biomarker Activity

1.2	Assay Summary: BSK_BF4T is a cell-based, multiplexed-readout assay that uses bronchial epithelial cells and
dermal fibroblasts, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing
in a microplate: 96-well plate. BSK_BF4T_IL8 is an assay component measured in the BSK_BF4T assay. It
measures IL-8 antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_BF4T_IL8 was analyzed at the endpoint, BSK_BF4T_IL8, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand protein changes. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the 'cytokine' intended target family, where the subfamily is
'interleukins'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Interleukin 8 (IL-8/CXCL8) is a chemokine that mediates neutrophil recruitment into acute
inflammatory sites. IL-8 is categorized as an inflammation-related activity in the BF4T system modeling Th2
airway inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells and dermal fibroblasts primary cell used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.034
Response cutoff threshold used to determine hit calls: 0.102
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Disease (BF4T) system models lung inflammation of the Th2 type, an environment that
promotes the recruitment of eosinophils, mast cells and basophils as well as effector memory T cells. This system
is relevant for allergy and asthma, pulmonary fibrosis, as well as COPD exacerbations.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 25.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2861

BSK_BF4T_ILla

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BF4T Assay for lnterleukin-1 alpha (ILla) Biomarker Activity

1.2	Assay Summary: BSK_BF4T is a cell-based, multiplexed-readout assay that uses bronchial epithelial cells and
dermal fibroblasts, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing
in a microplate: 96-well plate. BSK_BF4T_ILla is an assay component measured in the BSK_BF4T assay. It
measures IL-la antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_BF4T_ILla was analyzed at the endpoint, BSK_BF4T_ILla, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'cytokine' intended target family, where the subfamily
is 'interleukins'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Interleukin 1 alpha (IL-la) is a secreted proinflammatory cytokine involved in endothelial cell
activation and neutrophil recruitment. Secreted IL-1? (siL-l?)is categorized as an inflammation-related activity
in the BF4T system modeling Th2 airway inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells and dermal fibroblasts primary cell used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.018
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Disease (BF4T) system models lung inflammation of the Th2 type, an environment that
promotes the recruitment of eosinophils, mast cells and basophils as well as effector memory T cells. This system
is relevant for allergy and asthma, pulmonary fibrosis, as well as COPD exacerbations.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 16.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2863

BSK_B F4T_Kerati n818

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BF4T Assay for Keratin 818 Biomarker Activity

1.2	Assay Summary: BSK_BF4T is a cell-based, multiplexed-readout assay that uses bronchial epithelial cells and
dermal fibroblasts, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing
in a microplate: 96-well plate. BSK_BF4T_Keratin818 is an assay component measured in the BSK_BF4T assay.
It measures Keratin 8/18 antibody related to regulation of gene expression using ELISA technology. Data from
the assay component BSK_BF4T_Keratin818 was analyzed at the endpoint, BSK_BF4T_Keratin818, in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of binding reporter, gain or loss-of-signal activity can be used to understand protein changes. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the 'cell adhesion molecules'
intended target family, where the subfamily is 'keratin'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Keratin 8/18 is an intermediate filament heterodimer of fibrous structural poteins involved in
Epithelial cell death, EMT, COPD, Lung Inflammation. Keratin 8/18 is categorized as a tissue remodeling-related
activity in the BF4T system modeling Th2 airway inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells and dermal fibroblasts primary cell used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.024
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Disease (BF4T) system models lung inflammation of the Th2 type, an environment that
promotes the recruitment of eosinophils, mast cells and basophils as well as effector memory T cells. This system
is relevant for allergy and asthma, pulmonary fibrosis, as well as COPD exacerbations.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 17.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2865

BSK_BF4T_MMP1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BF4T Assay for Matrix Metallopeptidase 1 (MMP1) Biomarker Activity

1.2	Assay Summary: BSK_BF4T is a cell-based, multiplexed-readout assay that uses bronchial epithelial cells and
dermal fibroblasts, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing
in a microplate: 96-well plate. BSK_BF4T_MMP1 is an assay component measured in the BSK_BF4T assay. It
measures MMP-1 antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_BF4T_MMP1 was analyzed at the endpoint, BSK_BF4T_MMP1, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'protease' intended target family, where the subfamily
is 'matrix metalloproteinase'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Matrix metalloproteinase-1 (MMP-1) is an interstitial collagenase that degrades collagens I, II and
III and is involved in the process of tissue remodeling. MMP-1 is categorized as a tissue remodeling-related
activity in the BF4T system modeling Th2 airway inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells and dermal fibroblasts primary cell used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.018
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Disease (BF4T) system models lung inflammation of the Th2 type, an environment that
promotes the recruitment of eosinophils, mast cells and basophils as well as effector memory T cells. This system
is relevant for allergy and asthma, pulmonary fibrosis, as well as COPD exacerbations.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 20.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2867

BSK_BF4T_MMP3

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: BF4T Assay for Matrix Metallopeptidase 3 (MMP3) Biomarker Activity

1.2	Assay Summary: BSK_BF4T is a cell-based, multiplexed-readout assay that uses bronchial epithelial cells and
dermal fibroblasts, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing
in a microplate: 96-well plate. BSK_BF4T_MMP3 is an assay component measured in the BSK_BF4T assay. It
measures MMP-3 antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_BF4T_MMP3 was analyzed at the endpoint, BSK_BF4T_MMP3, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'protease' intended target family, where the subfamily
is 'matrix metalloproteinase'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Matrix metalloproteinase-3 (MMP-3) is an enzyme involved in tissue remodeling that can activate
other MMPs (MMP-1, MMP-7 and MMP-9) and degrade collagens (II, III, IV, IX and X), proteoglycans, fibronectin,

-------
laminin and elastin. MMP-3 is categorized as a tissue remodeling-related activity in the BF4T system modeling
Th2 airway inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells and dermal fibroblasts primary cell used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.029
Response cutoff threshold used to determine hit calls: 0.086
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Disease (BF4T) system models lung inflammation of the Th2 type, an environment that
promotes the recruitment of eosinophils, mast cells and basophils as well as effector memory T cells. This system
is relevant for allergy and asthma, pulmonary fibrosis, as well as COPD exacerbations.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 19.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2869

BSK_BF4T_MMP9

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BF4T Assay for Matrix Metallopeptidase 9 (MMP9) Biomarker Activity

1.2	Assay Summary: BSK_BF4T is a cell-based, multiplexed-readout assay that uses bronchial epithelial cells and
dermal fibroblasts, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing
in a microplate: 96-well plate. BSK_BF4T_MMP9 is an assay component measured in the BSK_BF4T assay. It
measures MMP-9 antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_BF4T_MMP9 was analyzed at the endpoint, BSK_BF4T_MMP9, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'protease' intended target family, where the subfamily
is 'matrix metalloproteinase'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Matrix metalloproteinase-9 (MMP-9) is a gelatinase B that degrades collagen IV and gelatin and is
involved in airway matrix remodeling. MMP-9 is categorized as a tissue remodeling-related activity in the BF4T
system modeling Th2 airway inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells and dermal fibroblasts primary cell used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.029
Response cutoff threshold used to determine hit calls: 0.088
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Disease (BF4T) system models lung inflammation of the Th2 type, an environment that
promotes the recruitment of eosinophils, mast cells and basophils as well as effector memory T cells. This system
is relevant for allergy and asthma, pulmonary fibrosis, as well as COPD exacerbations.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 30.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2871

BSK_BF4T_PAI1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BF4T Assay for Plasminogen activator inhibitor-1 (PAI1) Biomarker Activity

1.2	Assay Summary: BSK_BF4T is a cell-based, multiplexed-readout assay that uses bronchial epithelial cells and
dermal fibroblasts, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing
in a microplate: 96-well plate. BSK_BF4T_PAI1 is an assay component measured in the BSK_BF4T assay. It
measures PAI-1 antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_BF4T_PAI1 was analyzed at the endpoint, BSK_BF4T_PAI1, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'cytokine' intended target family, where the subfamily
is 'plasmogen activator inhibitor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Plasminogen activator inhibitor-1 (PAI-I) is a serine proteinase inhibitor and inhibitor of tissue
plasminogen activator (tPA) and urokinase (uPA) and is involved in tissue remodeling and fibrinolysis. PAI-I is
categorized as a tissue remodeling-related activity in the BF4T system modeling Th2 airway inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells and dermal fibroblasts primary cell used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.016
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Disease (BF4T) system models lung inflammation of the Th2 type, an environment that
promotes the recruitment of eosinophils, mast cells and basophils as well as effector memory T cells. This system
is relevant for allergy and asthma, pulmonary fibrosis, as well as COPD exacerbations.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 29.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2873

BSK_BF4T_SRB

1. General Information

1.1	Assay Title: Viability Assessment in the BioMAP Diversity Plus: BF4T Assay

1.2	Assay Summary: BSK_BF4T is a cell-based, multiplexed-readout assay that uses bronchial epithelial cells and
dermal fibroblasts, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing
in a microplate: 96-well plate. BSK_BF4T_SRB is an assay component measured in the BSK_BF4T assay. It
measures 0.1% sulforhodamine related to cell death using Sulforhodamine staining technology. Data from the
assay component BSK_BF4T_SRB was analyzed at the endpoint, BSK_BF4T_SRB, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of viability reporter,
gain or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the 'cell morphology1 intended target family, where
the subfamily is 'cell conformation'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: SRB in the BF4T system is a measure of the total protein content of bronchial epithelial cells and
dermal fibroblasts. Cell viability of adherent cells is measured by Sulforhodamine B (SRB) staining, a method
that determines cell density by measuring total protein content of test wells.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells and dermal fibroblasts primary cell used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.009
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: Sulforhodamine staining (Spectrophotometry)

2.6	Response: The Lung Disease (BF4T) system models lung inflammation of the Th2 type, an environment that
promotes the recruitment of eosinophils, mast cells and basophils as well as effector memory T cells. This system
is relevant for allergy and asthma, pulmonary fibrosis, as well as COPD exacerbations.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 32.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2875

BSK_BF4T_tPA

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BF4T Assay for Tissue plasminogen activator (tPA) Biomarker Activity

1.2	Assay Summary: BSK_BF4T is a cell-based, multiplexed-readout assay that uses bronchial epithelial cells and
dermal fibroblasts, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing
in a microplate: 96-well plate. BSK_BF4T_tPA is an assay component measured in the BSK_BF4T assay. It
measures tPA antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_BF4T_tPA was analyzed at the endpoint, BSK_BF4T_tPA, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand protein changes. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the 'protease' intended target family, where the subfamily is 'serine
protease'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Tissue plasminogen activator (tPA) is a serine protease that catalyzes the cleavage of plasminogen
to plasmin and regulates clot degradation. tPA is involved in fibrinolyis, cell migration and tissue remodeling.
tPA is categorized as a tissue remodeling-related activity in the BF4T system modeling Th2 airway inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells and dermal fibroblasts primary cell used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.029
Response cutoff threshold used to determine hit calls: 0.086
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Disease (BF4T) system models lung inflammation of the Th2 type, an environment that
promotes the recruitment of eosinophils, mast cells and basophils as well as effector memory T cells. This system
is relevant for allergy and asthma, pulmonary fibrosis, as well as COPD exacerbations.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 26.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2877

BSK_BF4T_uPA

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: BF4T Assay for urokinase plasminogen activator (uPA) Biomarker Activity

1.2	Assay Summary: BSK_BF4T is a cell-based, multiplexed-readout assay that uses bronchial epithelial cells and
dermal fibroblasts, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing
in a microplate: 96-well plate. BSK_BF4TA is an assay component measured in the BSK_BF4T assay. It measures
uPA antibody related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_BF4TA was analyzed at the endpoint, BSK_BF4TA, in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity
can be used to understand protein changes. To generalize the intended target to other relatable targets, this
assay endpoint is annotated to the 'protease' intended target family, where the subfamily is 'serine protease'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Urokinse plasminogen activator (uPA) is a serine protease with thrombolytic activity. Triggers
fibrinolysis and extracellular matrix degradation. uPA is categorized as a tissue remodeling-related activity in
the BF4T system modeling Th2 airway inflammation.

-------
The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells and dermal fibroblasts primary cell used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

-------
Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.019
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Disease (BF4T) system models lung inflammation of the Th2 type, an environment that
promotes the recruitment of eosinophils, mast cells and basophils as well as effector memory T cells. This system
is relevant for allergy and asthma, pulmonary fibrosis, as well as COPD exacerbations.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

NA

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347

Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
40

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 27.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2879

BSK_BE3C_ICAM1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BE3C Assay for Intercellular Adhesion Molecule 1 (ICAM1) Biomarker Activity

1.2	Assay Summary: BSK_BE3C is a cell-based, multiplexed-readout assay that uses bronchial epithelial cell, a human
lung primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_BE3C_ICAM1 is an assay component measured in the BSK_BE3C assay. It measures ICAM-1 antibody related
to regulation of gene expression using ELISA technology. Data from the assay component BSK_BE3C_ICAM1 was
analyzed at the endpoint, BSK_BE3C_ICAM1, in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be
used to understand protein changes. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the 'cell adhesion molecules' intended target family, where the subfamily is
'Immunoglobulin CAM'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Intercellular Adhesion Molecule 1 (ICAM-1/CD54) is a cell adhesion molecule that mediates
leukocyte-endothelial cell adhesion and leukocyte recruitment. ICAM-1 is categorized as an inflammation-
related activity in the BE3C system modeling Thl lung inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells primary cell used. Primary human cell types used in
BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.017
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Inflammation (BE3C) system models lung inflammation of the Thl type, an environment
that promotes monocyte and T cell adhesion and recruitment. This system is relevant for sarcoidosis and
pulmonary responses to respiratory infections.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 495	Number of chemicals tested: 238

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 32.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2881

BSK_BE3C_ITAC

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BE3C Assay for Interferon-inducible T-cell alpha chemoattractant (ITAC)
Biomarker Activity

1.2	Assay Summary: BSK_BE3C is a cell-based, multiplexed-readout assay that uses bronchial epithelial cell, a human
lung primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. BSK_BE3C_ITAC
is an assay component measured in the BSK_BE3C assay. It measures l-TAC antibody related to regulation of
gene expression using ELISA technology. Data from the assay component BSK_BE3C_ITAC was analyzed at the
endpoint, BSK_BE3C_ITAC, in the positive analysis fitting direction relative to DMSO as the negative control and
baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be used to understand
protein changes. To generalize the intended target to other relatable targets, this assay endpoint is annotated
to the 'cytokine' intended target family, where the subfamily is 'chemotactic factor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Interferon inducible T Cell Alpha Chemoattractant (l-TAC/CXCLll) is a chemokine that mediates T
cell and monocyte chemotaxis. I-TAC is categorized as an inflammation-related activity in the BE3C system
modeling Thl lung inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells primary cell used. Primary human cell types used in
BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.016
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Inflammation (BE3C) system models lung inflammation of the Thl type, an environment
that promotes monocyte and T cell adhesion and recruitment. This system is relevant for sarcoidosis and
pulmonary responses to respiratory infections.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 495	Number of chemicals tested: 238

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 34.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2883

BSK_BE3C_IL8

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: BE3C Assay for lnterleukin-8 (IL8) Biomarker Activity

1.2	Assay Summary: BSK_BE3C is a cell-based, multiplexed-readout assay that uses bronchial epithelial cell, a human
lung primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. BSK_BE3C_IL8
is an assay component measured in the BSK_BE3C assay. It measures IL-8 antibody related to regulation of gene
expression using ELISA technology. Data from the assay component BSK_BE3C_IL8 was analyzed at the
endpoint, BSK_BE3C_IL8, in the positive analysis fitting direction relative to DMSO as the negative control and
baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be used to understand
protein changes. To generalize the intended target to other relatable targets, this assay endpoint is annotated
to the 'cytokine' intended target family, where the subfamily is 'interleukins'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Interleukin 8 (IL-8/CXCL8) is a chemokine that mediates neutrophil recruitment into acute
inflammatory sites. IL-8 is categorized as an inflammation-related activity in the BE3C system modeling Thl lung
inflammation.

-------
The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells primary cell used. Primary human cell types used in
BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

-------
Nominal number of tested concentrations:
4

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.039
Response cutoff threshold used to determine hit calls: 0.116
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Inflammation (BE3C) system models lung inflammation of the Thl type, an environment
that promotes monocyte and T cell adhesion and recruitment. This system is relevant for sarcoidosis and
pulmonary responses to respiratory infections.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

NA

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 495

Number of chemicals tested: 238

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
80

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 39.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2885

BSK_BE3C_EGFR

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: BE3C Assay for Epidermal growth factor receptor (EGFR) Biomarker Activity

1.2	Assay Summary: BSK_BE3C is a cell-based, multiplexed-readout assay that uses bronchial epithelial cell, a human
lung primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_BE3C_EGFR is an assay component measured in the BSK_BE3C assay. It measures EGFR antibody related to
regulation of gene expression using ELISA technology. Data from the assay component BSK_BE3C_EGFR was
analyzed at the endpoint, BSK_BE3C_EGFR, in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be
used to understand protein changes. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the 'kinase' intended target family, where the subfamily is 'receptor tyrosine kinase'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Epidermal growth factor receptor (EGFR) is a cell surface receptor for epidermal growth factor
involved in cell proliferation, cell differentiation, tissue remodeling and tumor growth. EGFR is categorized as a
tissue remodeling-related activity in the BE3C system modeling Thl lung inflammation.

-------
The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells primary cell used. Primary human cell types used in
BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

-------
Nominal number of tested concentrations:
4

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.018
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Inflammation (BE3C) system models lung inflammation of the Thl type, an environment
that promotes monocyte and T cell adhesion and recruitment. This system is relevant for sarcoidosis and
pulmonary responses to respiratory infections.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of kinase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

NA

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 495

Number of chemicals tested: 238

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
55

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 34.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2887

BSK_BE3C_Keratin818

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BE3C Assay for Keratin 818 Biomarker Activity

1.2	Assay Summary: BSK_BE3C is a cell-based, multiplexed-readout assay that uses bronchial epithelial cell, a human
lung primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_BE3C_Keratin818 is an assay component measured in the BSK_BE3C assay. It measures Keratin 8/18
antibody related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_BE3C_Keratin818 was analyzed at the endpoint, BSK_BE3C_Keratin818, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'cell adhesion molecules' intended target family, where
the subfamily is 'keratin'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Keratin 8/18 is an intermediate filament heterodimer of fibrous structural poteins involved in
Epithelial cell death, EMT, COPD, Lung Inflammation. Keratin 8/18 is categorized as a tissue remodeling-related
activity in the BE3C system modeling Thl lung inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells primary cell used. Primary human cell types used in
BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.019
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Inflammation (BE3C) system models lung inflammation of the Thl type, an environment
that promotes monocyte and T cell adhesion and recruitment. This system is relevant for sarcoidosis and
pulmonary responses to respiratory infections.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 495	Number of chemicals tested: 238

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 39.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2889

BSK_BE3C_MMP9

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: BE3C Assay for Matrix Metallopeptidase 9 (MMP9) Biomarker Activity

1.2	Assay Summary: BSK_BE3C is a cell-based, multiplexed-readout assay that uses bronchial epithelial cell, a human
lung primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_BE3C_MMP9 is an assay component measured in the BSK_BE3C assay. It measures MMP-9 antibody related
to regulation of gene expression using ELISA technology. Data from the assay component BSK_BE3C_MMP9 was
analyzed at the endpoint, BSK_BE3C_MMP9, in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be
used to understand protein changes. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the 'protease' intended target family, where the subfamily is 'matrix
metalloproteinase'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Matrix metalloproteinase-9 (MMP-9) is a gelatinase B that degrades collagen IV and gelatin and is
involved in airway matrix remodeling. MMP-9 is categorized as a tissue remodeling-related activity in the BE3C
system modeling Thl lung inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent bronchial epithelial cells primary cell used. Primary human cell types used in
BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and
IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC
and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial
epithelial cells/1 L-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.024
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Lung Inflammation (BE3C) system models lung inflammation of the Thl type, an environment
that promotes monocyte and T cell adhesion and recruitment. This system is relevant for sarcoidosis and
pulmonary responses to respiratory infections.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 495	Number of chemicals tested: 238

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 51.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2891

BSK_CASM3C_RAI 1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: CASM3C Assay for Plasminogen activator inhibitor-1 (PAI1) Biomarker
Activity

1.2	Assay Summary: BSK_CASM3C is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium
and coronary artery smooth muscle cells, a human vascular primary cell co-culture, with measurements taken
at 24 hours after chemical dosing in a 96-well plate. BSK_CASM3C_PAI1 is an assay component measured in the
BSK_CASM3C assay. It measures PAI-1 antibody related to regulation of gene expression using ELISA technology.
Data from the assay component BSK_CASM3C_PAI1 was analyzed at the endpoint, BSK_CASM3C_PAI1, in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of binding reporter, gain or loss-of-signal activity can be used to understand protein changes. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the 'cytokine' intended target
family, where the subfamily is 'plasmogen activator inhibitor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Plasminogen activator inhibitor-1 (PAI-I) is a serine proteinase inhibitor and inhibitor of tissue
plasminogen activator (tPA) and urokinase (uPA) and is involved in tissue remodeling and fibrinolysis. PAI-I is
categorized as a tissue remodeling-related activity in the CASM3C system modeling Thl vascular smooth muscle
inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent coronary artery smooth muscle cells primary cell used. Primary human cell
types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha
and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System
(PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System
(bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human
dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha,
IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta,
TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.01
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Cardiovascular Disease (CASM3C) system models vascular inflammation of the Thl type, an
environment that promotes monocyte and T cell recruitment. This system is relevant for chronic inflammatory
diseases, vascular inflammation and restenosis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 495	Number of chemicals tested: 238

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 22.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2893

BSK_hDFCGF_MCPl

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: hDFCGF Assay for Monocyte Chemoattractant Protein-1 (MCP1) Biomarker
Activity

1.2	Assay Summary: BSK_hDFCGF is a cell-based, multiplexed-readout assay that uses foreskin fibroblast, a human
skin primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_hDFCGF_MCPl is an assay component measured in the BSK_hDFCGF assay. It measures MCP-1 antibody
related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_hDFCGF_MCPl was analyzed at the endpoint, BSK_hDFCGF_MCPl, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand protein changes. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the 'cytokine' intended target family, where the subfamily is
'chemotactic factor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Monocyte chemoattractant protein-1 (MCP-1/CCL2) is a chemoattractant cytokine (chemokine)
that regulates the recruitment of monocytes and T cells into sites of inflammation. MCP-1 is categorized as an
inflammation-related activity in the HDF3CGF system modeling Thl inflammation involved in wound healing
and matrix remodeling of the skin.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent foreskin fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.027
Response cutoff threshold used to determine hit calls: 0.08
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Wound Healing, Fibrosis and Inflammation (HDF3CGF) system models wound healing and
matrix/tissue remodeling in the context of Thl-type inflammation. This system is relevant for various diseases
including fibrosis, rheumatoid arthritis, psoriasis, as well as stromal biology in tumors.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 495	Number of chemicals tested: 238

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 47.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2895

BSK_hDFCGF_ICAMl

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: hDFCGF Assay for Intercellular Adhesion Molecule 1 (ICAM1) Biomarker
Activity

1.2	Assay Summary: BSK_hDFCGF is a cell-based, multiplexed-readout assay that uses foreskin fibroblast, a human
skin primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_hDFCGF_ICAMl is an assay component measured in the BSK_hDFCGF assay. It measures ICAM-1 antibody
related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_hDFCGF_ICAMl was analyzed at the endpoint, BSK_hDFCGF_ICAMl, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'cell adhesion molecules' intended target family, where
the subfamily is 'Immunoglobulin CAM'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Intercellular Adhesion Molecule 1 (ICAM-1/CD54) is a cell adhesion molecule that mediates
leukocyte-endothelial cell adhesion and leukocyte recruitment. ICAM-1 is categorized as an inflammation-
related activity in the HDF3CGF system modeling Thl inflammation involved in wound healing and matrix
remodeling of the skin.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent foreskin fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20

-------
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.017
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Wound Healing, Fibrosis and Inflammation (HDF3CGF) system models wound healing and
matrix/tissue remodeling in the context of Thl-type inflammation. This system is relevant for various diseases
including fibrosis, rheumatoid arthritis, psoriasis, as well as stromal biology in tumors.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 495	Number of chemicals tested: 238

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 58.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2897

BSK_hDFCGF_Collagenl

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: hDFCGF Assay for Collagen I Biomarker Activity

1.2	Assay Summary: BSK_hDFCGF is a cell-based, multiplexed-readout assay that uses foreskin fibroblast, a human
skin primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_hDFCGF_Collagenl is an assay component measured in the BSK_hDFCGF assay. It measures Collagen I
antibody related to regulation of gene protein expression using ELISA technology. Data from the assay
component BSK_hDFCGF_Collagenl was analyzed at the endpoint, BSK_hDFCGF_Collagenl, in the positive
analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of binding
reporter, gain or loss-of-signal activity can be used to understand protein changes. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the 'cell adhesion molecules' intended
target family, where the subfamily is 'collagen'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Collagen I is involved in tissue remodeling and fibrosis, and is the most common fibrillar collagen
that is found in skin, bone, tendons and other connective tissues. Collagen I is categorized as a tissue

-------
remodeling-related activity in the HDF3CGF system modeling Thl inflammation involved in wound healing and
matrix remodeling of the skin.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent foreskin fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.05
Response cutoff threshold used to determine hit calls: 0.151
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Wound Healing, Fibrosis and Inflammation (HDF3CGF) system models wound healing and
matrix/tissue remodeling in the context of Thl-type inflammation. This system is relevant for various diseases
including fibrosis, rheumatoid arthritis, psoriasis, as well as stromal biology in tumors.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 495	Number of chemicals tested: 238

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 45.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2899

BSKJlDFCGFJTAC

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: hDFCGF Assay for Interferon-inducible T-cell alpha chemoattractant (ITAC)
Biomarker Activity

1.2	Assay Summary: BSK_hDFCGF is a cell-based, multiplexed-readout assay that uses foreskin fibroblast, a human
skin primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_hDFCGF_ITAC is an assay component measured in the BSK_hDFCGF assay. It measures l-TAC antibody
related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_hDFCGF_ITAC was analyzed at the endpoint, BSK_hDFCGF_ITAC, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand protein changes. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the 'cytokine' intended target family, where the subfamily is
'chemotactic factor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Interferon inducible T Cell Alpha Chemoattractant (l-TAC/CXCLll) is a chemokine that mediates T
cell and monocyte chemotaxis. I-TAC is categorized as an inflammation-related activity in the HDF3CGF system
modeling Thl inflammation involved in wound healing and matrix remodeling of the skin

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent foreskin fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.028
Response cutoff threshold used to determine hit calls: 0.083
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Wound Healing, Fibrosis and Inflammation (HDF3CGF) system models wound healing and
matrix/tissue remodeling in the context of Thl-type inflammation. This system is relevant for various diseases
including fibrosis, rheumatoid arthritis, psoriasis, as well as stromal biology in tumors.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 495	Number of chemicals tested: 238

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 49.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2901

BSK_hDFCGF_TIMP2

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: hDFCGF Assay for TIMP metallopeptidase inhibitor 2 (TIMP2) Biomarker
Activity

1.2	Assay Summary: BSK_hDFCGF is a cell-based, multiplexed-readout assay that uses foreskin fibroblast, a human
skin primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate.
BSK_hDFCGF_TIMP2 is an assay component measured in the BSK_hDFCGF assay. It measures TIMP-2 antibody
related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_hDFCGF_TIMP2 was analyzed at the endpoint, BSK_hDFCGF_TIMP2, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand protein changes. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the 'protease' intended target family, where the subfamily is 'matrix
metalloproteinase'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: TIMP-2 is a tissue inhibitor of matrix metalloproteases and is involved in tissue remodeling,
angiogenesis and fibrosis. TIMP-2 is categorized as a tissue remodeling-related activity in the HDF3CGF system
modeling Thl inflammation involved in wound healing and matrix remodeling of the skin.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent foreskin fibroblast primary cell used. Primary human cell types used in BioMAP
systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H
System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR
ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-
lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells and human dermal
fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta, TNFalpha, IFNgamma,
EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-lbeta, TNFalpha and
IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and IFNgamma), MyoF
System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC and
macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.037
Response cutoff threshold used to determine hit calls: 0.111
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Wound Healing, Fibrosis and Inflammation (HDF3CGF) system models wound healing and
matrix/tissue remodeling in the context of Thl-type inflammation. This system is relevant for various diseases
including fibrosis, rheumatoid arthritis, psoriasis, as well as stromal biology in tumors.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 495	Number of chemicals tested: 238

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 39.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2903

BSK_KF3CT_IL8

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: KF3CT Assay for lnterleukin-8 (IL8) Biomarker Activity

1.2	Assay Summary: BSK_KF3CT is a cell-based, multiplexed-readout assay that uses keratinocytes and foreskin
fibroblasts, a human skin primary cell co-culture, with measurements taken at 24 hours after chemical dosing
in a 96-well plate. BSK_KF3CT_IL8 is an assay component measured in the BSK_KF3CT assay. It measures IL-8
antibody related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_KF3CT_IL8 was analyzed at the endpoint, BSK_KF3CT_IL8, in the positive analysis fitting direction relative
to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal
activity can be used to understand protein changes. To generalize the intended target to other relatable targets,
this assay endpoint is annotated to the 'cytokine' intended target family, where the subfamily is 'interleukins'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Interleukin 8 (IL-8/CXCL8) is a chemokine that mediates neutrophil recruitment into acute
inflammatory sites. IL-8 is categorized as an inflammation-related activity in the KF3CT system modeling Thl
cutaneous inflammation.

-------
The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent keratinocytes and foreskin fibroblasts primary cell co-culture used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

-------
Nominal number of tested concentrations:
4

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.02
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Psoriasis and Dermatitis (KF3CT) system models cutaneous inflammation of the Thl type, an
environment that promotes monocyte and T cell adhesion and recruitment. This system is relevant for
cutaneous responses to tissue damage caused by mechanical, chemical, or infectious agents, as well as certain
states of psoriasis and dermatitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 495

Number of chemicals tested: 238

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
55

Inactive hit count: 0
-------
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 42.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2905

BSK_KF3CT_MIG

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: KF3CT Assay for Monokine induced gamma interferon (MIG) Biomarker
Activity

1.2	Assay Summary: BSK_KF3CT is a cell-based, multiplexed-readout assay that uses keratinocytes and foreskin
fibroblasts, a human skin primary cell co-culture, with measurements taken at 24 hours after chemical dosing
in a 96-well plate. BSK_KF3CT_MIG is an assay component measured in the BSK_KF3CT assay. It measures MIG
antibody related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_KF3CT_MIG was analyzed at the endpoint, BSK_KF3CT_MIG, in the positive analysis fitting direction relative
to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal
activity can be used to understand protein changes. To generalize the intended target to other relatable targets,
this assay endpoint is annotated to the 'cytokine' intended target family, where the subfamily is 'chemotactic
factor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Monokine induced by gamma interferon (MIG/CXCL9) is a chemokine that mediates T cell
recruitment. MIG is categorized as an inflammation-related activity in the KF3CT system modeling Thl
cutaneous inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent keratinocytes and foreskin fibroblasts primary cell co-culture used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.012
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Psoriasis and Dermatitis (KF3CT) system models cutaneous inflammation of the Thl type, an
environment that promotes monocyte and T cell adhesion and recruitment. This system is relevant for
cutaneous responses to tissue damage caused by mechanical, chemical, or infectious agents, as well as certain
states of psoriasis and dermatitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 495	Number of chemicals tested: 238

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 33.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2907

BSK_KF3CT_PAI1

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: KF3CT Assay for Plasminogen activator inhibitor-1 (PAI1) Biomarker Activity

1.2	Assay Summary: BSK_KF3CT is a cell-based, multiplexed-readout assay that uses keratinocytes and foreskin
fibroblasts, a human skin primary cell co-culture, with measurements taken at 24 hours after chemical dosing
in a 96-well plate. BSK_KF3CT_PAI1 is an assay component measured in the BSK_KF3CT assay. It measures PAI-
1 antibody related to regulation of gene expression using ELISA technology. Data from the assay component
BSK_KF3CT_PAI1 was analyzed at the endpoint, BSK_KF3CT_PAI1, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-
of-signal activity can be used to understand protein changes. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the 'cytokine' intended target family, where the subfamily is
'plasmogen activator inhibitor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Plasminogen activator inhibitor-1 (PAI-I) is a serine proteinase inhibitor and inhibitor of tissue
plasminogen activator (tPA) and urokinase (uPA) and is involved in tissue remodeling and fibrinolysis. PAI-I is
categorized as a tissue remodeling-related activity in the KF3CT system modeling Thl cutaneous inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent keratinocytes and foreskin fibroblasts primary cell co-culture used. Primary
human cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.02
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Psoriasis and Dermatitis (KF3CT) system models cutaneous inflammation of the Thl type, an
environment that promotes monocyte and T cell adhesion and recruitment. This system is relevant for
cutaneous responses to tissue damage caused by mechanical, chemical, or infectious agents, as well as certain
states of psoriasis and dermatitis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 495	Number of chemicals tested: 238

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 53.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2909

BSKJMphgJVICPl

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: IMphg Assay for Monocyte Chemoattractant Protein-1 (MCP1) Biomarker
Activity

1.2	Assay Summary: BSKJMphg is a cell-based, multiplexed-readout assay that uses venular endothelial cells and
macrophages, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a
microplate: 96-well plate. BSK_IMphg_MCPl is an assay component measured in the BSKJMphg assay. It
measures MCP-1 antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_IMphg_MCPl was analyzed at the endpoint, BSK_IMphg_MCPl, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'cytokine' intended target family, where the subfamily
is 'chemotactic factor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

-------
2.1	Purpose: Monocyte chemoattractant protein-1 (MCP-1/CCL2) is a chemoattractant cytokine (chemokine)
that regulates the recruitment of monocytes and T cells into sites of inflammation. MCP-1 is categorized as an
inflammation-related activity in the IMphg system modeling macrophage-driven Thl vascular inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent venular endothelial cells and macrophages primary cell used. Primary human
cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.023
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Macrophage Activation (IMphg) system models chronic inflammation of the Thl type and
macrophage activation responses. This system is relevant to inflammatory conditions where monocytes play a
key role including atherosclerosis, restenosis, rheumatoid arthritis, and other chronic inflammatory conditions.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 31.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2911

BSKJMphgJVIIPla

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: IMphg Assay for Macrophage Inflammatory Protein-1 Alpha (MlPla)
Biomarker Activity

1.2	Assay Summary: BSKJMphg is a cell-based, multiplexed-readout assay that uses venular endothelial cells and
macrophages, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a
microplate: 96-well plate. BSK_IMphg_MIPla is an assay component measured in the BSKJMphg assay. It
measures MlP-la antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_IMphg_MIPla was analyzed at the endpoint, BSK_IMphg_MIPla, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'cytokine' intended target family, where the subfamily
is 'inflammatory factor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Macrophage inflammatory protein 1 alpha (MIP-la/CCL3) is a pro-inflammatory chemokine that
mediates leukocyte recruitment to sites of inflammation. MlP-la is categorized as an inflammation-related
activity in the IMphg system modeling macrophage-driven Thl vascular inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent venular endothelial cells and macrophages primary cell used. Primary human
cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.031
Response cutoff threshold used to determine hit calls: 0.092
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Macrophage Activation (IMphg) system models chronic inflammation of the Thl type and
macrophage activation responses. This system is relevant to inflammatory conditions where monocytes play a
key role including atherosclerosis, restenosis, rheumatoid arthritis, and other chronic inflammatory conditions.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 25.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2913

BSK_IMphg_VCAMl

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: IMphg Assay for Vascular cell adhesion molecule 1 (VCAM1) Biomarker
Activity

1.2	Assay Summary: BSKJMphg is a cell-based, multiplexed-readout assay that uses venular endothelial cells and
macrophages, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a
microplate: 96-well plate. BSK_IMphg_VCAMl is an assay component measured in the BSKJMphg assay. It
measures VCAM-1 antibody related to regulation of gene expression using ELISA technology. Data from the
assay component BSK_IMphg_VCAMl was analyzed at the endpoint, BSK_IMphg_VCAMl, in the positive
analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of binding
reporter, gain or loss-of-signal activity can be used to understand protein changes. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the 'cell adhesion molecules' intended
target family, where the subfamily is 'Immunoglobulin CAM'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Vascular Cell Adhesion Molecule 1 (VCAM-1/CD106) is a cell adhesion molecule that mediates
adhesion of monocytes and T cells to endothelial cells. VCAM-1 is categorized as an inflammation-related
activity in the IMphg system modeling macrophage-driven Thl vascular inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent venular endothelial cells and macrophages primary cell used. Primary human
cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.018
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Macrophage Activation (IMphg) system models chronic inflammation of the Thl type and
macrophage activation responses. This system is relevant to inflammatory conditions where monocytes play a
key role including atherosclerosis, restenosis, rheumatoid arthritis, and other chronic inflammatory conditions.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 36.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2915

BSK_IMphg_CD40

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: IMphg Assay for CD40 Biomarker Activity

1.2	Assay Summary: BSKJMphg is a cell-based, multiplexed-readout assay that uses venular endothelial cells and
macrophages, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a
microplate: 96-well plate. BSK_IMphg_CD40 is an assay component measured in the BSKJMphg assay. It
measures CD40 antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_IMphg_CD40 was analyzed at the endpoint, BSK_IMphg_CD40, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'cytokine' intended target family, where the subfamily
is 'inflammatory factor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: CD40 is a cell surface adhesion receptor and costimulatory receptor for T cell activation that is
expressed on antigen presenting cells, endothelial cells, smooth muscle cells, fibroblasts and epithelial cells.

-------
CD40 is categorized as an immunomodulatory-related activity in the IMphg system modeling macrophage-
driven Thl vascular inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent venular endothelial cells and macrophages primary cell used. Primary human
cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.025
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Macrophage Activation (IMphg) system models chronic inflammation of the Thl type and
macrophage activation responses. This system is relevant to inflammatory conditions where monocytes play a
key role including atherosclerosis, restenosis, rheumatoid arthritis, and other chronic inflammatory conditions.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 26.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2917

BSKJ M phg^ESelecti n

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: IMphg Assay for Eselectin Biomarker Activity

1.2	Assay Summary: BSKJMphg is a cell-based, multiplexed-readout assay that uses venular endothelial cells and
macrophages, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a
microplate: 96-well plate. BSK_IMphg_ESelectin is an assay component measured in the BSKJMphg assay. It
measures E-selectin antibody related to regulation of gene expression using ELISA technology. Data from the
assay component BSK_IMphg_ESelectin was analyzed at the endpoint, BSK_IMphg_ESelectin, in the positive
analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of binding
reporter, gain or loss-of-signal activity can be used to understand protein changes. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the 'cell adhesion molecules' intended
target family, where the subfamily is 'selectins'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: E-Selectin/CD62E is a cell adhesion molecule expressed only on endothelial cells that mediates
leukocyte-endothelial cell interactions. E-Selectin is categorized as an inflammation-related activity in the
IMphg system modeling macrophage-driven Thl vascular inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent venular endothelial cells and macrophages primary cell used. Primary human
cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.028
Response cutoff threshold used to determine hit calls: 0.084
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Macrophage Activation (IMphg) system models chronic inflammation of the Thl type and
macrophage activation responses. This system is relevant to inflammatory conditions where monocytes play a
key role including atherosclerosis, restenosis, rheumatoid arthritis, and other chronic inflammatory conditions.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell adhesion molecules.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 27.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2919

BSK_IMphg_CD69

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: IMphg Assay for CD69 Biomarker Activity

1.2	Assay Summary: BSKJMphg is a cell-based, multiplexed-readout assay that uses venular endothelial cells and
macrophages, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a
microplate: 96-well plate. BSK_IMphg_CD69 is an assay component measured in the BSKJMphg assay. It
measures CD69 antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_IMphg_CD69 was analyzed at the endpoint, BSK_IMphg_CD69, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'cytokine' intended target family, where the subfamily
is 'inflammatory factor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: CD69 is a cell surface activation antigen that is induced early during immune activation and is
involved in macrophage activation. CD69 is categorized as an immunomodulatory-related activity in the IMphg
system modeling macrophage-driven Thl vascular inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent venular endothelial cells and macrophages primary cell used. Primary human
cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.04
Response cutoff threshold used to determine hit calls: 0.119
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Macrophage Activation (IMphg) system models chronic inflammation of the Thl type and
macrophage activation responses. This system is relevant to inflammatory conditions where monocytes play a
key role including atherosclerosis, restenosis, rheumatoid arthritis, and other chronic inflammatory conditions.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 23.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2921

BSK_IMphgJL8

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: IMphg Assay for lnterleukin-8 (IL8) Biomarker Activity

1.2	Assay Summary: BSKJMphg is a cell-based, multiplexed-readout assay that uses venular endothelial cells and
macrophages, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a
microplate: 96-well plate. BSK_IMphg_IL8 is an assay component measured in the BSKJMphg assay. It
measures IL-8 antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_IMphg_IL8 was analyzed at the endpoint, BSK_IMphg_IL8, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'cytokine' intended target family, where the subfamily
is 'interleukins'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Interleukin 8 (IL-8/CXCL8) is a chemokine that mediates neutrophil recruitment into acute
inflammatory sites. IL-8 is categorized as an inflammation-related activity in the IMphg system modeling
macrophage-driven Thl vascular inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent venular endothelial cells and macrophages primary cell used. Primary human
cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.04
Response cutoff threshold used to determine hit calls: 0.12
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Macrophage Activation (IMphg) system models chronic inflammation of the Thl type and
macrophage activation responses. This system is relevant to inflammatory conditions where monocytes play a
key role including atherosclerosis, restenosis, rheumatoid arthritis, and other chronic inflammatory conditions.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 347

Active hit count: hitc>0.9
74

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 29.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2923

BSKJMphgJLla

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: IMphg Assay for lnterleukin-1 alpha (ILla) Biomarker Activity

1.2	Assay Summary: BSKJMphg is a cell-based, multiplexed-readout assay that uses venular endothelial cells and
macrophages, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a
microplate: 96-well plate. BSKJMphgJLla is an assay component measured in the BSKJMphg assay. It
measures IL-la antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSKJMphgJLla was analyzed at the endpoint, BSKJMphgJLla, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'cytokine' intended target family, where the subfamily
is 'interleukins'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Interleukin lalpha (IL-la) is a secreted proinflammatory cytokine involved in endothelial cell
activation and neutrophil recruitment. Secreted IL-la (sIL-la) is categorized as an inflammation-related activity
in the IMphg system modeling macrophage-driven Thl vascular inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent venular endothelial cells and macrophages primary cell used. Primary human
cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.039
Response cutoff threshold used to determine hit calls: 0.117
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Macrophage Activation (IMphg) system models chronic inflammation of the Thl type and
macrophage activation responses. This system is relevant to inflammatory conditions where monocytes play a
key role including atherosclerosis, restenosis, rheumatoid arthritis, and other chronic inflammatory conditions.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 347

Active hit count: hitc>0.9
31

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 21.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2925

BSK_IMphg_MCSF

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: IMphg Assay for Macrophage colony-stimulating factor (MCSF) Biomarker
Activity

1.2	Assay Summary: BSKJMphg is a cell-based, multiplexed-readout assay that uses venular endothelial cells and
macrophages, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a
microplate: 96-well plate. BSK_IMphg_MCSF is an assay component measured in the BSKJMphg assay. It
measures M-CSF antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSK_IMphg_MCSF was analyzed at the endpoint, BSK_IMphg_MCSF, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'cytokine' intended target family, where the subfamily
is 'colony stimulating factor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

-------
2.1	Purpose: Macrophage colony-stimulating factor (M-CSF) is a secreted and cell surface cytokine that
mediates macrophage differentiation. M-CSF is categorized as an immunomodulatory-related activity in the
IMphg system modeling macrophage-driven Thl vascular inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent venular endothelial cells and macrophages primary cell used. Primary human
cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.024
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Macrophage Activation (IMphg) system models chronic inflammation of the Thl type and
macrophage activation responses. This system is relevant to inflammatory conditions where monocytes play a
key role including atherosclerosis, restenosis, rheumatoid arthritis, and other chronic inflammatory conditions.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 347

Active hit count: hitc>0.9
24

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 19.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2927

BSKJMphgJLlO

1. General Information

1.1	Assay Title: BioMAP Diversity Plus: IMphg Assay for lnterleukin-10 (IL10) Biomarker Activity

1.2	Assay Summary: BSKJMphg is a cell-based, multiplexed-readout assay that uses venular endothelial cells and
macrophages, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a
microplate: 96-well plate. BSKJMphgJLlO is an assay component measured in the BSKJMphg assay. It
measures IL-10 antibody related to regulation of gene expression using ELISA technology. Data from the assay
component BSKJMphgJLlO was analyzed at the endpoint, BSKJMphgJLlO, in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of binding reporter, gain
or loss-of-signal activity can be used to understand protein changes. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'cytokine' intended target family, where the subfamily
is 'interleukins'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Interleukin 10 (IL-10) is a secreted anti-inflammatory cytokine. Secreted IL-10 (slL-10) is
categorized as an immunomodulatory-related activity in the IMphg system modeling macrophage-driven Thl
vascular inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent venular endothelial cells and macrophages primary cell used. Primary human
cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.059
Response cutoff threshold used to determine hit calls: 0.178
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Macrophage Activation (IMphg) system models chronic inflammation of the Thl type and
macrophage activation responses. This system is relevant to inflammatory conditions where monocytes play a
key role including atherosclerosis, restenosis, rheumatoid arthritis, and other chronic inflammatory conditions.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Immunosuppression: Assays associated with markers of immunosuppression, see 10.14573/altex.2203041

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 29.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2929

BSK_IMphg_SRB

1. General Information

1.1	Assay Title: Viability Assessment in the BioMAP Diversity Plus: IMphg Assay

1.2	Assay Summary: BSKJMphg is a cell-based, multiplexed-readout assay that uses venular endothelial cells and
macrophages, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a
microplate: 96-well plate. BSK_IMphg_SRB is an assay component measured in the BSKJMphg assay. It
measures 0.1% sulforhodamine related to cell death using Sulforhodamine staining technology. Data from the
assay component BSK_IMphg_SRB was analyzed at the endpoint, BSK_IMphg_SRB, in the positive analysis
fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of viability
reporter, gain or loss-of-signal activity can be used to understand protein changes. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the 'cell morphology1 intended target
family, where the subfamily is 'cell conformation'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: SRB in the IMphg system is a measure of the total protein content of venular endothelial cells and
macrophages. Cell viability of adherent cells is measured by Sulforhodamine B (SRB) staining, a method that
determines cell density by measuring total protein content of test wells.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent venular endothelial cells and macrophages primary cell used. Primary human
cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.011
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: Sulforhodamine staining (Spectrophotometry)

2.6	Response: The Macrophage Activation (IMphg) system models chronic inflammation of the Thl type and
macrophage activation responses. This system is relevant to inflammatory conditions where monocytes play a
key role including atherosclerosis, restenosis, rheumatoid arthritis, and other chronic inflammatory conditions.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 347

Active hit count: hitc>0.9
41

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 18.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2931

BSKJMphg^SRB.Mphg

1. General Information

1.1	Assay Title: Macrophage Viability Assessment in the BioMAP Diversity Plus: IMphg Assay

1.2	Assay Summary: BSKJMphg is a cell-based, multiplexed-readout assay that uses venular endothelial cells and
macrophages, a human vascular primary cell, with measurements taken at 24 hours after chemical dosing in a
microplate: 96-well plate. BSK_IMphg_SRB.Mphg is an assay component measured in the BSKJMphg assay. It
measures 0.1% sulforhodamine related to cell death using Sulforhodamine staining technology. Data from the
assay component BSK_IMphg_SRB.Mphg was analyzed at the endpoint, BSK_IMphg_SRB.Mphg, in the positive
analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of
viability reporter, gain or loss-of-signal activity can be used to understand protein changes. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the 'cell morphology1 intended
target family, where the subfamily is 'cell conformation'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: SRB-Mphg in the IMphg system is a measure of the total protein content of macrophages alone.
Cell viability of adherent cells is measured by Sulforhodamine B (SRB) staining, a method that determines cell
density by measuring total protein content of test wells.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent venular endothelial cells and macrophages primary cell used. Primary human
cell types used in BioMAP systems and their stimuli included the following: 3C System (HUVEC/IL-lbeta,
TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System (PBMC and HUVEC/LPS), SAg
System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-IgM + TCR ligands), BE3C
System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System (bronchial epithelial cells
and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal fibroblasts/IL-lbeta,
TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and dermal fibroblasts/IL-
lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle cells/IL-lbeta, TNFalpha and
IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and TGFbeta), Mphg System (HUVEC
and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

2.2 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

60 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.023
Response cutoff threshold used to determine hit calls: 0.079
Detection technology used: Sulforhodamine staining (Spectrophotometry)

2.6	Response: The Macrophage Activation (IMphg) system models chronic inflammation of the Thl type and
macrophage activation responses. This system is relevant to inflammatory conditions where monocytes play a
key role including atherosclerosis, restenosis, rheumatoid arthritis, and other chronic inflammatory conditions.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell morphology.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 164

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 21.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Berg EL. Human Cell-Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front
Big Data. 2019 Dec 11;2:47. doi: 10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Kunkel EJ,
Plavec I, Nguyen D, Melrose J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM,
Butcher EC, Berg EL. Rapid structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in
complex human primary cell-based models. Assay Drug Dev Technol. 2004 Aug;2(4):431-41. PubMed PMID:
15357924.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2933

BSK_LPS_Th rombomod ulin

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: LPS Assay for Thrombomodulin Biomarker Activity

1.2	Assay Summary: BSK_LPS is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_LPS_Thrombomodulin is an assay component measured in
the BSK_LPS assay. It measures Thrombomodulin antibody related to regulation of gene expression using ELISA
technology. Data from the assay component BSK_LPS_Thrombodulin was analyzed at the endpoint,
BSK_LPS_Thrombomodulin, in the positive analysis fitting direction relative to DMSO as the negative control
and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be used to understand
protein changes. To generalize the intended target to other relatable targets, this assay endpoint is annotated
to the 'gpcr' intended target family, where the subfamily is 'rhodopsin-like receptor1.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Thrombomodulin/CD141 is a cell surface receptor for complement factor 3b with anti-coagulant,
anti-inflammatory and cytoprotective activities during the process of fibrinolysis, coagulation and thrombosis.

-------
Thrombomodulin is categorized as a hemostasis-related activity in the LPS system modeling monocyte-driven
Thl vascular inflammation.

The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate

-------
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.033
Response cutoff threshold used to determine hit calls: 0.1
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Monocyte Activation (LPS) system models chronic monocyte activation and vascular
inflammation. This system is relevant to inflammatory conditions where monocytes play a key role including
atherosclerosis, restenosis, rheumatoid arthritis, other chronic inflammatory conditions and metabolic diseases.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of gpcr.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 495	Number of chemicals tested: 238

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 34.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2935

BSK_LPS_CD69

1.	General Information

1.1	Assay Title: BioMAP Diversity Plus: LPS Assay for CD69 Biomarker Activity

1.2	Assay Summary: BSK_LPS is a cell-based, multiplexed-readout assay that uses umbilical vein endothelium and
peripheral blood mononuclear cells, a human vascular primary cell co-culture, with measurements taken at 24
hours after chemical dosing in a 96-well plate. BSK_LPS_CD69 is an assay component measured in the BSK_LPS
assay. It measures CD69 antibody related to regulation of gene expression using ELISA technology. Data from
the assay component BSK_LPS_CD69 was analyzed at the endpoint, BSK_LPS_CD69, in the positive analysis
fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of binding
reporter, gain or loss-of-signal activity can be used to understand protein changes. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the 'cytokine' intended target family, where
the subfamily is 'inflammatory factor'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Bioseek is a division of DiscoveRx Corporation and developed the BioMAP system providing
uniquely informative biological activity profiles in complex human primary cell systems.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The BioMAP platform uses both proprietary primary human cell-based assay design and
integrated bioinformatics analysis platform to generate signature profiles.

1.9	Assay Throughput: 96-well plate. Assays were initiated by addition of chemical samples for 1 h followed by
addition of appropriate stimuli. Assay plates were then incubated for 24 h unless otherwise indicated. The MyoF
system was stimulated for 48 h, and the BT system was stimulated for either 72 h (soluble readouts) or 6 d (for
measurement of secreted IgG). Concentrations of stimuli were as follows: cytokines (IL-lbeta, 1 ng/mL,
Peprotech 200-01B; TNFalpha, 5 ng/mL, Peprotech 300-01A; IFNgamma, 20 ng/mL, Peprotech 300-02; IL-4, 5
ng/mL, 200-04), activators (histamine, 10 uM, Sigma H7125; SAg, 20 ng/mL or LPS, 2 ng/mL, Sigma L7770),
growth factors (TGF-beta, 5 ng/mL, R&D Systems 240-B/CF; EGF, Peprotech AF-100-15; basic-FGF,
ThermoScientific 13256029; PDGF-BB, 10 ng/mL, Peprotech 100-14B; Zymosan, 10 ug/mL, Invivogen tlrl-zyn;
Anti-IgM, 500 ng/mL). Superantigens (SAg), staphylococcal enterotoxin B (SEB) and toxic shock syndrome toxin-
1 (TSST-1) (staphylococcal enterotoxin F) from Staphylococcus aureus, and lipopolysaccharide (LPS)
from Salmonella enteritidis were obtained from Sigma. The number of lymphocytes or macrophages added to
the SAg, LPS, BT and Mphg systems were as follows for 96-well format: B cells (2.5 x 104), PBMC (7.5 x
104 cells/well for LPS and SAg systems or 2.5 x 104 cells/ well for BT system) or macrophages (7.5 x
104 cells/well). After stimulation, plates and supernatants were harvested and biomarkers quantitated by ELISA
and other methods.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: CD69 is a cell surface activation antigen. CD69 is categorized as an immunomodulatory-related
activity in the LPS system modeling monocyte-driven Thl vascular inflammation.

-------
The BioMAP Diversity Plus panel includes 12 assays encompassing 148 endpoints particularly enriched with
capabilities to detect modulators and effectors of vascular and immune biology. This panel has been extensively
used in pharmaceutical and consumer products research for characterization of product candidates. Platform
includes the T cell activation system (SAg) measuring multiple endpoints modulated by a cocktail of
superantigens, and the B and T cell autoimmunity assay (BT) for T cell-dependent B cell activation and antibody
production as key modulators of the innate and adaptive immune response, respectively. Additional assays
include models of vascular inflammation, monocyte activation, lung inflammation and fibrosis, cardiovascular
inflammation, and wound healing.

2.2	Scientific Principles: The BioMAP Diversity Plus panel can be used to model complex tissue and disease biology
of organs (vasculature, immune system, skin, lung) and general tissue biology. Use of the BioMAP panel of
human primary cell systems as (patho)physiologically relevant screening assays for evaluating adverse effects
has previously been demonstrated through testing pharmaceuticals and clinical candidates as well as
environmental chemicals in the EPA's ToxCast program. Beyond immunosuppression, significant bioactivity in
human primary cells that correlated with mechanisms of action that may indicate potential for adverse effects
in vivo. Future studies measuring the effects of environmental chemicals associated with immunotoxicity in the
BioMAP co-culture systems, along with other human cell-based models of immune related effects including
inflammation, may be useful for better defining the bioactivity profiles of non-pharmaceutical immunotoxic
compounds and understanding mechanisms of putative immune related effects in human populations.

2.3	Experimental System: adherent umbilical vein endothelium and peripheral blood mononuclear cells primary
cell co-culture used. Primary human cell types used in BioMAP systems and their stimuli included the following:
3C System (HUVEC/IL-lbeta, TNFalpha and IFNgamma), 4H System (HUVEC/IL-4 and histamine), LPS System
(PBMC and HUVEC/LPS), SAg System (PBMC and HUVEC/TCR ligands), BT System (CD19+B cells and PBMC/anti-
IgM + TCR ligands), BE3C System (bronchial epithelial cells/IL-lbeta, TNFalpha and IFNgamma), BF4T System
(bronchial epithelial cells and human dermal fibroblasts/TNFalpha and IL-4), HDF3CGF System (human dermal
fibroblasts/IL-lbeta, TNFalpha, IFNgamma, EGF, basic-FGF and PDGF-BB), KF3CT System (keratinocytes and
dermal fibroblasts/IL-lbeta, TNFalpha and IFNgamma), CASM3C System (coronary artery smooth muscle
cells/IL-lbeta, TNFalpha and IFNgamma), MyoF System (differentiated lung myofibroblasts/TNFalpha and
TGFbeta), Mphg System (HUVEC and macrophages/TLR2 ligands)

2.4	Metabolic Competence: Primary cell types included in the BioMAP platform retain regulatory processes of their
in vivo counterparts. All primary human cells utilized were obtained via commercially available sources and were
used at early passage (< P4) or without passaging (in the case of PBMC and B cells) to minimize adaptation to
cell culture and preserve physiological signaling responses. Xenobiotic biotransformation potential has not been
characterized.

2.5	Exposure Regime: The levels of cell surface (or secreted, indicated by the prefix "s") biomarker endpoints were
measured by ELISA. Overt cytotoxicity to cells in confluent adherent cultures (all systems other than the BT
system) was assessed by measuring total protein levels using sulforhodamine B (SRB) staining in parallel cultures
at the time of biomarker measurements, indicated as SRB endpoints. For proliferation assays for adherent cell
types, individual cell types are cultured at sub-confluence and relative cell numbers quantified by SRB staining
at time points optimized for each system (48 h: 3C and CASM3C systems; 72 h: BT and HDF3CGF systems; 96 h:
SAg system). SRB was performed by staining cells with 0.1% SRB after fixation with 10% TCA and reading wells
at 560 nm. Viability and proliferation of PBMC (T cells) was quantified by Alamar Blue reduction for the SAg and
BT systems. For PBMC viability (referred to as PBMC Cytotoxicity within the assay endpoint names), cells were
plated (75,000/well in a 96-well plate) and then chemical samples added for 1 h before addition of activators,
SEB and TSST-1 (20 ng/mL final concentration each). Cells were incubated for 90 h. Then, Alamar Blue (20
uL/well) (Invitrogen, Cat DAL1100) was added for 6 h, and the plates were read with a fluorescence microplate
reader at 546/580 nm (excitation/emission). For PBMC proliferation, cells were plated and activated as above
but incubated for only 16 h prior to addition of Alamar Blue. Plates were read after 6 h.

ASSAY DESIGN SUMMARY

-------
Nominal number of tested concentrations:
4

Standard minimum concentration tested:

5.62 nM
Key positive control:
colchicine

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

151 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.034
Response cutoff threshold used to determine hit calls: 0.101
Detection technology used: ELISA (Fluorescence)

2.6	Response: The Monocyte Activation (LPS) system models chronic monocyte activation and vascular
inflammation. This system is relevant to inflammatory conditions where monocytes play a key role including
atherosclerosis, restenosis, rheumatoid arthritis, other chronic inflammatory conditions and metabolic diseases.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cytokine.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Measurement values for each well (one biomarker per well) were divided by the mean value
from 8 DMSO control samples (from the same plate) to generate a ratio. All ratios were then loglO transformed.
Historical controls are the loglO-ratios of DMSO control wells that are collected over time (23 experimental runs
collected over 2 years). The transformed ratios for the 12-assay BioMAP panel were received by the EPA and
loaded into the ToxCast database, invitrodb under the BioSeek assay source identifier, abbreviated as BSK. BSK
was used for continuity in invitrodb and in public versions of ToxCast data despite more recent changes in the
name and ownership of the assay technology (now owned by Eurofins Discovery and referred to as BioMAP
systems).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

NA

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 4: Iogl0_1.2 (Add a cutoff value of Iogl0(1.2). Typically for fold change data.), 15: loec.coff
(Method not yet updated for tcpl implementation: Identify the lowest observed effective concentration
(loec) compared to baseline.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 495

Number of chemicals tested: 238

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
84

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 42.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Houck KA, Dix DJ, Judson RS, Kavlock RJ, Yang J, Berg EL. Profiling bioactivity of the ToxCast
chemical library using BioMAP primary human cell systems. J Biomol Screen. 2009 Oct;14(9):1054-66. doi:
10.1177/1087057109345525. Epub 2009 Sep 22. PubMed PMID: 19773588., Kleinstreuer NC, Yang J, Berg EL,
Knudsen TB, Richard AM, Martin MT, Reif DM, Judson RS, Polokoff M, Dix DJ, Kavlock RJ, Houck KA. Phenotypic
screening of the ToxCast chemical library to classify toxic and therapeutic mechanisms. Nat Biotechnol. 2014
Jun;32(6):583-91. doi: 10.1038/nbt.2914. Epub 2014 May 18. PubMed PMID: 24837663., Kunkel EJ, Dea M,
Ebens A, Hytopoulos E, Melrose J, Nguyen D, Ota KS, Plavec I, Wang Y, Watson SR, Butcher EC, Berg EL. An
integrative biology approach for analysis of drug action in models of human vascular inflammation. FASEB J.
2004 Aug; 18(11):1279-81. Epub 2004 Jun 18. PubMed PMID: 15208272., Kunkel EJ, Plavec I, Nguyen D, Melrose
J, Rosier ES, Kao LT, Wang Y, Hytopoulos E, Bishop AC, Bateman R, Shokat KM, Butcher EC, Berg EL. Rapid
structure-activity and selectivity analysis of kinase inhibitors by BioMAP analysis in complex human primary cell-
based models. Assay Drug DevTechnol. 2004 Aug;2(4):431-41. PubMed PMID: 15357924., Berg EL. Human Cell-
Based in vitro Phenotypic Profiling for Drug Safety-Related Attrition. Front Big Data. 2019 Dec 11;2:47. doi:
10.3389/fdata.2019.00047. PMID: 33693370; PMCID: PMC7931891., Wetmore BA, Clewell RA, Cholewa B, Parks
B, Pendse SN, Black MB, Mansouri K, Haider S, Berg EL, Judson RS, Houck KA, Martin M, Clewell HJ 3rd, Andersen
ME, Thomas RS, McMullen PD. Assessing bioactivity-exposure profiles of fruit and vegetable extracts in the
BioMAP profiling system. Toxicol In Vitro. 2019 Feb;54:41-57. doi: 10.1016/j.tiv.2018.09.006. Epub 2018 Sep 12.
PMID: 30218698; PMCID: PMC6635950.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2484

CCTE_Deisen roth_AI M E_96WELL_LUC_Active

1. General Information

1.1	Assay Title: CCTE's 96-well Estrogen Receptor Transactivation Assay with AIME metabolic competence (ERTA
luciferase; metabolism positive), Deisenroth Lab

1.2	Assay Summary: CCTE Deisenroth AIME 96WELL(AIME metabolism assay) is a cell-based, single-readout assay
that uses VM7Luc4E2, a human mammary gland/breast cell line, with measurements taken at 24 hour after
chemical dosing in a 96-well plate. CCTE_Deisenroth_AIME_96WELL_LUC_Active is one of four assay
components measured or calculated from the CCTE_Deisenroth_AIME_96WELL assay. It is designed to make
measurements of luciferase induction, a form of inducible reporter, as detected with bioluminescence signals
by luciferase-coupled ATP quantitation technology. CCTE_Deisenroth_AIME_96WELL_LUC_Active is a
component of the CCTE_Deisenroth_AIME_96WELL assay that measures the effect of retroftting the VM7Luc
ERTA assay with the Alginate Immobolization of Metabolic Enzymes (AIME) metabolism method. AIME is
optimized to impart phase I hepatic metabolism using phenobarbital/beta-naphthoflavone induced hepatic S9
from male Sprague Dawley rats. Parent chemicals are incubated for 2 hours in the AIME assay prior to transfer
to the VM7Luc ERTA assay plate. The AIME metabolism assay is run simultaneously in negative (inactive) and
positive (active) modes to evaluate the activity of parent and metabolites, respectively. This endpoint evaluates
reporter induction in metabolism positive mode. Using a type of inducible reporter in VM7Luc cells, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to xenobiotic metabolism
and the gene ESR1. This assay endpoint, CCTE_Deisenroth_AIME_96WELL_LUC_Active, was analyzed in the
positive analysis fitting direction relative to 17beta-estradiol as the positive control in the ERTA assay. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the nuclear
receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The AIME method has been successfully applied to the VM7Luc estrogen receptor
transactivation assay (ERTA) to evaluate metabolism-dependent false positive and false negative target assay
effects for a set of ER active and inactive chemicals (Deisenroth 2020, Hopperstad 2022). It has also been
employed to screen a set of 768 chemicals derived from theToxCast library to provide a model for incorporating
hepatic metabolism into existing test guideline studies including experimental design components, reference
chemical consideration, assay performance criteria, and data analysis workflows.

1.9	Assay Throughput: 96-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: CCTE_Deisenroth_AIME_96WELL_LUC_Active was designed to measure changes to
bioluminescence signals produced from an enzymatic reaction involving the key substrate [Bright-Glo], Changes
are indicative of changes in transcriptional gene expression due to agonist activity regulated by the human
estrogen receptor 1 [GeneSymbokESRl | GenelD:2099 | Uniprot_SwissProt_Accession:P03372].

The Alginate Immobilization of Metabolic Enzymes (AIME) is a lid-based technique for retrofitting existing
bioassays with hepatic metabolic competence (Deisenroth 2020, Hopperstad 2022). The platform consists of
custom microplate peg lids with encapsulated hepatic S9-alginate microspheres attached to solid supports that
metabolize test compounds for 96- and 384-well based assays.

2.2	Scientific Principles: A potential limitation to the use of in vitro assay data in regulatory decision-making is the
lack of coverage for xenobiotic metabolic processes occurring within an organism. Indeed, hepatic- and
peripheral-tissue metabolism can yield metabolites that exhibit greater activity than the parent compound
(bioactivation) or are inactive (bioinactivation) for a given biological target. Interpretation of biological effect
data for chemicals screened in high-throughput screening (HTS) assays may benefit from incorporation of
xenobiotic metabolic competence to decrease uncertainty associated with predicting potential human health
hazards from in vitro assay data (Jacobs 2008, Jacobs 2013, OECD 2014). The Alginate Immobilization of
Metabolic Enzymes (AIME) platform consists of custom 96- or 384-well microplate lids containing solid supports
attached to encapsulated hepatic S9-alginate microspheres. The optimized platform uses rat or human hepatic
S9 to evaluate phase I xenobiotic metabolism while minimizing cytotoxic and assay interference effects.
Deployment to an estrogen receptor transactivation (ERTA) assay with a test set of compounds previously
identified by estrogen receptor quantitative structure-activity relationship (ER-QSAR) analysis to generate ER-
active metabolites6 demonstrated utility for identification of false positive and false negative target assay
effects, reprioritization of hazard based on metabolism-dependent bioactivity, and enhanced in vivo
concordance with the rodent uterotrophic bioassay.

2.3	Experimental System: adherent VM7Luc4E2 cell line used. The AIME platform consists of a custom polystyrene
lids with 96- or 384- solid support pegs designed to be compatible with all ANSI/SLAS standard microplates. Each
peg of the AIME lid contains a single microsphere of alginate hydrogel mixed with hepatic S9 fraction
(metabolism positive mode) or alginate hydrogel mixed with water (metabolism negative) and reach a standard
depth that provides sufficient distance to the well bottom. The method is compatible with biochemical and cell-
based target assays.

2.4	Metabolic Competence: Initial characterization of the AIME microsphere function focused on two key areas: 1)
the ability of small molecules to freely enter the microsphere and undergo metabolism by the cytochrome P450
enzymes, and 2) the duration of cytochrome P450 enzyme activity of the S9. To evaluate these features, a panel
of five reference substrates (phenacetin, dextromethorphan, bupropion, diclofenac, and chlorzoxazone), known
to be metabolized by human cytochrome P450 enzymes to specific and identifiable metabolites (Bjornsson
2003), were incubated with the AIME lids for 0, 2, 4, and 8 hours and analyzed by LC-MS/MS. The results
demonstrate that S9 functioned well within the alginate microsphere and formed detectable metabolites from
the probe substrates.

2.5	Exposure Regime: The AIME procedure consists of a preparative "pre-stamping" step using a barium
chloride/poly-L-lysine (PLL) solution followed by "stamping" the lid in alginate-hepatic S9 (metabolism positive)
or alginate-water (metabolism negative) to form microspheres. Once stamping is completed, the AIME lids are
placed in duplicate microplates (metabolism positive or negative) containing chemical solutions and incubated
for two hours to facilitate metabolic reactions. The conditioned medium, containing parent chemical or
metabolites, is then transferred to the cell-based VM7Luc Estrogen Receptor Transactivation Assay (ERTA) for
24 hours. A typical experimental design tests a 10-point concentration series (0.002 - 200 uM) for 8 compounds
(96-well format)l or 32 compounds (384-well format)2 in single technical replicate. AIME assay controls include
trans-stilbene (bioactivated control) and ethylparaben (bioinactivated control). ERTA assay plate controls
include 17-beta estradiol (positive control) and DMSO (solvent control). Experiments typically comprise a
minimum of four experimental replicates (n=4).

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.002 nM
Key positive control:

17beta-estradiol

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.322
Response cutoff threshold used to determine hit calls: 3.224
Detection technology used: multimode microplate reader (Luminescence)

2.6	Response: The ERTA assay evaluates binding and transactivation of an estrogen receptor (ER)-responsive
luciferase reporter to determine endogenous ER activity. AIME assay controls were selected for compatibility
with the ERTA assay and are intended to demonstrate metabolic-dependent shifts in estrogenic bioactivity. A
fluorescence-based cell viability reagent is used to measure treatment dependent cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Well-level data obtained as raw luminescence units for the ERTA luciferase assay are normalized
as percent activity of positive control. Well-level data obtained as raw fluorescence intensity units for the
cytotoxicity assay are normalized as log2 fold change to the DMSO control. Concentration-response modeling
of the 'percent activity' and 'log2 fold change' endpoints is used to derive quantitative potency and efficacy
values. Metabolism-dependent 'shifts' in bioactivity are evaluated by modeling the difference in concentration-
dependent effects.

NA

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise maximum,
by assay plate ID (apid), of the medians of the corrected values (cval) forgain-of-signal single- or multiple-
concentration negative control wells (wilt = m or o) by apid, well type, and concentration.), 11:
bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 43: resp.pc.pval.cor (Calculate
the normalized response (resp) as a percent of control, i.e. the ratio of the difference between the
corrected (cval) and baseline (bval) values divided the positive control (pval) value multiplied by 100;
resp = (cval-bval)/pval*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

7: bmadlO (Add a cutoff value of 10 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical

-------
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 73

Number of chemicals tested: 73

Active hit count: hitc>0.9
39

ACTIVITY HIT CALLS

Inactive hit count: Oihitc 0.9
30

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	19

gain-loss (gnls) model:	21

power(pow) model:	2

linear-polynomial (polyl) model:	3

quadratic-polynomial(poly2) model:	9

exponential-2 (exp2) model:	0

exponential-3 (exp3) model:	0

exponential-4 (exp4) model:	8

exponential-5 (exp5) model:	10

NA hit count: hitc^O
4

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3 Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.

-------
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	36.111

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

927
85.25
12.19%

31225
789.484

NA

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 10.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Deisenroth C, DeGroot DE, Zurlinden T, Eicher A, McCord J, Lee MY, Carmichael P, Thomas RS. The
Alginate Immobilization of Metabolic Enzymes Platform Retrofits an Estrogen Receptor Transactivation Assay
With Metabolic Competence. Toxicol Sci. 2020 Dec 1;178(2):281-301. doi: 10.1093/toxsci/kfaal47. PMID:
32991717; PMCID: PMC8154005., Hopperstad, K., DeGroot, D. E., Zurlinden, T., Brinkman, C., Thomas, R. S., &
Deisenroth, C. (2022). Chemical Screening in an Estrogen Receptor Transactivation Assay With Metabolic
Competence. Toxicological sciences : an official journal of the Society of Toxicology, 187(1), 112-126.
https://doi.org/10.1093/toxsci/kfac019, Jacobs, M. N., Janssens, W., Bernauer, U., Brandon, E., Coecke, S.,
Combes, R., Edwards, P., Freidig, A., Freyberger, A., Kolanczyk, R., Mc Ardle, C., Mekenyan, O., Schmieder, P.,
Schrader, T., Takeyoshi, M., & van der Burg, B. (2008). The use of metabolising systems for in vitro testing of
endocrine disruptors. Current drug metabolism, 9(8), 796-826. https://doi.org/10.2174/138920008786049294,
Jacobs, M. N., Laws, S. C., Willett, K., Schmieder, P., Odum, J., & Bovee, T. F. (2013). In vitro metabolism and
bioavailability tests for endocrine active substances: what is needed next for regulatory purposes?. ALTEX, 30(3),
331-351. https://doi.Org/10.14573/altex.2013.3.331, OECD (2014), Detailed Review Paper on the Use of

-------
Metabolising Systems for In Vitro Testing of Endocrine Disruptors, OECD Series on Testing and Assessment, No.
97, OECD Publishing, Paris, https://doi.org/10.1787/9789264085497-en., Pinto, C. L., Mansouri, K., Judson, R.,
& Browne, P. (2016). Prediction of Estrogenic Bioactivity of Environmental Chemical Metabolites. Chemical
research in toxicology, 29(9), 1410-1427. https://doi.org/10.1021/acs.chemrestox.6b00079, Bjornsson, T. D.,
Callaghan, J. T., Einolf, H. J., Fischer, V., Gan, L., Grimm, S., Kao, J., King, S. P., Miwa, G., Ni, L, Kumar, G., McLeod,
J., Obach, R. S., Roberts, S., Roe, A., Shah, A., Snikeris, F., Sullivan, J. T., Tweedie, D., Vega, J. M.,... FDA Center
for Drug Evaluation and Research (CDER) (2003). The conduct of in vitro and in vivo drug-drug interaction
studies: a Pharmaceutical Research and Manufacturers of America (PhRMA) perspective. Drug metabolism and
disposition: the biological fate of chemicals, 31(7), 815-832. https://doi.Org/10.1124/dmd.31.7.815

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2485

CCTE_Deisen roth_AI M E_96WELL_CTox_Active

1.	General Information

1.1	Assay Title: Cytotoxicity Assessment (CTOX) in CCTE's 96-well Estrogen Receptor Transactivation Assay with
AIME metabolic competence (metabolism positive), Deisenroth Lab

1.2	Assay Summary: CCTE Deisenroth AIME 96WELL(AIME metabolism assay) is a cell-based, single-readout assay
that uses VM7Luc4E2, a human mammary gland/breast cell line, with measurements taken at 24 hour after
chemical dosing in a 96-well plate. CCTE_Deisenroth_AIME_96WELL_CTox_Active is one of four assay
components measured or calculated from the CCTE_Deisenroth_AIME_96WELL assay. It is designed to make
measurements of a protease substrate, a type of viabiliy indicator, as detected with substrate cleavage to a
fluorescent probe in living cells. CCTE_Deisenroth_AIME_96WELL_CTox_Active is a component of the
CCTE_Deisenroth_AIME_96WELL assay that measures the effect of retroftting the VM7Luc ERTA assay with the
Alginate Immobolization of Metabolic Enzymes (AIME) metabolism method. AIME is optimized to impart phase
I hepatic metabolism using phenobarbital/beta-naphthoflavone induced hepatic S9 from male Sprague Dawley
rats. Parent chemicals are incubated for 2 hours in the AIME assay prior to transfer to the VM7Luc ERTA assay
plate. The AIME metabolism assay is run simultaneously in negative (inactive) and positive (active) modes to
evaluate the activity of parent and metabolites, respectively. This endpoint evaluates cytotoxicity in metabolism
positive mode. Using a type of flourescent probe in VM7Luc cells, loss-of-signal activity can be used to
understand loss of viability. This assay endpoint, CCTE_Deisenroth_AIME_96WELL_CTox_Active, was analyzed
with bidirectional fitting in the ERTA assay.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The AIME method has been successfully applied to the VM7Luc estrogen receptor
transactivation assay (ERTA) to evaluate metabolism-dependent false positive and false negative target assay
effects for a set of ER active and inactive chemicals (Deisenroth 2020, Hopperstad 2022). It has also been
employed to screen a set of 768 chemicals derived from theToxCast library to provide a model for incorporating
hepatic metabolism into existing test guideline studies including experimental design components, reference
chemical consideration, assay performance criteria, and data analysis workflows.

1.9	Assay Throughput: 96-well plate.

l.lOStatus: The assay is fully developed,

l.llAbbreviations:

AIC: Akaike Information Criterion
AOP: Adverse Outcome Pathway
CV: Coefficient of Variation
DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: CCTE_Deisenroth_AIME_96WELL_CTox_Active is designed to make measurements of a protease
substrate [Gly-Phe-AFCoumarin], a type of viabiliy indicator. The non-flourescent substrate is enzymatically
converted to a fluorescent probe in living cells. Changes in the signal are indicative of cytotoxicity.

and data are publicly available in ToxCast s invitroDB.

ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package
SSMD: Strictly Standardized Mean Difference

-------
The Alginate Immobilization of Metabolic Enzymes (AIME) is a lid-based technique for retrofitting existing
bioassays with hepatic metabolic competence (Deisenroth 2020, Hopperstad 2022). The platform consists of
custom microplate peg lids with encapsulated hepatic S9-alginate microspheres attached to solid supports that
metabolize test compounds for 96- and 384-well based assays.

2.2	Scientific Principles: A potential limitation to the use of in vitro assay data in regulatory decision-making is the
lack of coverage for xenobiotic metabolic processes occurring within an organism. Indeed, hepatic- and
peripheral-tissue metabolism can yield metabolites that exhibit greater activity than the parent compound
(bioactivation) or are inactive (bioinactivation) for a given biological target. Interpretation of biological effect
data for chemicals screened in high-throughput screening (HTS) assays may benefit from incorporation of
xenobiotic metabolic competence to decrease uncertainty associated with predicting potential human health
hazards from in vitro assay data (Jacobs 2008, Jacobs 2013, OECD 2014). The Alginate Immobilization of
Metabolic Enzymes (AIME) platform consists of custom 96- or 384-well microplate lids containing solid supports
attached to encapsulated hepatic S9-alginate microspheres. The optimized platform uses rat or human hepatic
S9 to evaluate phase I xenobiotic metabolism while minimizing cytotoxic and assay interference effects.
Deployment to an estrogen receptor transactivation (ERTA) assay with a test set of compounds previously
identified by estrogen receptor quantitative structure-activity relationship (ER-QSAR) analysis to generate ER-
active metabolites6 demonstrated utility for identification of false positive and false negative target assay
effects, reprioritization of hazard based on metabolism-dependent bioactivity, and enhanced in vivo
concordance with the rodent uterotrophic bioassay.

2.3	Experimental System: adherent VM7Luc4E2 cell line used. The AIME platform consists of a custom polystyrene
lids with 96- or 384- solid support pegs designed to be compatible with all ANSI/SLAS standard microplates. Each
peg of the AIME lid contains a single microsphere of alginate hydrogel mixed with hepatic S9 fraction
(metabolism positive mode) or alginate hydrogel mixed with water (metabolism negative) and reach a standard
depth that provides sufficient distance to the well bottom. The method is compatible with biochemical and cell-
based target assays.

2.4	Metabolic Competence: Initial characterization of the AIME microsphere function focused on two key areas: 1)
the ability of small molecules to freely enter the microsphere and undergo metabolism by the cytochrome P450
enzymes, and 2) the duration of cytochrome P450 enzyme activity of the S9. To evaluate these features, a panel
of five reference substrates (phenacetin, dextromethorphan, bupropion, diclofenac, and chlorzoxazone), known
to be metabolized by human cytochrome P450 enzymes to specific and identifiable metabolites (Bjornsson
2003), were incubated with the AIME lids for 0, 2, 4, and 8 hours and analyzed by LC-MS/MS. The results
demonstrate that S9 functioned well within the alginate microsphere and formed detectable metabolites from
the probe substrates.

2.5	Exposure Regime: The AIME procedure consists of a preparative "pre-stamping" step using a barium
chloride/poly-L-lysine (PLL) solution followed by "stamping" the lid in alginate-hepatic S9 (metabolism positive)
or alginate-water (metabolism negative) to form microspheres. Once stamping is completed, the AIME lids are
placed in duplicate microplates (metabolism positive or negative) containing chemical solutions and incubated
for two hours to facilitate metabolic reactions. The conditioned medium, containing parent chemical or
metabolites, is then transferred to the cell-based VM7Luc Estrogen Receptor Transactivation Assay (ERTA) for
24 hours. A typical experimental design tests a 10-point concentration series (0.002 - 200 uM) for 8 compounds
(96-well format)l or 32 compounds (384-well format)2 in single technical replicate. AIME assay controls include
trans-stilbene (bioactivated control) and ethylparaben (bioinactivated control). ERTA assay plate controls
include 17-beta estradiol (positive control) and DMSO (solvent control). Experiments typically comprise a
minimum of four experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

-------
0.002 nM
Key positive control:
NA

200 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.019
Response cutoff threshold used to determine hit calls: 0.189
Detection technology used: multimode microplate reader (Fluorescence)

2.6	Response: The ERTA assay evaluates binding and transactivation of an estrogen receptor (ER)-responsive
luciferase reporter to determine endogenous ER activity. AIME assay controls were selected for compatibility
with the ERTA assay and are intended to demonstrate metabolic-dependent shifts in estrogenic bioactivity. A
fluorescence-based cell viability reagent is used to measure treatment dependent cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Well-level data obtained as raw luminescence units for the ERTA luciferase assay are normalized
as percent activity of positive control. Well-level data obtained as raw fluorescence intensity units for the
cytotoxicity assay are normalized as log2 fold change to the DMSO control. Concentration-response modeling
of the 'percent activity' and 'log2 fold change' endpoints is used to derive quantitative potency and efficacy
values. Metabolism-dependent 'shifts' in bioactivity are evaluated by modeling the difference in concentration-
dependent effects.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

NA

-------
Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay
plate ID (apid), of the corrected values (cval) for neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

7: bmadlO (Add a cutoff value of 10 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.), 27:
ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis
direction. Typically used for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 73

Number of chemicals tested: 73

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
14

Inactive hit count: 0
-------
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	58290.5

Neutral control median absolute deviation, by plate: nmad	808.017

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	1.31%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	7111.5

Positive control well median absolute deviation, by plate: pmad	1653.099

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-23.793

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 6.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Deisenroth C, DeGroot DE, Zurlinden T, Eicher A, McCord J, Lee MY, Carmichael P, Thomas RS. The
Alginate Immobilization of Metabolic Enzymes Platform Retrofits an Estrogen Receptor Transactivation Assay
With Metabolic Competence. Toxicol Sci. 2020 Dec 1;178(2):281-301. doi: 10.1093/toxsci/kfaal47. PMID:
32991717; PMCID: PMC8154005., Hopperstad, K., DeGroot, D. E., Zurlinden, T., Brinkman, C., Thomas, R. S., &
Deisenroth, C. (2022). Chemical Screening in an Estrogen Receptor Transactivation Assay With Metabolic
Competence. Toxicological sciences : an official journal of the Society of Toxicology, 187(1), 112-126.
https://doi.org/10.1093/toxsci/kfac019, Jacobs, M. N., Janssens, W., Bernauer, U., Brandon, E., Coecke, S.,
Combes, R., Edwards, P., Freidig, A., Freyberger, A., Kolanczyk, R., Mc Ardle, C., Mekenyan, O., Schmieder, P.,
Schrader, T., Takeyoshi, M., & van der Burg, B. (2008). The use of metabolising systems for in vitro testing of
endocrine disruptors. Current drug metabolism, 9(8), 796-826. https://doi.org/10.2174/138920008786049294,
Jacobs, M. N., Laws, S. C., Willett, K., Schmieder, P., Odum, J., & Bovee, T. F. (2013). In vitro metabolism and
bioavailability tests for endocrine active substances: what is needed next for regulatory purposes?. ALTEX, 30(3),
331-351. https://doi.Org/10.14573/altex.2013.3.331, OECD (2014), Detailed Review Paper on the Use of
Metabolising Systems for In Vitro Testing of Endocrine Disruptors, OECD Series on Testing and Assessment, No.
97, OECD Publishing, Paris, https://doi.org/10.1787/9789264085497-en., Pinto, C. L., Mansouri, K., Judson, R.,
& Browne, P. (2016). Prediction of Estrogenic Bioactivity of Environmental Chemical Metabolites. Chemical
research in toxicology, 29(9), 1410-1427. https://doi.org/10.1021/acs.chemrestox.6b00079, Bjornsson, T. D.,
Callaghan, J. T., Einolf, H. J., Fischer, V., Gan, L., Grimm, S., Kao, J., King, S. P., Miwa, G., Ni, L, Kumar, G., McLeod,
J., Obach, R. S., Roberts, S., Roe, A., Shah, A., Snikeris, F., Sullivan, J. T., Tweedie, D., Vega, J. M.,... FDA Center
for Drug Evaluation and Research (CDER) (2003). The conduct of in vitro and in vivo drug-drug interaction

-------
studies: a Pharmaceutical Research and Manufacturers of America (PhRMA) perspective. Drug metabolism and
disposition: the biological fate of chemicals, 31(7), 815-832. https://doi.Org/10.1124/dmd.31.7.815

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2486

CCTE_Deisen roth_AI M E_96WELL_LUC_I nactive

1. General Information

1.1	Assay Title: CCTE's 96-well Estrogen Receptor Transactivation Assay without AIME metabolic competence (ERTA
luciferase; metabolism negative), Deisenroth Lab

1.2	Assay Summary: CCTE Deisenroth AIME 96WELL(AIME metabolism assay) is a cell-based, single-readout assay
that uses VM7Luc4E2, a human mammary gland/breast cell line, with measurements taken at 24 hour after
chemical dosing in a 96-well plate. CCTE_Deisenroth_AIME_96WELL_LUC_lnactive is one of four assay
components measured or calculated from the CCTE_Deisenroth_AIME_96WELL assay. It is designed to make
measurements of luciferase induction, a form of inducible reporter, as detected with bioluminescence signals
by luciferase-coupled ATP quantitation technology. CCTE_Deisenroth_AIME_96WELL_LUC_lnactive is a
component of the CCTE_Deisenroth_AIME_96WELL assay that measures the effect of retroftting the VM7Luc
ERTA assay with the Alginate Immobolization of Metabolic Enzymes (AIME) metabolism method. AIME is
optimized to impart phase I hepatic metabolism using phenobarbital/beta-naphthoflavone induced hepatic S9
from male Sprague Dawley rats. Parent chemicals are incubated for 2 hours in the AIME assay prior to transfer
to the VM7Luc ERTA assay plate. The AIME metabolism assay is run simultaneously in negative (inactive) and
positive (active) modes to evaluate the activity of parent and metabolites, respectively. This endpoint evaluates
reporter induction in metabolism negative mode. Using a type of inducible reporter in VM7Luc cells, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to xenobiotic metabolism
and the gene ESR1. This assay endpoint, CCTE_Deisenroth_AIME_96WELL_LUC_lnactive, was analyzed in the
positive analysis fitting direction relative to 17beta-estradiol as the positive control in the ERTA assay. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the nuclear
receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The AIME method has been successfully applied to the VM7Luc estrogen receptor
transactivation assay (ERTA) to evaluate metabolism-dependent false positive and false negative target assay
effects for a set of ER active and inactive chemicals (Deisenroth 2020, Hopperstad 2022). It has also been
employed to screen a set of 768 chemicals derived from theToxCast library to provide a model for incorporating
hepatic metabolism into existing test guideline studies including experimental design components, reference
chemical consideration, assay performance criteria, and data analysis workflows.

1.9	Assay Throughput: 96-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: CCTE_Deisenroth_AIME_96WELL_LUC_lnactive was designed to measure changes to
bioluminescence signals produced from an enzymatic reaction involving the key substrate [Bright-Glo], Changes
are indicative of changes in transcriptional gene expression due to agonist activity regulated by the human
estrogen receptor 1 [GeneSymbokESRl | GenelD:2099 | Uniprot_SwissProt_Accession:P03372].

The Alginate Immobilization of Metabolic Enzymes (AIME) is a lid-based technique for retrofitting existing
bioassays with hepatic metabolic competence (Deisenroth 2020, Hopperstad 2022). The platform consists of
custom microplate peg lids with encapsulated hepatic S9-alginate microspheres attached to solid supports that
metabolize test compounds for 96- and 384-well based assays.

2.2	Scientific Principles: A potential limitation to the use of in vitro assay data in regulatory decision-making is the
lack of coverage for xenobiotic metabolic processes occurring within an organism. Indeed, hepatic- and
peripheral-tissue metabolism can yield metabolites that exhibit greater activity than the parent compound
(bioactivation) or are inactive (bioinactivation) for a given biological target. Interpretation of biological effect
data for chemicals screened in high-throughput screening (HTS) assays may benefit from incorporation of
xenobiotic metabolic competence to decrease uncertainty associated with predicting potential human health
hazards from in vitro assay data (Jacobs 2008, Jacobs 2013, OECD 2014). The Alginate Immobilization of
Metabolic Enzymes (AIME) platform consists of custom 96- or 384-well microplate lids containing solid supports
attached to encapsulated hepatic S9-alginate microspheres. The optimized platform uses rat or human hepatic
S9 to evaluate phase I xenobiotic metabolism while minimizing cytotoxic and assay interference effects.
Deployment to an estrogen receptor transactivation (ERTA) assay with a test set of compounds previously
identified by estrogen receptor quantitative structure-activity relationship (ER-QSAR) analysis to generate ER-
active metabolites6 demonstrated utility for identification of false positive and false negative target assay
effects, reprioritization of hazard based on metabolism-dependent bioactivity, and enhanced in vivo
concordance with the rodent uterotrophic bioassay.

2.3	Experimental System: adherent VM7Luc4E2 cell line used. The AIME platform consists of a custom polystyrene
lids with 96- or 384- solid support pegs designed to be compatible with all ANSI/SLAS standard microplates. Each
peg of the AIME lid contains a single microsphere of alginate hydrogel mixed with hepatic S9 fraction
(metabolism positive mode) or alginate hydrogel mixed with water (metabolism negative) and reach a standard
depth that provides sufficient distance to the well bottom. The method is compatible with biochemical and cell-
based target assays.

2.4	Metabolic Competence: Initial characterization of the AIME microsphere function focused on two key areas: 1)
the ability of small molecules to freely enter the microsphere and undergo metabolism by the cytochrome P450
enzymes, and 2) the duration of cytochrome P450 enzyme activity of the S9. To evaluate these features, a panel
of five reference substrates (phenacetin, dextromethorphan, bupropion, diclofenac, and chlorzoxazone), known
to be metabolized by human cytochrome P450 enzymes to specific and identifiable metabolites (Bjornsson
2003), were incubated with the AIME lids for 0, 2, 4, and 8 hours and analyzed by LC-MS/MS. The results
demonstrate that S9 functioned well within the alginate microsphere and formed detectable metabolites from
the probe substrates.

2.5	Exposure Regime: The AIME procedure consists of a preparative "pre-stamping" step using a barium
chloride/poly-L-lysine (PLL) solution followed by "stamping" the lid in alginate-hepatic S9 (metabolism positive)
or alginate-water (metabolism negative) to form microspheres. Once stamping is completed, the AIME lids are
placed in duplicate microplates (metabolism positive or negative) containing chemical solutions and incubated
for two hours to facilitate metabolic reactions. The conditioned medium, containing parent chemical or
metabolites, is then transferred to the cell-based VM7Luc Estrogen Receptor Transactivation Assay (ERTA) for
24 hours. A typical experimental design tests a 10-point concentration series (0.002 - 200 uM) for 8 compounds
(96-well format)l or 32 compounds (384-well format)2 in single technical replicate. AIME assay controls include
trans-stilbene (bioactivated control) and ethylparaben (bioinactivated control). ERTA assay plate controls
include 17-beta estradiol (positive control) and DMSO (solvent control). Experiments typically comprise a
minimum of four experimental replicates (n=4).

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.002 nM
Key positive control:

17beta-estradiol

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.188
Response cutoff threshold used to determine hit calls: 1.878
Detection technology used: multimode microplate reader (Luminescence)

2.6	Response: The ERTA assay evaluates binding and transactivation of an estrogen receptor (ER)-responsive
luciferase reporter to determine endogenous ER activity. AIME assay controls were selected for compatibility
with the ERTA assay and are intended to demonstrate metabolic-dependent shifts in estrogenic bioactivity. A
fluorescence-based cell viability reagent is used to measure treatment dependent cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Well-level data obtained as raw luminescence units for the ERTA luciferase assay are normalized
as percent activity of positive control. Well-level data obtained as raw fluorescence intensity units for the
cytotoxicity assay are normalized as log2 fold change to the DMSO control. Concentration-response modeling
of the 'percent activity' and 'log2 fold change' endpoints is used to derive quantitative potency and efficacy
values. Metabolism-dependent 'shifts' in bioactivity are evaluated by modeling the difference in concentration-
dependent effects.

NA

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise maximum,
by assay plate ID (apid), of the medians of the corrected values (cval) forgain-of-signal single- or multiple-
concentration negative control wells (wilt = m or o) by apid, well type, and concentration.), 11:
bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 43: resp.pc.pval.cor (Calculate
the normalized response (resp) as a percent of control, i.e. the ratio of the difference between the
corrected (cval) and baseline (bval) values divided the positive control (pval) value multiplied by 100;
resp = (cval-bval)/pval*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

7: bmadlO (Add a cutoff value of 10 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical

-------
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 73

Number of chemicals tested: 73

Active hit count: hitc>0.9
47

ACTIVITY HIT CALLS

Inactive hit count: Oihitc 0.9
19

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	16

gain-loss (gnls) model:	15

power(pow) model:	6

linear-polynomial (polyl) model:	3

quadratic-polynomial(poly2) model:	7

exponential-2 (exp2) model:	1

exponential-3 (exp3) model:	0

exponential-4 (exp4) model:	13

exponential-5 (exp5) model:	11

NA hit count: hitc^O
7

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3 Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.

-------
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	44.325

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

812
61.528
8.78%

33134.5
687.185

NA

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 11.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Deisenroth C, DeGroot DE, Zurlinden T, Eicher A, McCord J, Lee MY, Carmichael P, Thomas RS. The
Alginate Immobilization of Metabolic Enzymes Platform Retrofits an Estrogen Receptor Transactivation Assay
With Metabolic Competence. Toxicol Sci. 2020 Dec 1;178(2):281-301. doi: 10.1093/toxsci/kfaal47. PMID:
32991717; PMCID: PMC8154005., Hopperstad, K., DeGroot, D. E., Zurlinden, T., Brinkman, C., Thomas, R. S., &
Deisenroth, C. (2022). Chemical Screening in an Estrogen Receptor Transactivation Assay With Metabolic
Competence. Toxicological sciences : an official journal of the Society of Toxicology, 187(1), 112-126.
https://doi.org/10.1093/toxsci/kfac019, Jacobs, M. N., Janssens, W., Bernauer, U., Brandon, E., Coecke, S.,
Combes, R., Edwards, P., Freidig, A., Freyberger, A., Kolanczyk, R., Mc Ardle, C., Mekenyan, O., Schmieder, P.,
Schrader, T., Takeyoshi, M., & van der Burg, B. (2008). The use of metabolising systems for in vitro testing of
endocrine disruptors. Current drug metabolism, 9(8), 796-826. https://doi.org/10.2174/138920008786049294,
Jacobs, M. N., Laws, S. C., Willett, K., Schmieder, P., Odum, J., & Bovee, T. F. (2013). In vitro metabolism and
bioavailability tests for endocrine active substances: what is needed next for regulatory purposes?. ALTEX, 30(3),
331-351. https://doi.Org/10.14573/altex.2013.3.331, OECD (2014), Detailed Review Paper on the Use of

-------
Metabolising Systems for In Vitro Testing of Endocrine Disruptors, OECD Series on Testing and Assessment, No.
97, OECD Publishing, Paris, https://doi.org/10.1787/9789264085497-en., Pinto, C. L., Mansouri, K., Judson, R.,
& Browne, P. (2016). Prediction of Estrogenic Bioactivity of Environmental Chemical Metabolites. Chemical
research in toxicology, 29(9), 1410-1427. https://doi.org/10.1021/acs.chemrestox.6b00079, Bjornsson, T. D.,
Callaghan, J. T., Einolf, H. J., Fischer, V., Gan, L., Grimm, S., Kao, J., King, S. P., Miwa, G., Ni, L, Kumar, G., McLeod,
J., Obach, R. S., Roberts, S., Roe, A., Shah, A., Snikeris, F., Sullivan, J. T., Tweedie, D., Vega, J. M.,... FDA Center
for Drug Evaluation and Research (CDER) (2003). The conduct of in vitro and in vivo drug-drug interaction
studies: a Pharmaceutical Research and Manufacturers of America (PhRMA) perspective. Drug metabolism and
disposition: the biological fate of chemicals, 31(7), 815-832. https://doi.Org/10.1124/dmd.31.7.815

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2487

CCTE_Deisen roth_AI M E_96WELL_CToxJ nactive

1.	General Information

1.1	Assay Title: Cytotoxicity Assessment (CTOX) in CCTE's 96-well Estrogen Receptor Transactivation Assay without
AIME metabolic competence (metabolism negative), Deisenroth Lab

1.2	Assay Summary: CCTE Deisenroth AIME 96WELL(AIME metabolism assay) is a cell-based, single-readout assay
that uses VM7Luc4E2, a human mammary gland/breast cell line, with measurements taken at 24 hour after
chemical dosing in a 96-well plate. CCTE_Deisenroth_AIME_96WELL_CTox_lnactive is one of four assay
components measured or calculated from the CCTE_Deisenroth_AIME_96WELL assay. It is designed to make
measurements of a protease substrate, a type of viabiliy indicator, as detected with substrate cleavage to a
fluorescent probe in living cells. CCTE_Deisenroth_AIME_96WELL_CTox_lnactive is a component of the
CCTE_Deisenroth_AIME_96WELL assay that measures the effect of retroftting the VM7Luc ERTA assay with the
Alginate Immobolization of Metabolic Enzymes (AIME) metabolism method. AIME is optimized to impart phase
I hepatic metabolism using phenobarbital/beta-naphthoflavone induced hepatic S9 from male Sprague Dawley
rats. Parent chemicals are incubated for 2 hours in the AIME assay prior to transfer to the VM7Luc ERTA assay
plate. The AIME metabolism assay is run simultaneously in negative (inactive) and positive (active) modes to
evaluate the activity of parent and metabolites, respectively. This endpoint evaluates cytotoxicity in metabolism
negative mode. Using a type of flourescent probe in VM7Luc cells, loss-of-signal activity can be used to
understand loss of viability. This assay endpoint, CCTE_Deisenroth_AIME_96WELL_CTox_lnactive, was analyzed
with bidirectional fitting in the ERTA assay.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The AIME method has been successfully applied to the VM7Luc estrogen receptor
transactivation assay (ERTA) to evaluate metabolism-dependent false positive and false negative target assay
effects for a set of ER active and inactive chemicals (Deisenroth 2020, Hopperstad 2022). It has also been
employed to screen a set of 768 chemicals derived from theToxCast library to provide a model for incorporating
hepatic metabolism into existing test guideline studies including experimental design components, reference
chemical consideration, assay performance criteria, and data analysis workflows.

1.9	Assay Throughput: 96-well plate.

l.lOStatus: The assay is fully developed,

l.llAbbreviations:

AIC: Akaike Information Criterion
AOP: Adverse Outcome Pathway
CV: Coefficient of Variation
DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: CCTE_Deisenroth_AIME_96WELL_CTox_lnactive is designed to make measurements of a protease
substrate [Gly-Phe-AFCoumarin], a type of viabiliy indicator. The non-flourescent substrate is enzymatically
converted to a fluorescent probe in living cells. Changes in the signal are indicative of cytotoxicity.

and data are publicly available in ToxCast s invitroDB.

ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package
SSMD: Strictly Standardized Mean Difference

-------
The Alginate Immobilization of Metabolic Enzymes (AIME) is a lid-based technique for retrofitting existing
bioassays with hepatic metabolic competence (Deisenroth 2020, Hopperstad 2022). The platform consists of
custom microplate peg lids with encapsulated hepatic S9-alginate microspheres attached to solid supports that
metabolize test compounds for 96- and 384-well based assays.

2.2	Scientific Principles: A potential limitation to the use of in vitro assay data in regulatory decision-making is the
lack of coverage for xenobiotic metabolic processes occurring within an organism. Indeed, hepatic- and
peripheral-tissue metabolism can yield metabolites that exhibit greater activity than the parent compound
(bioactivation) or are inactive (bioinactivation) for a given biological target. Interpretation of biological effect
data for chemicals screened in high-throughput screening (HTS) assays may benefit from incorporation of
xenobiotic metabolic competence to decrease uncertainty associated with predicting potential human health
hazards from in vitro assay data (Jacobs 2008, Jacobs 2013, OECD 2014). The Alginate Immobilization of
Metabolic Enzymes (AIME) platform consists of custom 96- or 384-well microplate lids containing solid supports
attached to encapsulated hepatic S9-alginate microspheres. The optimized platform uses rat or human hepatic
S9 to evaluate phase I xenobiotic metabolism while minimizing cytotoxic and assay interference effects.
Deployment to an estrogen receptor transactivation (ERTA) assay with a test set of compounds previously
identified by estrogen receptor quantitative structure-activity relationship (ER-QSAR) analysis to generate ER-
active metabolites6 demonstrated utility for identification of false positive and false negative target assay
effects, reprioritization of hazard based on metabolism-dependent bioactivity, and enhanced in vivo
concordance with the rodent uterotrophic bioassay.

2.3	Experimental System: adherent VM7Luc4E2 cell line used. The AIME platform consists of a custom polystyrene
lids with 96- or 384- solid support pegs designed to be compatible with all ANSI/SLAS standard microplates. Each
peg of the AIME lid contains a single microsphere of alginate hydrogel mixed with hepatic S9 fraction
(metabolism positive mode) or alginate hydrogel mixed with water (metabolism negative) and reach a standard
depth that provides sufficient distance to the well bottom. The method is compatible with biochemical and cell-
based target assays.

2.4	Metabolic Competence: Initial characterization of the AIME microsphere function focused on two key areas: 1)
the ability of small molecules to freely enter the microsphere and undergo metabolism by the cytochrome P450
enzymes, and 2) the duration of cytochrome P450 enzyme activity of the S9. To evaluate these features, a panel
of five reference substrates (phenacetin, dextromethorphan, bupropion, diclofenac, and chlorzoxazone), known
to be metabolized by human cytochrome P450 enzymes to specific and identifiable metabolites (Bjornsson
2003), were incubated with the AIME lids for 0, 2, 4, and 8 hours and analyzed by LC-MS/MS. The results
demonstrate that S9 functioned well within the alginate microsphere and formed detectable metabolites from
the probe substrates.

2.5	Exposure Regime: The AIME procedure consists of a preparative "pre-stamping" step using a barium
chloride/poly-L-lysine (PLL) solution followed by "stamping" the lid in alginate-hepatic S9 (metabolism positive)
or alginate-water (metabolism negative) to form microspheres. Once stamping is completed, the AIME lids are
placed in duplicate microplates (metabolism positive or negative) containing chemical solutions and incubated
for two hours to facilitate metabolic reactions. The conditioned medium, containing parent chemical or
metabolites, is then transferred to the cell-based VM7Luc Estrogen Receptor Transactivation Assay (ERTA) for
24 hours. A typical experimental design tests a 10-point concentration series (0.002 - 200 uM) for 8 compounds
(96-well format)l or 32 compounds (384-well format)2 in single technical replicate. AIME assay controls include
trans-stilbene (bioactivated control) and ethylparaben (bioinactivated control). ERTA assay plate controls
include 17-beta estradiol (positive control) and DMSO (solvent control). Experiments typically comprise a
minimum of four experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

-------
0.002 nM
Key positive control:
NA

200 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.017
Response cutoff threshold used to determine hit calls: 0.166
Detection technology used: multimode microplate reader (Fluorescence)

2.6	Response: The ERTA assay evaluates binding and transactivation of an estrogen receptor (ER)-responsive
luciferase reporter to determine endogenous ER activity. AIME assay controls were selected for compatibility
with the ERTA assay and are intended to demonstrate metabolic-dependent shifts in estrogenic bioactivity. A
fluorescence-based cell viability reagent is used to measure treatment dependent cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Well-level data obtained as raw luminescence units for the ERTA luciferase assay are normalized
as percent activity of positive control. Well-level data obtained as raw fluorescence intensity units for the
cytotoxicity assay are normalized as log2 fold change to the DMSO control. Concentration-response modeling
of the 'percent activity' and 'log2 fold change' endpoints is used to derive quantitative potency and efficacy
values. Metabolism-dependent 'shifts' in bioactivity are evaluated by modeling the difference in concentration-
dependent effects.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

NA

-------
Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay
plate ID (apid), of the corrected values (cval) for neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

7: bmadlO (Add a cutoff value of 10 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.), 27:
ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis
direction. Typically used for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 73

Number of chemicals tested: 73

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
16

Inactive hit count: 0
-------
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	48035.5

Neutral control median absolute deviation, by plate: nmad	558.199

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	1.15%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	7390.5

Positive control well median absolute deviation, by plate: pmad	2000.027

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-18.433

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 7.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Deisenroth C, DeGroot DE, Zurlinden T, Eicher A, McCord J, Lee MY, Carmichael P, Thomas RS. The
Alginate Immobilization of Metabolic Enzymes Platform Retrofits an Estrogen Receptor Transactivation Assay
With Metabolic Competence. Toxicol Sci. 2020 Dec 1;178(2):281-301. doi: 10.1093/toxsci/kfaal47. PMID:
32991717; PMCID: PMC8154005., Hopperstad, K., DeGroot, D. E., Zurlinden, T., Brinkman, C., Thomas, R. S., &
Deisenroth, C. (2022). Chemical Screening in an Estrogen Receptor Transactivation Assay With Metabolic
Competence. Toxicological sciences : an official journal of the Society of Toxicology, 187(1), 112-126.
https://doi.org/10.1093/toxsci/kfac019, Jacobs, M. N., Janssens, W., Bernauer, U., Brandon, E., Coecke, S.,
Combes, R., Edwards, P., Freidig, A., Freyberger, A., Kolanczyk, R., Mc Ardle, C., Mekenyan, O., Schmieder, P.,
Schrader, T., Takeyoshi, M., & van der Burg, B. (2008). The use of metabolising systems for in vitro testing of
endocrine disruptors. Current drug metabolism, 9(8), 796-826. https://doi.org/10.2174/138920008786049294,
Jacobs, M. N., Laws, S. C., Willett, K., Schmieder, P., Odum, J., & Bovee, T. F. (2013). In vitro metabolism and
bioavailability tests for endocrine active substances: what is needed next for regulatory purposes?. ALTEX, 30(3),
331-351. https://doi.Org/10.14573/altex.2013.3.331, OECD (2014), Detailed Review Paper on the Use of
Metabolising Systems for In Vitro Testing of Endocrine Disruptors, OECD Series on Testing and Assessment, No.
97, OECD Publishing, Paris, https://doi.org/10.1787/9789264085497-en., Pinto, C. L., Mansouri, K., Judson, R.,
& Browne, P. (2016). Prediction of Estrogenic Bioactivity of Environmental Chemical Metabolites. Chemical
research in toxicology, 29(9), 1410-1427. https://doi.org/10.1021/acs.chemrestox.6b00079, Bjornsson, T. D.,
Callaghan, J. T., Einolf, H. J., Fischer, V., Gan, L., Grimm, S., Kao, J., King, S. P., Miwa, G., Ni, L, Kumar, G., McLeod,
J., Obach, R. S., Roberts, S., Roe, A., Shah, A., Snikeris, F., Sullivan, J. T., Tweedie, D., Vega, J. M.,... FDA Center
for Drug Evaluation and Research (CDER) (2003). The conduct of in vitro and in vivo drug-drug interaction

-------
studies: a Pharmaceutical Research and Manufacturers of America (PhRMA) perspective. Drug metabolism and
disposition: the biological fate of chemicals, 31(7), 815-832. https://doi.Org/10.1124/dmd.31.7.815

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2488

CCTE_Deisenroth_AIME_384WELL_LUC_Active

1. General Information

1.1	Assay Title: CCTE's 384-well Estrogen Receptor Transactivation Assay with AIME metabolic competence (ERTA
luciferase; metabolism positive), Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_AIME_384WELL (AIME metabolism assay) is a cell-based, single-readout
assay that uses VM7Luc4E2, a human mammary gland/breast cell line, with measurements taken at 24 hour
after chemical dosing in a 384-well plate. CCTE_Deisenroth_AIME_384WELL_LUC_Active is one of four assay
components measured or calculated from the CCTE_Deisenroth_AIME_384WELL assay. It is designed to make
measurements of luciferase induction, a form of inducible reporter, as detected with bioluminescence signals
by luciferase-coupled ATP quantitation technology. CCTE_Deisenroth_AIME_384WELL_LUC_Active is a
component of the CCTE_Deisenroth_AIME_384WELL assay that measures the effect of retroftting the VM7Luc
ERTA assay with the Alginate Immobolization of Metabolic Enzymes (AIME) metabolism method. AIME is
optimized to impart phase I hepatic metabolism using phenobarbital/beta-naphthoflavone induced hepatic S9
from male Sprague Dawley rats. Parent chemicals are incubated for 2 hours in the AIME assay prior to transfer
to the VM7Luc ERTA assay plate. The AIME metabolism assay is run simultaneously in negative (inactive) and
positive (active) modes to evaluate the activity of parent and metabolites, respectively. This endpoint evaluates
reporter induction in metabolism positive mode. Using a type of inducible reporter in VM7Luc cells, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to xenobiotic metabolism
and the gene ESR1. This assay endpoint, CCTE_Deisenroth_AIME_384WELL_LUC_Active, was analyzed in the
positive analysis fitting direction relative to 17beta-estradiol as the positive control in the ERTA assay. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the nuclear
receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The AIME method has been successfully applied to the VM7Luc estrogen receptor
transactivation assay (ERTA) to evaluate metabolism-dependent false positive and false negative target assay
effects for a set of ER active and inactive chemicals (Deisenroth 2020, Hopperstad 2022). It has also been
employed to screen a set of 768 chemicals derived from theToxCast library to provide a model for incorporating
hepatic metabolism into existing test guideline studies including experimental design components, reference
chemical consideration, assay performance criteria, and data analysis workflows.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: CCTE_Deisenroth_AIME_384WELL_LUC_Active was designed to measure changes to
bioluminescence signals produced from an enzymatic reaction involving the key substrate [Bright-Glo], Changes
are indicative of changes in transcriptional gene expression due to agonist activity regulated by the human
estrogen receptor 1 [GeneSymbokESRl | GenelD:2099 | Uniprot_SwissProt_Accession:P03372].

The Alginate Immobilization of Metabolic Enzymes (AIME) is a lid-based technique for retrofitting existing
bioassays with hepatic metabolic competence (Deisenroth 2020, Hopperstad 2022). The platform consists of
custom microplate peg lids with encapsulated hepatic S9-alginate microspheres attached to solid supports that
metabolize test compounds for 96- and 384-well based assays.

2.2	Scientific Principles: A potential limitation to the use of in vitro assay data in regulatory decision-making is the
lack of coverage for xenobiotic metabolic processes occurring within an organism. Indeed, hepatic- and
peripheral-tissue metabolism can yield metabolites that exhibit greater activity than the parent compound
(bioactivation) or are inactive (bioinactivation) for a given biological target. Interpretation of biological effect
data for chemicals screened in high-throughput screening (HTS) assays may benefit from incorporation of
xenobiotic metabolic competence to decrease uncertainty associated with predicting potential human health
hazards from in vitro assay data (Jacobs 2008, Jacobs 2013, OECD 2014). The Alginate Immobilization of
Metabolic Enzymes (AIME) platform consists of custom 96- or 384-well microplate lids containing solid supports
attached to encapsulated hepatic S9-alginate microspheres. The optimized platform uses rat or human hepatic
S9 to evaluate phase I xenobiotic metabolism while minimizing cytotoxic and assay interference effects.
Deployment to an estrogen receptor transactivation (ERTA) assay with a test set of compounds previously
identified by estrogen receptor quantitative structure-activity relationship (ER-QSAR) analysis to generate ER-
active metabolites6 demonstrated utility for identification of false positive and false negative target assay
effects, reprioritization of hazard based on metabolism-dependent bioactivity, and enhanced in vivo
concordance with the rodent uterotrophic bioassay.

2.3	Experimental System: adherent VM7Luc4E2 cell line used. The AIME platform consists of a custom polystyrene
lids with 96- or 384- solid support pegs designed to be compatible with all ANSI/SLAS standard microplates. Each
peg of the AIME lid contains a single microsphere of alginate hydrogel mixed with hepatic S9 fraction
(metabolism positive mode) or alginate hydrogel mixed with water (metabolism negative) and reach a standard
depth that provides sufficient distance to the well bottom. The method is compatible with biochemical and cell-
based target assays.

2.4	Metabolic Competence: Initial characterization of the AIME microsphere function focused on two key areas: 1)
the ability of small molecules to freely enter the microsphere and undergo metabolism by the cytochrome P450
enzymes, and 2) the duration of cytochrome P450 enzyme activity of the S9. To evaluate these features, a panel
of five reference substrates (phenacetin, dextromethorphan, bupropion, diclofenac, and chlorzoxazone), known
to be metabolized by human cytochrome P450 enzymes to specific and identifiable metabolites (Bjornsson
2003), were incubated with the AIME lids for 0, 2, 4, and 8 hours and analyzed by LC-MS/MS. The results
demonstrate that S9 functioned well within the alginate microsphere and formed detectable metabolites from
the probe substrates.

2.5	Exposure Regime: The AIME procedure consists of a preparative "pre-stamping" step using a barium
chloride/poly-L-lysine (PLL) solution followed by "stamping" the lid in alginate-hepatic S9 (metabolism positive)
or alginate-water (metabolism negative) to form microspheres. Once stamping is completed, the AIME lids are
placed in duplicate microplates (metabolism positive or negative) containing chemical solutions and incubated
for two hours to facilitate metabolic reactions. The conditioned medium, containing parent chemical or
metabolites, is then transferred to the cell-based VM7Luc Estrogen Receptor Transactivation Assay (ERTA) for
24 hours. A typical experimental design tests a 10-point concentration series (0.002 - 200 uM) for 8 compounds
(96-well format)l or 32 compounds (384-well format)2 in single technical replicate. AIME assay controls include
trans-stilbene (bioactivated control) and ethylparaben (bioinactivated control). ERTA assay plate controls
include 17-beta estradiol (positive control) and DMSO (solvent control). Experiments typically comprise a
minimum of four experimental replicates (n=4).

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.002 nM
Key positive control:

17beta-estradiol

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 1.476
Response cutoff threshold used to determine hit calls: 14.763
Detection technology used: multimode microplate reader (Luminescense)

2.6	Response: The ERTA assay evaluates binding and transactivation of an estrogen receptor (ER)-responsive
luciferase reporter to determine endogenous ER activity. AIME assay controls were selected for compatibility
with the ERTA assay and are intended to demonstrate metabolic-dependent shifts in estrogenic bioactivity. A
fluorescence-based cell viability reagent is used to measure treatment dependent cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Well-level data obtained as raw luminescence units for the ERTA luciferase assay are normalized
as percent activity of positive control. Well-level data obtained as raw fluorescence intensity units for the
cytotoxicity assay are normalized as log2 fold change to the DMSO control. Concentration-response modeling
of the 'percent activity' and 'log2 fold change' endpoints is used to derive quantitative potency and efficacy
values. Metabolism-dependent 'shifts' in bioactivity are evaluated by modeling the difference in concentration-
dependent effects.

NA

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

3: rmneg (Exclude wells with negative corrected response values (cval) and downgrading their well
quality (wllq); if cval < 0, wllq = 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal
to zero and downgrading their well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise maximum,
by assay plate ID (apid), of the medians of the corrected values (cval) forgain-of-signal single- or multiple-
concentration negative control wells (wilt = m or o) by apid, well type, and concentration.), 11:
bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 43: resp.pc.pval.cor (Calculate
the normalized response (resp) as a percent of control, i.e. the ratio of the difference between the
corrected (cval) and baseline (bval) values divided the positive control (pval) value multiplied by 100;
resp = (cval-bval)/pval*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

7: bmadlO (Add a cutoff value of 10 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and

-------
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 768

Active hit count: hitc>0.9
109

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: Oihitc 0.9
570

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	71

gain-loss (gnls) model:	127

power(pow) model:	43

linear-polynomial (polyl) model:	73

quadratic-polynomial(poly2) model:	73

exponential-2 (exp2) model:	6

exponential-3 (exp3) model:	0

exponential-4 (exp4) model:	303

exponential-5 (exp5) model:	67

Number of chemicals tested: 768

NA hit count: hitc^O
89

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3 Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside

-------
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1912.75

Neutral control median absolute deviation, by plate: nmad	191.255

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	10.87%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	12992.25

Positive control well median absolute deviation, by plate: pmad	1267.623

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	8.135

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 67.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Deisenroth C, DeGroot DE, Zurlinden T, Eicher A, McCord J, Lee MY, Carmichael P, Thomas RS. The
Alginate Immobilization of Metabolic Enzymes Platform Retrofits an Estrogen Receptor Transactivation Assay
With Metabolic Competence. Toxicol Sci. 2020 Dec 1;178(2):281-301. doi: 10.1093/toxsci/kfaal47. PMID:
32991717; PMCID: PMC8154005., Hopperstad, K., DeGroot, D. E., Zurlinden, T., Brinkman, C., Thomas, R. S., &
Deisenroth, C. (2022). Chemical Screening in an Estrogen Receptor Transactivation Assay With Metabolic
Competence. Toxicological sciences : an official journal of the Society of Toxicology, 187(1), 112-126.
https://doi.org/10.1093/toxsci/kfac019, Jacobs, M. N., Janssens, W., Bernauer, U., Brandon, E., Coecke, S.,
Combes, R., Edwards, P., Freidig, A., Freyberger, A., Kolanczyk, R., Mc Ardle, C., Mekenyan, O., Schmieder, P.,
Schrader, T., Takeyoshi, M., & van der Burg, B. (2008). The use of metabolising systems for in vitro testing of
endocrine disruptors. Current drug metabolism, 9(8), 796-826. https://doi.org/10.2174/138920008786049294,
Jacobs, M. N., Laws, S. C., Willett, K., Schmieder, P., Odum, J., & Bovee, T. F. (2013). In vitro metabolism and
bioavailability tests for endocrine active substances: what is needed next for regulatory purposes?. ALTEX, 30(3),

-------
331-351. https://doi.Org/10.14573/altex.2013.3.331, OECD (2014), Detailed Review Paper on the Use of
Metabolising Systems for In Vitro Testing of Endocrine Disruptors, OECD Series on Testing and Assessment, No.
97, OECD Publishing, Paris, https://doi.org/10.1787/9789264085497-en., Pinto, C. L., Mansouri, K., Judson, R.,
& Browne, P. (2016). Prediction of Estrogenic Bioactivity of Environmental Chemical Metabolites. Chemical
research in toxicology, 29(9), 1410-1427. https://doi.org/10.1021/acs.chemrestox.6b00079, Bjornsson, T. D.,
Callaghan, J. T., Einolf, H. J., Fischer, V., Gan, L., Grimm, S., Kao, J., King, S. P., Miwa, G., Ni, L, Kumar, G., McLeod,
J., Obach, R. S., Roberts, S., Roe, A., Shah, A., Snikeris, F., Sullivan, J. T., Tweedie, D., Vega, J. M., ... FDA Center
for Drug Evaluation and Research (CDER) (2003). The conduct of in vitro and in vivo drug-drug interaction
studies: a Pharmaceutical Research and Manufacturers of America (PhRMA) perspective. Drug metabolism and
disposition: the biological fate of chemicals, 31(7), 815-832. https://doi.Org/10.1124/dmd.31.7.815

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2489

CCTE_Deisen roth_AI M E_384WELL_CTox_Active

1.	General Information

1.1	Assay Title: Cytotoxicity Assessment (CTOX) in CCTE's 384-well Estrogen Receptor Transactivation Assay with
AIME metabolic competence (metabolism positive), Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_AIME_384WELL (AIME metabolism assay) is a cell-based, single-readout
assay that uses VM7Luc4E2, a human mammary gland/breast cell line, with measurements taken at 24 hour
after chemical dosing in a 384-well plate. CCTE_Deisenroth_AIME_384WELL_CTox_Active is one of four assay
components measured or calculated from the CCTE_Deisenroth_AIME_384WELL assay. It is designed to make
measurements of a protease substrate, a type of viabiliy indicator, as detected with substrate cleavage to a
fluorescent probe in living cells. CCTE_Deisenroth_AIME_384WELL_CTox_Active is a component of the
CCTE_Deisenroth_AIME_384WELL assay that measures the effect of retroftting the VM7Luc ERTA assay with
the Alginate Immobolization of Metabolic Enzymes (AIME) metabolism method. AIME is optimized to impart
phase I hepatic metabolism using phenobarbital/beta-naphthoflavone induced hepatic S9 from male Sprague
Dawley rats. Parent chemicals are incubated for 2 hours in the AIME assay prior to transfer to the VM7Luc ERTA
assay plate. The AIME metabolism assay is run simultaneously in negative (inactive) and positive (active) modes
to evaluate the activity of parent and metabolites, respectively. This endpoint evaluates cytotoxicity in
metabolism positive mode. Using a type of flourescent probe in VM7Luc cells, loss-of-signal activity can be used
to understand loss of viability. This assay endpoint, CCTE_Deisenroth_AIME_384WELL_CTox_Active, was
analyzed with bidirectional fitting in the ERTA assay.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The AIME method has been successfully applied to the VM7Luc estrogen receptor
transactivation assay (ERTA) to evaluate metabolism-dependent false positive and false negative target assay
effects for a set of ER active and inactive chemicals (Deisenroth 2020, Hopperstad 2022). It has also been
employed to screen a set of 768 chemicals derived from theToxCast library to provide a model for incorporating
hepatic metabolism into existing test guideline studies including experimental design components, reference
chemical consideration, assay performance criteria, and data analysis workflows.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed,

l.llAbbreviations:

AIC: Akaike Information Criterion
AOP: Adverse Outcome Pathway
CV: Coefficient of Variation
DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: CCTE_Deisenroth_AIME_384WELL_CTox_Active is designed to make measurements of a protease
substrate [Gly-Phe-AFCoumarin], a type of viabiliy indicator. The non-flourescent substrate is enzymatically
converted to a fluorescent probe in living cells. Changes in the signal are indicative of cytotoxicity.

and data are publicly available in ToxCast s invitroDB.

ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package
SSMD: Strictly Standardized Mean Difference

-------
The Alginate Immobilization of Metabolic Enzymes (AIME) is a lid-based technique for retrofitting existing
bioassays with hepatic metabolic competence (Deisenroth 2020, Hopperstad 2022). The platform consists of
custom microplate peg lids with encapsulated hepatic S9-alginate microspheres attached to solid supports that
metabolize test compounds for 96- and 384-well based assays.

2.2	Scientific Principles: A potential limitation to the use of in vitro assay data in regulatory decision-making is the
lack of coverage for xenobiotic metabolic processes occurring within an organism. Indeed, hepatic- and
peripheral-tissue metabolism can yield metabolites that exhibit greater activity than the parent compound
(bioactivation) or are inactive (bioinactivation) for a given biological target. Interpretation of biological effect
data for chemicals screened in high-throughput screening (HTS) assays may benefit from incorporation of
xenobiotic metabolic competence to decrease uncertainty associated with predicting potential human health
hazards from in vitro assay data (Jacobs 2008, Jacobs 2013, OECD 2014). The Alginate Immobilization of
Metabolic Enzymes (AIME) platform consists of custom 96- or 384-well microplate lids containing solid supports
attached to encapsulated hepatic S9-alginate microspheres. The optimized platform uses rat or human hepatic
S9 to evaluate phase I xenobiotic metabolism while minimizing cytotoxic and assay interference effects.
Deployment to an estrogen receptor transactivation (ERTA) assay with a test set of compounds previously
identified by estrogen receptor quantitative structure-activity relationship (ER-QSAR) analysis to generate ER-
active metabolites6 demonstrated utility for identification of false positive and false negative target assay
effects, reprioritization of hazard based on metabolism-dependent bioactivity, and enhanced in vivo
concordance with the rodent uterotrophic bioassay.

2.3	Experimental System: adherent VM7Luc4E2 cell line used. The AIME platform consists of a custom polystyrene
lids with 96- or 384- solid support pegs designed to be compatible with all ANSI/SLAS standard microplates. Each
peg of the AIME lid contains a single microsphere of alginate hydrogel mixed with hepatic S9 fraction
(metabolism positive mode) or alginate hydrogel mixed with water (metabolism negative) and reach a standard
depth that provides sufficient distance to the well bottom. The method is compatible with biochemical and cell-
based target assays.

2.4	Metabolic Competence: Initial characterization of the AIME microsphere function focused on two key areas: 1)
the ability of small molecules to freely enter the microsphere and undergo metabolism by the cytochrome P450
enzymes, and 2) the duration of cytochrome P450 enzyme activity of the S9. To evaluate these features, a panel
of five reference substrates (phenacetin, dextromethorphan, bupropion, diclofenac, and chlorzoxazone), known
to be metabolized by human cytochrome P450 enzymes to specific and identifiable metabolites (Bjornsson
2003), were incubated with the AIME lids for 0, 2, 4, and 8 hours and analyzed by LC-MS/MS. The results
demonstrate that S9 functioned well within the alginate microsphere and formed detectable metabolites from
the probe substrates.

2.5	Exposure Regime: The AIME procedure consists of a preparative "pre-stamping" step using a barium
chloride/poly-L-lysine (PLL) solution followed by "stamping" the lid in alginate-hepatic S9 (metabolism positive)
or alginate-water (metabolism negative) to form microspheres. Once stamping is completed, the AIME lids are
placed in duplicate microplates (metabolism positive or negative) containing chemical solutions and incubated
for two hours to facilitate metabolic reactions. The conditioned medium, containing parent chemical or
metabolites, is then transferred to the cell-based VM7Luc Estrogen Receptor Transactivation Assay (ERTA) for
24 hours. A typical experimental design tests a 10-point concentration series (0.002 - 200 uM) for 8 compounds
(96-well format) (Deisenroth 2020) or 32 compounds (384-well format) (Hopperstad 2022) in single technical
replicate. AIME assay controls include trans-stilbene (bioactivated control) and ethylparaben (bioinactivated
control). ERTA assay plate controls include 17-beta estradiol (positive control) and DMSO (solvent control).
Experiments typically comprise a minimum of four experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

-------
0.002 nM
Key positive control:
NA

200 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.14
Response cutoff threshold used to determine hit calls: 0.421
Detection technology used: multimode microplate reader (Fluorescence)

2.6	Response: The ERTA assay evaluates binding and transactivation of an estrogen receptor (ER)-responsive
luciferase reporter to determine endogenous ER activity. AIME assay controls were selected for compatibility
with the ERTA assay and are intended to demonstrate metabolic-dependent shifts in estrogenic bioactivity. A
fluorescence-based cell viability reagent is used to measure treatment dependent cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Well-level data obtained as raw luminescence units for the ERTA luciferase assay are normalized
as percent activity of positive control. Well-level data obtained as raw fluorescence intensity units for the
cytotoxicity assay are normalized as log2 fold change to the DMSO control. Concentration-response modeling
of the 'percent activity' and 'log2 fold change' endpoints is used to derive quantitative potency and efficacy
values. Metabolism-dependent 'shifts' in bioactivity are evaluated by modeling the difference in concentration-
dependent effects.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

NA

-------
Level 2: Component-specific corrections include:

3: rmneg (Exclude wells with negative corrected response values (cval) and downgrading their well
quality (wllq); if cval < 0, wllq = 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal
to zero and downgrading their well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay
plate ID (apid), of the corrected values (cval) for neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 768

Number of chemicals tested: 768

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
89

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	45289.25

Neutral control median absolute deviation, by plate: nmad	4503.027

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.98%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-847

Positive control well median absolute deviation, by plate: pmad	3692.415

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-7.808

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 25.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Deisenroth C, DeGroot DE, Zurlinden T, Eicher A, McCord J, Lee MY, Carmichael P, Thomas RS. The
Alginate Immobilization of Metabolic Enzymes Platform Retrofits an Estrogen Receptor Transactivation Assay
With Metabolic Competence. Toxicol Sci. 2020 Dec 1;178(2):281-301. doi: 10.1093/toxsci/kfaal47. PMID:
32991717; PMCID: PMC8154005., Hopperstad, K., DeGroot, D. E., Zurlinden, T., Brinkman, C., Thomas, R. S., &
Deisenroth, C. (2022). Chemical Screening in an Estrogen Receptor Transactivation Assay With Metabolic
Competence. Toxicological sciences : an official journal of the Society of Toxicology, 187(1), 112-126.
https://doi.org/10.1093/toxsci/kfac019, Jacobs, M. N., Janssens, W., Bernauer, U., Brandon, E., Coecke, S.,
Combes, R., Edwards, P., Freidig, A., Freyberger, A., Kolanczyk, R., Mc Ardle, C., Mekenyan, O., Schmieder, P.,
Schrader, T., Takeyoshi, M., & van der Burg, B. (2008). The use of metabolising systems for in vitro testing of
endocrine disruptors. Current drug metabolism, 9(8), 796-826. https://doi.org/10.2174/138920008786049294,
Jacobs, M. N., Laws, S. C., Willett, K., Schmieder, P., Odum, J., & Bovee, T. F. (2013). In vitro metabolism and
bioavailability tests for endocrine active substances: what is needed next for regulatory purposes?. ALTEX, 30(3),
331-351. https://doi.Org/10.14573/altex.2013.3.331, OECD (2014), Detailed Review Paper on the Use of
Metabolising Systems for In Vitro Testing of Endocrine Disruptors, OECD Series on Testing and Assessment, No.
97, OECD Publishing, Paris, https://doi.org/10.1787/9789264085497-en., Pinto, C. L., Mansouri, K., Judson, R.,
& Browne, P. (2016). Prediction of Estrogenic Bioactivity of Environmental Chemical Metabolites. Chemical
research in toxicology, 29(9), 1410-1427. https://doi.org/10.1021/acs.chemrestox.6b00079, Bjornsson, T. D.,
Callaghan, J. T., Einolf, H. J., Fischer, V., Gan, L., Grimm, S., Kao, J., King, S. P., Miwa, G., Ni, L, Kumar, G., McLeod,
J., Obach, R. S., Roberts, S., Roe, A., Shah, A., Snikeris, F., Sullivan, J. T., Tweedie, D., Vega, J. M.,... FDA Center
for Drug Evaluation and Research (CDER) (2003). The conduct of in vitro and in vivo drug-drug interaction

-------
studies: a Pharmaceutical Research and Manufacturers of America (PhRMA) perspective. Drug metabolism and
disposition: the biological fate of chemicals, 31(7), 815-832. https://doi.Org/10.1124/dmd.31.7.815

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2490

CCTE_Deisenroth_AIME_384WELL_LUC_lnactive

1. General Information

1.1	Assay Title: CCTE's 384-well Estrogen Receptor Transactivation Assay without AIME metabolic competence
(ERTA luciferase; metabolism negative), Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_AIME_384WELL (AIME metabolism assay) is a cell-based, single-readout
assay that uses VM7Luc4E2, a human mammary gland/breast cell line, with measurements taken at 24 hour
after chemical dosing in a 384-well plate. CCTE_Deisenroth_AIME_384WELL_LUC_lnactive is one of four assay
components measured or calculated from the CCTE_Deisenroth_AIME_384WELL assay. It is designed to make
measurements of luciferase induction, a form of inducible reporter, as detected with bioluminescence signals
by luciferase-coupled ATP quantitation technology. CCTE_Deisenroth_AIME_384WELL_LUC_lnactive is a
component of the CCTE_Deisenroth_AIME_384WELL assay that measures the effect of retroftting the VM7Luc
ERTA assay with the Alginate Immobolization of Metabolic Enzymes (AIME) metabolism method. AIME is
optimized to impart phase I hepatic metabolism using phenobarbital/beta-naphthoflavone induced hepatic S9
from male Sprague Dawley rats. Parent chemicals are incubated for 2 hours in the AIME assay prior to transfer
to the VM7Luc ERTA assay plate. The AIME metabolism assay is run simultaneously in negative (inactive) and
positive (active) modes to evaluate the activity of parent and metabolites, respectively. This endpoint evaluates
reporter induction in metabolism negative mode. Using a type of inducible reporter in VM7Luc cells, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to xenobiotic metabolism
and the gene ESR1. This assay endpoint, CCTE_Deisenroth_AIME_384WELL_LUC_lnactive, was analyzed in the
positive analysis fitting direction relative to 17beta-estradiol as the positive control in the ERTA assay. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the nuclear
receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The AIME method has been successfully applied to the VM7Luc estrogen receptor
transactivation assay (ERTA) to evaluate metabolism-dependent false positive and false negative target assay
effects for a set of ER active and inactive chemicals (Deisenroth 2020, Hopperstad 2022). It has also been
employed to screen a set of 768 chemicals derived from theToxCast library to provide a model for incorporating
hepatic metabolism into existing test guideline studies including experimental design components, reference
chemical consideration, assay performance criteria, and data analysis workflows.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: CCTE_Deisenroth_AIME_384WELL_LUC_lnactive was designed to measure changes to
bioluminescence signals produced from an enzymatic reaction involving the key substrate [Bright-Glo], Changes
are indicative of changes in transcriptional gene expression due to agonist activity regulated by the human
estrogen receptor 1 [GeneSymbokESRl | GenelD:2099 | Uniprot_SwissProt_Accession:P03372].

The Alginate Immobilization of Metabolic Enzymes (AIME) is a lid-based technique for retrofitting existing
bioassays with hepatic metabolic competence (Deisenroth 2020, Hopperstad 2022). The platform consists of
custom microplate peg lids with encapsulated hepatic S9-alginate microspheres attached to solid supports that
metabolize test compounds for 96- and 384-well based assays.

2.2	Scientific Principles: A potential limitation to the use of in vitro assay data in regulatory decision-making is the
lack of coverage for xenobiotic metabolic processes occurring within an organism. Indeed, hepatic- and
peripheral-tissue metabolism can yield metabolites that exhibit greater activity than the parent compound
(bioactivation) or are inactive (bioinactivation) for a given biological target. Interpretation of biological effect
data for chemicals screened in high-throughput screening (HTS) assays may benefit from incorporation of
xenobiotic metabolic competence to decrease uncertainty associated with predicting potential human health
hazards from in vitro assay data (Jacobs 2008, Jacobs 2013, OECD 2014). The Alginate Immobilization of
Metabolic Enzymes (AIME) platform consists of custom 96- or 384-well microplate lids containing solid supports
attached to encapsulated hepatic S9-alginate microspheres. The optimized platform uses rat or human hepatic
S9 to evaluate phase I xenobiotic metabolism while minimizing cytotoxic and assay interference effects.
Deployment to an estrogen receptor transactivation (ERTA) assay with a test set of compounds previously
identified by estrogen receptor quantitative structure-activity relationship (ER-QSAR) analysis to generate ER-
active metabolites6 demonstrated utility for identification of false positive and false negative target assay
effects, reprioritization of hazard based on metabolism-dependent bioactivity, and enhanced in vivo
concordance with the rodent uterotrophic bioassay.

2.3	Experimental System: adherent VM7Luc4E2 cell line used. The AIME platform consists of a custom polystyrene
lids with 96- or 384- solid support pegs designed to be compatible with all ANSI/SLAS standard microplates. Each
peg of the AIME lid contains a single microsphere of alginate hydrogel mixed with hepatic S9 fraction
(metabolism positive mode) or alginate hydrogel mixed with water (metabolism negative) and reach a standard
depth that provides sufficient distance to the well bottom. The method is compatible with biochemical and cell-
based target assays.

2.4	Metabolic Competence: Initial characterization of the AIME microsphere function focused on two key areas: 1)
the ability of small molecules to freely enter the microsphere and undergo metabolism by the cytochrome P450
enzymes, and 2) the duration of cytochrome P450 enzyme activity of the S9. To evaluate these features, a panel
of five reference substrates (phenacetin, dextromethorphan, bupropion, diclofenac, and chlorzoxazone), known
to be metabolized by human cytochrome P450 enzymes to specific and identifiable metabolites (Bjornsson
2003), were incubated with the AIME lids for 0, 2, 4, and 8 hours and analyzed by LC-MS/MS. The results
demonstrate that S9 functioned well within the alginate microsphere and formed detectable metabolites from
the probe substrates.

2.5	Exposure Regime: The AIME procedure consists of a preparative "pre-stamping" step using a barium
chloride/poly-L-lysine (PLL) solution followed by "stamping" the lid in alginate-hepatic S9 (metabolism positive)
or alginate-water (metabolism negative) to form microspheres. Once stamping is completed, the AIME lids are
placed in duplicate microplates (metabolism positive or negative) containing chemical solutions and incubated
for two hours to facilitate metabolic reactions. The conditioned medium, containing parent chemical or
metabolites, is then transferred to the cell-based VM7Luc Estrogen Receptor Transactivation Assay (ERTA) for
24 hours. A typical experimental design tests a 10-point concentration series (0.002 - 200 uM) for 8 compounds
(96-well format)l or 32 compounds (384-well format)2 in single technical replicate. AIME assay controls include
trans-stilbene (bioactivated control) and ethylparaben (bioinactivated control). ERTA assay plate controls
include 17-beta estradiol (positive control) and DMSO (solvent control). Experiments typically comprise a
minimum of four experimental replicates (n=4).

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.002 nM
Key positive control:

17beta-estradiol

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 2.227
Response cutoff threshold used to determine hit calls: 22.274
Detection technology used: multimode microplate reader (Luminescense)

2.6	Response: The ERTA assay evaluates binding and transactivation of an estrogen receptor (ER)-responsive
luciferase reporter to determine endogenous ER activity. AIME assay controls were selected for compatibility
with the ERTA assay and are intended to demonstrate metabolic-dependent shifts in estrogenic bioactivity. A
fluorescence-based cell viability reagent is used to measure treatment dependent cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Well-level data obtained as raw luminescence units for the ERTA luciferase assay are normalized
as percent activity of positive control. Well-level data obtained as raw fluorescence intensity units for the
cytotoxicity assay are normalized as log2 fold change to the DMSO control. Concentration-response modeling
of the 'percent activity' and 'log2 fold change' endpoints is used to derive quantitative potency and efficacy
values. Metabolism-dependent 'shifts' in bioactivity are evaluated by modeling the difference in concentration-
dependent effects.

NA

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

3: rmneg (Exclude wells with negative corrected response values (cval) and downgrading their well
quality (wllq); if cval < 0, wllq = 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal
to zero and downgrading their well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise maximum,
by assay plate ID (apid), of the medians of the corrected values (cval) forgain-of-signal single- or multiple-
concentration negative control wells (wilt = m or o) by apid, well type, and concentration.), 11:
bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 43: resp.pc.pval.cor (Calculate
the normalized response (resp) as a percent of control, i.e. the ratio of the difference between the
corrected (cval) and baseline (bval) values divided the positive control (pval) value multiplied by 100;
resp = (cval-bval)/pval*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

7: bmadlO (Add a cutoff value of 10 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.), 28:
ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis
direction. Typically used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and

-------
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 768

Active hit count: hitc>0.9
104

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: Oihitc 0.9
581

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	94

gain-loss (gnls) model:	147

power(pow) model:	69

linear-polynomial (polyl) model:	77

quadratic-polynomial(poly2) model:	71

exponential-2 (exp2) model:	6

exponential-3 (exp3) model:	6

exponential-4 (exp4) model:	213

exponential-5 (exp5) model:	80

Number of chemicals tested: 768

NA hit count: hitc^O
S3

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3 Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside

-------
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	2667.5

Neutral control median absolute deviation, by plate: nmad	316.906

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.86%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	13077.25

Positive control well median absolute deviation, by plate: pmad	1190.528

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	8.3

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 80.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Deisenroth C, DeGroot DE, Zurlinden T, Eicher A, McCord J, Lee MY, Carmichael P, Thomas RS. The
Alginate Immobilization of Metabolic Enzymes Platform Retrofits an Estrogen Receptor Transactivation Assay
With Metabolic Competence. Toxicol Sci. 2020 Dec 1;178(2):281-301. doi: 10.1093/toxsci/kfaal47. PMID:
32991717; PMCID: PMC8154005., Hopperstad, K., DeGroot, D. E., Zurlinden, T., Brinkman, C., Thomas, R. S., &
Deisenroth, C. (2022). Chemical Screening in an Estrogen Receptor Transactivation Assay With Metabolic
Competence. Toxicological sciences : an official journal of the Society of Toxicology, 187(1), 112-126.
https://doi.org/10.1093/toxsci/kfac019, Jacobs, M. N., Janssens, W., Bernauer, U., Brandon, E., Coecke, S.,
Combes, R., Edwards, P., Freidig, A., Freyberger, A., Kolanczyk, R., Mc Ardle, C., Mekenyan, O., Schmieder, P.,
Schrader, T., Takeyoshi, M., & van der Burg, B. (2008). The use of metabolising systems for in vitro testing of
endocrine disruptors. Current drug metabolism, 9(8), 796-826. https://doi.org/10.2174/138920008786049294,
Jacobs, M. N., Laws, S. C., Willett, K., Schmieder, P., Odum, J., & Bovee, T. F. (2013). In vitro metabolism and
bioavailability tests for endocrine active substances: what is needed next for regulatory purposes?. ALTEX, 30(3),

-------
331-351. https://doi.Org/10.14573/altex.2013.3.331, OECD (2014), Detailed Review Paper on the Use of
Metabolising Systems for In Vitro Testing of Endocrine Disruptors, OECD Series on Testing and Assessment, No.
97, OECD Publishing, Paris, https://doi.org/10.1787/9789264085497-en., Pinto, C. L., Mansouri, K., Judson, R.,
& Browne, P. (2016). Prediction of Estrogenic Bioactivity of Environmental Chemical Metabolites. Chemical
research in toxicology, 29(9), 1410-1427. https://doi.org/10.1021/acs.chemrestox.6b00079, Bjornsson, T. D.,
Callaghan, J. T., Einolf, H. J., Fischer, V., Gan, L., Grimm, S., Kao, J., King, S. P., Miwa, G., Ni, L, Kumar, G., McLeod,
J., Obach, R. S., Roberts, S., Roe, A., Shah, A., Snikeris, F., Sullivan, J. T., Tweedie, D., Vega, J. M., ... FDA Center
for Drug Evaluation and Research (CDER) (2003). The conduct of in vitro and in vivo drug-drug interaction
studies: a Pharmaceutical Research and Manufacturers of America (PhRMA) perspective. Drug metabolism and
disposition: the biological fate of chemicals, 31(7), 815-832. https://doi.Org/10.1124/dmd.31.7.815

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2491

CCTE_Deisen roth_AI M E_384WELL_CTox_l nactive

1.	General Information

1.1	Assay Title: Cytotoxicity Assessment (CTOX) in CCTE's 384-well Estrogen Receptor Transactivation Assay without
AIME metabolic competence (metabolism negative), Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_AIME_384WELL (AIME metabolism assay) is a cell-based, single-readout
assay that uses VM7Luc4E2, a human mammary gland/breast cell line, with measurements taken at 24 hour
after chemical dosing in a 384-well plate. CCTE_Deisenroth_AIME_384WELL_CTox_lnactive is one of four assay
components measured or calculated from the CCTE_Deisenroth_AIME_384WELL assay. It is designed to make
measurements of a protease substrate, a type of viabiliy indicator, as detected with substrate cleavage to a
fluorescent probe in living cells. CCTE_Deisenroth_AIME_384WELL_CTox_lnactive is a component of the
CCTE_Deisenroth_AIME_384WELL assay that measures the effect of retroftting the VM7Luc ERTA assay with
the Alginate Immobolization of Metabolic Enzymes (AIME) metabolism method. AIME is optimized to impart
phase I hepatic metabolism using phenobarbital/beta-naphthoflavone induced hepatic S9 from male Sprague
Dawley rats. Parent chemicals are incubated for 2 hours in the AIME assay prior to transfer to the VM7Luc ERTA
assay plate. The AIME metabolism assay is run simultaneously in negative (inactive) and positive (active) modes
to evaluate the activity of parent and metabolites, respectively. This endpoint evaluates cytotoxicity in
metabolism negative mode. Using a type of flourescent probe in VM7Luc cells, loss-of-signal activity can be used
to understand loss of viability. This assay endpoint, CCTE_Deisenroth_AIME_384WELL_CTox_lnactive, was
analyzed with bidirectional fitting in the ERTA assay.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The AIME method has been successfully applied to the VM7Luc estrogen receptor
transactivation assay (ERTA) to evaluate metabolism-dependent false positive and false negative target assay
effects for a set of ER active and inactive chemicals (Deisenroth 2020, Hopperstad 2022). It has also been
employed to screen a set of 768 chemicals derived from theToxCast library to provide a model for incorporating
hepatic metabolism into existing test guideline studies including experimental design components, reference
chemical consideration, assay performance criteria, and data analysis workflows.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed,

l.llAbbreviations:

AIC: Akaike Information Criterion
AOP: Adverse Outcome Pathway
CV: Coefficient of Variation
DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: CCTE_Deisenroth_AIME_384WELL_CTox_lnactive is designed to make measurements of a
protease substrate [Gly-Phe-AFCoumarin], a type of viabiliy indicator. The non-flourescent substrate is
enzymatically converted to a fluorescent probe in living cells. Changes in the signal are indicative of cytotoxicity.

and data are publicly available in ToxCast s invitroDB.

ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package
SSMD: Strictly Standardized Mean Difference

-------
The Alginate Immobilization of Metabolic Enzymes (AIME) is a lid-based technique for retrofitting existing
bioassays with hepatic metabolic competence (Deisenroth 2020, Hopperstad 2022). The platform consists of
custom microplate peg lids with encapsulated hepatic S9-alginate microspheres attached to solid supports that
metabolize test compounds for 96- and 384-well based assays.

2.2	Scientific Principles: A potential limitation to the use of in vitro assay data in regulatory decision-making is the
lack of coverage for xenobiotic metabolic processes occurring within an organism. Indeed, hepatic- and
peripheral-tissue metabolism can yield metabolites that exhibit greater activity than the parent compound
(bioactivation) or are inactive (bioinactivation) for a given biological target. Interpretation of biological effect
data for chemicals screened in high-throughput screening (HTS) assays may benefit from incorporation of
xenobiotic metabolic competence to decrease uncertainty associated with predicting potential human health
hazards from in vitro assay data (Jacobs 2008, Jacobs 2013, OECD 2014). The Alginate Immobilization of
Metabolic Enzymes (AIME) platform consists of custom 96- or 384-well microplate lids containing solid supports
attached to encapsulated hepatic S9-alginate microspheres. The optimized platform uses rat or human hepatic
S9 to evaluate phase I xenobiotic metabolism while minimizing cytotoxic and assay interference effects.
Deployment to an estrogen receptor transactivation (ERTA) assay with a test set of compounds previously
identified by estrogen receptor quantitative structure-activity relationship (ER-QSAR) analysis to generate ER-
active metabolites6 demonstrated utility for identification of false positive and false negative target assay
effects, reprioritization of hazard based on metabolism-dependent bioactivity, and enhanced in vivo
concordance with the rodent uterotrophic bioassay.

2.3	Experimental System: adherent VM7Luc4E2 cell line used. The AIME platform consists of a custom polystyrene
lids with 96- or 384- solid support pegs designed to be compatible with all ANSI/SLAS standard microplates. Each
peg of the AIME lid contains a single microsphere of alginate hydrogel mixed with hepatic S9 fraction
(metabolism positive mode) or alginate hydrogel mixed with water (metabolism negative) and reach a standard
depth that provides sufficient distance to the well bottom. The method is compatible with biochemical and cell-
based target assays.

2.4	Metabolic Competence: Initial characterization of the AIME microsphere function focused on two key areas: 1)
the ability of small molecules to freely enter the microsphere and undergo metabolism by the cytochrome P450
enzymes, and 2) the duration of cytochrome P450 enzyme activity of the S9. To evaluate these features, a panel
of five reference substrates (phenacetin, dextromethorphan, bupropion, diclofenac, and chlorzoxazone), known
to be metabolized by human cytochrome P450 enzymes to specific and identifiable metabolites (Bjornsson
2003), were incubated with the AIME lids for 0, 2, 4, and 8 hours and analyzed by LC-MS/MS. The results
demonstrate that S9 functioned well within the alginate microsphere and formed detectable metabolites from
the probe substrates.

2.5	Exposure Regime: The AIME procedure consists of a preparative "pre-stamping" step using a barium
chloride/poly-L-lysine (PLL) solution followed by "stamping" the lid in alginate-hepatic S9 (metabolism positive)
or alginate-water (metabolism negative) to form microspheres. Once stamping is completed, the AIME lids are
placed in duplicate microplates (metabolism positive or negative) containing chemical solutions and incubated
for two hours to facilitate metabolic reactions. The conditioned medium, containing parent chemical or
metabolites, is then transferred to the cell-based VM7Luc Estrogen Receptor Transactivation Assay (ERTA) for
24 hours. A typical experimental design tests a 10-point concentration series (0.002 - 200 uM) for 8 compounds
(96-well format)l or 32 compounds (384-well format)2 in single technical replicate. AIME assay controls include
trans-stilbene (bioactivated control) and ethylparaben (bioinactivated control). ERTA assay plate controls
include 17-beta estradiol (positive control) and DMSO (solvent control). Experiments typically comprise a
minimum of four experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

-------
0.002 nM
Key positive control:
NA

200 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.122
Response cutoff threshold used to determine hit calls: 0.366
Detection technology used: multimode microplate reader (Fluorescence)

2.6	Response: The ERTA assay evaluates binding and transactivation of an estrogen receptor (ER)-responsive
luciferase reporter to determine endogenous ER activity. AIME assay controls were selected for compatibility
with the ERTA assay and are intended to demonstrate metabolic-dependent shifts in estrogenic bioactivity. A
fluorescence-based cell viability reagent is used to measure treatment dependent cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of protease.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Well-level data obtained as raw luminescence units for the ERTA luciferase assay are normalized
as percent activity of positive control. Well-level data obtained as raw fluorescence intensity units for the
cytotoxicity assay are normalized as log2 fold change to the DMSO control. Concentration-response modeling
of the 'percent activity' and 'log2 fold change' endpoints is used to derive quantitative potency and efficacy
values. Metabolism-dependent 'shifts' in bioactivity are evaluated by modeling the difference in concentration-
dependent effects.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

NA

-------
Level 2: Component-specific corrections include:

3: rmneg (Exclude wells with negative corrected response values (cval) and downgrading their well
quality (wllq); if cval < 0, wllq = 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal
to zero and downgrading their well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay
plate ID (apid), of the corrected values (cval) for neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 768

Number of chemicals tested: 768

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
104

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	39236

Neutral control median absolute deviation, by plate: nmad	3339.186

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	8.01%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-3313.75

Positive control well median absolute deviation, by plate: pmad	3099.375

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-9.024

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 30.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Deisenroth C, DeGroot DE, Zurlinden T, Eicher A, McCord J, Lee MY, Carmichael P, Thomas RS. The
Alginate Immobilization of Metabolic Enzymes Platform Retrofits an Estrogen Receptor Transactivation Assay
With Metabolic Competence. Toxicol Sci. 2020 Dec 1;178(2):281-301. doi: 10.1093/toxsci/kfaal47. PMID:
32991717; PMCID: PMC8154005., Hopperstad, K., DeGroot, D. E., Zurlinden, T., Brinkman, C., Thomas, R. S., &
Deisenroth, C. (2022). Chemical Screening in an Estrogen Receptor Transactivation Assay With Metabolic
Competence. Toxicological sciences : an official journal of the Society of Toxicology, 187(1), 112-126.
https://doi.org/10.1093/toxsci/kfac019, Jacobs, M. N., Janssens, W., Bernauer, U., Brandon, E., Coecke, S.,
Combes, R., Edwards, P., Freidig, A., Freyberger, A., Kolanczyk, R., Mc Ardle, C., Mekenyan, O., Schmieder, P.,
Schrader, T., Takeyoshi, M., & van der Burg, B. (2008). The use of metabolising systems for in vitro testing of
endocrine disruptors. Current drug metabolism, 9(8), 796-826. https://doi.org/10.2174/138920008786049294,
Jacobs, M. N., Laws, S. C., Willett, K., Schmieder, P., Odum, J., & Bovee, T. F. (2013). In vitro metabolism and
bioavailability tests for endocrine active substances: what is needed next for regulatory purposes?. ALTEX, 30(3),
331-351. https://doi.Org/10.14573/altex.2013.3.331, OECD (2014), Detailed Review Paper on the Use of
Metabolising Systems for In Vitro Testing of Endocrine Disruptors, OECD Series on Testing and Assessment, No.
97, OECD Publishing, Paris, https://doi.org/10.1787/9789264085497-en., Pinto, C. L., Mansouri, K., Judson, R.,
& Browne, P. (2016). Prediction of Estrogenic Bioactivity of Environmental Chemical Metabolites. Chemical
research in toxicology, 29(9), 1410-1427. https://doi.org/10.1021/acs.chemrestox.6b00079, Bjornsson, T. D.,
Callaghan, J. T., Einolf, H. J., Fischer, V., Gan, L., Grimm, S., Kao, J., King, S. P., Miwa, G., Ni, L, Kumar, G., McLeod,
J., Obach, R. S., Roberts, S., Roe, A., Shah, A., Snikeris, F., Sullivan, J. T., Tweedie, D., Vega, J. M.,... FDA Center
for Drug Evaluation and Research (CDER) (2003). The conduct of in vitro and in vivo drug-drug interaction

-------
studies: a Pharmaceutical Research and Manufacturers of America (PhRMA) perspective. Drug metabolism and
disposition: the biological fate of chemicals, 31(7), 815-832. https://doi.Org/10.1124/dmd.31.7.815

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2492

CCTE_Deisenroth_AIME_384WELL_LUC_Shift

1. General Information

1.1	Assay Title: CCTE's 384-well Estrogen Receptor Transactivation Assay with AIME metabolic competence data
shift (ERTA luciferase; difference in metabolism positive and negative), Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_AIME_384WELL (AIME metabolism assay) is a cell-based, single-readout
assay that uses VM7Luc4E2, a human mammary gland/breast cell line, with measurements taken at 24 hour
after chemical dosing in a 384-well plate. CCTE_Deisenroth_AIME_384WELL_LUC_Shift is one of two analysis
components calculated from the CCTE_Deisenroth_AIME_384WELL assay. It is designed to evaluate the shift in
metabolism-dependent changes of the inducible luciferase reporter between positive (active) and negative
(inactive) assay modes. CCTE_Deisenroth_AIME_384WELL_LUC_Shift is a component of the
CCTE_Deisenroth_AIME_384WELL assay that measures the effect of retroftting the VM7Luc ERTA assay with
the Alginate Immobolization of Metabolic Enzymes (AIME) metabolism method. AIME is optimized to impart
phase I hepatic metabolism using phenobarbital/beta-naphthoflavone induced hepatic S9 from male Sprague
Dawley rats. Parent chemicals are incubated for 2 hours in the AIME assay prior to transfer to the VM7Luc ERTA
assay plate. The AIME metabolism assay is run simultaneously in negative (inactive) and positive (active) modes
to evaluate the activity of parent and metabolites, respectively.
This endpoint evaluates the shift in metabolism-dependent changes between positive (active) and negative
(inactive) assay modes. The metabolic curve shift response is defined as the pairwise difference between the
positive (AEID 2488) and negative (AEID 2490) normalized response values at each concentration for a given
chemical across replicate experiments. The pairwise differences form the basis for a concentration-response
curve. Standard curve fit analysis identfiied chemicals where the calculated shift was greater than the shift-
derived baseline cutoff. A positive hit designation in this endpoint indicates a significant shift trend in the
positive direction for bioactivation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The AIME method has been successfully applied to the VM7Luc estrogen receptor
transactivation assay (ERTA) to evaluate metabolism-dependent false positive and false negative target assay
effects for a set of ER active and inactive chemicals (Deisenroth 2020, Hopperstad 2022). It has also been
employed to screen a set of 768 chemicals derived from theToxCast library to provide a model for incorporating
hepatic metabolism into existing test guideline studies including experimental design components, reference
chemical consideration, assay performance criteria, and data analysis workflows.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: CCTE_Deisenroth_AIME_384WELL_LUC_Shift was designed to measure the shift in
bioluminescence signals produced from an enzymatic reaction involving the key substrate [Bright-Glo]between
metabolism positive (active) and negative (inactive) assay modes. Shifts are indicative of changes in
transcriptional gene expression due to agonist activity regulated by the human estrogen receptor 1
[GeneSymbokESRl | GenelD:2099 | Uniprot_SwissProt_Accession:P03372]. Positive shifts are indicative of
bioactivation.

The Alginate Immobilization of Metabolic Enzymes (AIME) is a lid-based technique for retrofitting existing
bioassays with hepatic metabolic competence (Deisenroth 2020, Hopperstad 2022). The platform consists of
custom microplate peg lids with encapsulated hepatic S9-alginate microspheres attached to solid supports that
metabolize test compounds for 96- and 384-well based assays.

2.2	Scientific Principles: A potential limitation to the use of in vitro assay data in regulatory decision-making is the
lack of coverage for xenobiotic metabolic processes occurring within an organism. Indeed, hepatic- and
peripheral-tissue metabolism can yield metabolites that exhibit greater activity than the parent compound
(bioactivation) or are inactive (bioinactivation) for a given biological target. Interpretation of biological effect
data for chemicals screened in high-throughput screening (HTS) assays may benefit from incorporation of
xenobiotic metabolic competence to decrease uncertainty associated with predicting potential human health
hazards from in vitro assay data (Jacobs 2008, Jacobs 2013, OECD 2014). The Alginate Immobilization of
Metabolic Enzymes (AIME) platform consists of custom 96- or 384-well microplate lids containing solid supports
attached to encapsulated hepatic S9-alginate microspheres. The optimized platform uses rat or human hepatic
S9 to evaluate phase I xenobiotic metabolism while minimizing cytotoxic and assay interference effects.
Deployment to an estrogen receptor transactivation (ERTA) assay with a test set of compounds previously
identified by estrogen receptor quantitative structure-activity relationship (ER-QSAR) analysis to generate ER-
active metabolites6 demonstrated utility for identification of false positive and false negative target assay
effects, reprioritization of hazard based on metabolism-dependent bioactivity, and enhanced in vivo
concordance with the rodent uterotrophic bioassay.

2.3	Experimental System: adherent VM7Luc4E2 cell line used. The AIME platform consists of a custom polystyrene
lids with 96- or 384- solid support pegs designed to be compatible with all ANSI/SLAS standard microplates. Each
peg of the AIME lid contains a single microsphere of alginate hydrogel mixed with hepatic S9 fraction
(metabolism positive mode) or alginate hydrogel mixed with water (metabolism negative) and reach a standard
depth that provides sufficient distance to the well bottom. The method is compatible with biochemical and cell-
based target assays.

2.4	Metabolic Competence: Initial characterization of the AIME microsphere function focused on two key areas: 1)
the ability of small molecules to freely enter the microsphere and undergo metabolism by the cytochrome P450
enzymes, and 2) the duration of cytochrome P450 enzyme activity of the S9. To evaluate these features, a panel
of five reference substrates (phenacetin, dextromethorphan, bupropion, diclofenac, and chlorzoxazone), known
to be metabolized by human cytochrome P450 enzymes to specific and identifiable metabolites (Bjornsson
2003), were incubated with the AIME lids for 0, 2, 4, and 8 hours and analyzed by LC-MS/MS. The results
demonstrate that S9 functioned well within the alginate microsphere and formed detectable metabolites from
the probe substrates.

2.5	Exposure Regime: The AIME procedure consists of a preparative "pre-stamping" step using a barium
chloride/poly-L-lysine (PLL) solution followed by "stamping" the lid in alginate-hepatic S9 (metabolism positive)
or alginate-water (metabolism negative) to form microspheres. Once stamping is completed, the AIME lids are
placed in duplicate microplates (metabolism positive or negative) containing chemical solutions and incubated
for two hours to facilitate metabolic reactions. The conditioned medium, containing parent chemical or
metabolites, is then transferred to the cell-based VM7Luc Estrogen Receptor Transactivation Assay (ERTA) for
24 hours. A typical experimental design tests a 10-point concentration series (0.002 - 200 uM) for 8 compounds
(96-well format)l or 32 compounds (384-well format)2 in single technical replicate. AIME assay controls include
trans-stilbene (bioactivated control) and ethylparaben (bioinactivated control). ERTA assay plate controls

-------
include 17-beta estradiol (positive control) and DMSO (solvent control). Experiments typically comprise a
minimum of four experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.00199999999999999 nM
Key positive control:

17beta-estradiol

Target (nominal) number of replicates:

9

Standard maximum concentration tested:

199.999999999999 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 3.314
Response cutoff threshold used to determine hit calls: 9.941
Detection technology used: multimode microplate reader (Luminescense)

2.6	Response: The ERTA assay evaluates binding and transactivation of an estrogen receptor (ER)-responsive
luciferase reporter to determine endogenous ER activity. AIME assay controls were selected for compatibility
with the ERTA assay and are intended to demonstrate metabolic-dependent shifts in estrogenic bioactivity. A
fluorescence-based cell viability reagent is used to measure treatment dependent cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Well-level data obtained as raw luminescence units for the ERTA luciferase assay are normalized
as percent activity of positive control. Well-level data obtained as raw fluorescence intensity units for the
cytotoxicity assay are normalized as log2 fold change to the DMSO control. Concentration-response modeling
of the 'percent activity' and 'log2 fold change' endpoints is used to derive quantitative potency and efficacy
values. Metabolism-dependent 'shifts' in bioactivity are evaluated by modeling the difference in concentration-
dependent effects.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

NA

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 768

Number of chemicals tested: 768

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
103

Inactive hit count: 0
-------
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	NA

Neutral control median absolute deviation, by plate: nmad	NA

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 101.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Deisenroth C, DeGroot DE, Zurlinden T, Eicher A, McCord J, Lee MY, Carmichael P, Thomas RS. The
Alginate Immobilization of Metabolic Enzymes Platform Retrofits an Estrogen Receptor Transactivation Assay
With Metabolic Competence. Toxicol Sci. 2020 Dec 1;178(2):281-301. doi: 10.1093/toxsci/kfaal47. PMID:
32991717; PMCID: PMC8154005., Hopperstad, K., DeGroot, D. E., Zurlinden, T., Brinkman, C., Thomas, R. S., &
Deisenroth, C. (2022). Chemical Screening in an Estrogen Receptor Transactivation Assay With Metabolic
Competence. Toxicological sciences : an official journal of the Society of Toxicology, 187(1), 112-126.
https://doi.org/10.1093/toxsci/kfac019, Jacobs, M. N., Janssens, W., Bernauer, U., Brandon, E., Coecke, S.,
Combes, R., Edwards, P., Freidig, A., Freyberger, A., Kolanczyk, R., Mc Ardle, C., Mekenyan, O., Schmieder, P.,
Schrader, T., Takeyoshi, M., & van der Burg, B. (2008). The use of metabolising systems for in vitro testing of
endocrine disruptors. Current drug metabolism, 9(8), 796-826. https://doi.org/10.2174/138920008786049294,
Jacobs, M. N., Laws, S. C., Willett, K., Schmieder, P., Odum, J., & Bovee, T. F. (2013). In vitro metabolism and
bioavailability tests for endocrine active substances: what is needed next for regulatory purposes?. ALTEX, 30(3),
331-351. https://doi.Org/10.14573/altex.2013.3.331, OECD (2014), Detailed Review Paper on the Use of
Metabolising Systems for In Vitro Testing of Endocrine Disruptors, OECD Series on Testing and Assessment, No.
97, OECD Publishing, Paris, https://doi.org/10.1787/9789264085497-en., Pinto, C. L., Mansouri, K., Judson, R.,
& Browne, P. (2016). Prediction of Estrogenic Bioactivity of Environmental Chemical Metabolites. Chemical
research in toxicology, 29(9), 1410-1427. https://doi.org/10.1021/acs.chemrestox.6b00079, Bjornsson, T. D.,
Callaghan, J. T., Einolf, H. J., Fischer, V., Gan, L., Grimm, S., Kao, J., King, S. P., Miwa, G., Ni, L, Kumar, G., McLeod,
J., Obach, R. S., Roberts, S., Roe, A., Shah, A., Snikeris, F., Sullivan, J. T., Tweedie, D., Vega, J. M.,... FDA Center
for Drug Evaluation and Research (CDER) (2003). The conduct of in vitro and in vivo drug-drug interaction

-------
studies: a Pharmaceutical Research and Manufacturers of America (PhRMA) perspective. Drug metabolism and
disposition: the biological fate of chemicals, 31(7), 815-832. https://doi.Org/10.1124/dmd.31.7.815

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3072

CCTE_Deisen roth_5AR_N BTE_a utof I uor

1.	General Information

1.1	Assay Title: CCTE's 5alpha-reductase NanoBRET Target Engagement Assay - Donor Signal, Deisenroth Lab

1.2	Assay Summary: CCTE Deisenroth 5AR NBTE (5alpha-reductase NanoBRET Target Engagement Assay) is a cell-
based, single-readout assay that uses HEK293T, a human kidney cell line, with measurements taken at 5 hours
after chemical dosing in a 384 well plate. This assay includes a primary and secondary screen as well as an
autofluorscence screen. CCTE_Deisenroth_5AR_NBTE_autofluor is one of four assay components measured
from the CCTE_Deisenroth_5AR NBTE assay. It is designed to make measurements of a compound's
autofluorescence, as detected by fluorescence intensity emission signals using spectrometry technology. Data
from the assay component CCTE_Deisenroth_5AR NBTE_autofluor was analyzed into 1 assay endpoint. This
assay endpoint, CCTE_Deisenroth_5AR NBTE_autofluor, was analyzed with bidirectional fitting relative to
median test wells for the baseline of activity. Using a type of background reporter, gain-of-signal activity can be
used to understand autofluorescence. Furthermore, this assay endpoint can be referred to as a secondary
readout, because this assay has produced multiple assay endpoints where this one serves an artifact detection
function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The SRD5A2-NBTE assay has been developed and applied to screening the ToxCast
chemical library.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in fluorescence intensity are indicative of the test substance having some physical feature
that alters or influences the background fluorescence.

The SRD5A2-NBTE assay employs NanoBRET Target Engagement (NBTE) assay technology to dynamically
evaluate modulation of testosterone binding to SRD5A2 (5 alpha reductase enzyme) in living cells. Altered
steroid hormone binding to SRD5A2 indicates the potential to disrupt androgenic hormone homeostasis.

2.2	Scientific Principles: Altered steroid hormone biosynthesis and metabolism can disrupt sex hormone
homeostasis, leading to impaired reproductive and sexual development (Sanderson 2006, Sidorkiewicz 2017).
In males, incomplete masculinization, or virilization, directly stems from deficiencies in androgen
steroidogenesis, target-tissue metabolism and activity. Within some androgen sensitive tissues, steroid 5alpha-
reductases 1 (SRD5A1) and 2 (SRD5A2) play an important role in androgen metabolism by catalyzing the
conversion of testosterone into the more potent androgen 5alpha-dihydrotestosterone (DHT) (Russell 1994).

-------
The type 2 isozyme, SRD5A2, is the predominant enzyme detectable in fetal genital skin, accessory sex organs,
and the prostate (Thigpen, 1993). Genetic lesions in SRD5A2 manifests as a clinical syndrome consistent with
decreased virilization that includes hypospadias, undescended testes, underdeveloped prostate, and
pseudohermaphroditism (Mendonca 2016, Wilson 1993). Environmental chemical exposures are another factor
contributing to androgen disruption that can be attributed to a number of mechanisms including inhibition of
androgen synthesis, modulation of the androgen receptor or co-factor recruitment, or inhibition of androgen
metabolizing enzymes like 5 alpha- reductase (Sanderson 2006). Indeed, certain pesticides and industrial
compounds with anti-androgen activity have previously been evaluated for 5 alpha-reductase inhibition,
pointing to 5 alpha-reductase deficiency as a potentially important, but largely unexplored, mode-of-action
when evaluating the impact of environmental chemical exposures on androgen function during human
development (Lo 2007).

2.3	Experimental System: adherent HEK293T cell line used. NanoBRETTarget Engagement assay technology utilizes
bioluminescence resonance energy transfer (BRET) for quantitative analysis of small molecule pharmacology of
cellular proteins in both equilibrium and non-equilibrium conditions. The NanoBRET Target Engagement assay
for 5alpha-reductase inhibition was designed to directly evaluate the physical engagement of testosterone
substrate with SRD5A2 target enzyme within the context of a dynamic, intracellular environment. The assay
employs HEK293T human embryonic kidney cells transiently transfected with full-length human SRD5A2 fused
with a flexible linker to NanoLuc luciferase enzyme. NanoLuc enzyme converts furimazine substrate to
furimamide metabolite, resulting in the generation of C02 and bioluminescent light. A high-affinity, cell-
permeable tracer consisting of a testosterone backbone fused with a flexible linker to NanoBRET 590 SE acceptor
fluorophore binds to the ligand binding domain of SRD5A2. A bioluminescence resonance energy transfer (BRET)
signal is produced when tracer is directly bound to the SRD5A2 enzyme.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: For assay setup, 384-well non-binding surface (NBS) microplates are pre-dispensed with test
chemical across an 8-point concentration series (10 nM - 29.9 uM) with two technical replicates per run. Cell
suspensions are dispensed into the microplates and incubated with test chemical for a duration of three hours.
Following pre-incubation with test chemical, the testosterone tracer is dispensed into each well and incubated
an additional two hours. The total incubation period with test chemical is five hours, with testosterone tracer
added the final two hours to reach target engagement equilibrium. For the NanoBRET endpoint measurements,
a solution of Nano-Glo substrate and extracellular NanoLuc inhibitor solution is added to each test well and
immediately scanned on a plate reader configured with multichromatic emission settings for donor
luminescence (460-20 nM) and acceptor fluorescence (650-100 nM). DMSO solvent controls are included as
baseline control. Plate-matched samples are run with mock tracer conditions or testosterone tracer to
discriminate any potential non-specific substrate interference.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
NA

Standard minimum concentration tested:

NA nM
Key positive control:

Tracer Fluorogenic Probe

Target (nominal) number of replicates:

NA

Standard maximum concentration tested:

NA nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): NA

Response cutoff threshold used to determine hit calls: NA

Detection technology used: multimode microplate reader (Fluorescence)

2.6 Response: Altered testosterone binding to the SRD5A2 enzyme as measured by a bioluminescence ('donor
signal1) resonance energy transfer fluorescence emission ('acceptor signal') generated in the SRD5A2-NanoBRET
Target Engagement Assay. The acceptor/donor ratio constitutes the 'ratio signal'.

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For the NanoBRET endpoint measurements, plates were scanned on a multi-mode plate reader
configured with multichromatic emission settings for donor luminescence (460-20 nM) and acceptor
fluorescence (650-100 nM). Well-level normalization of raw data reads was calculated with the following
equation: mBU = (Acceptor Em(650)/Donor Em(460)) x 1,000, where mBU are milliBRET Units, Acceptor Em 650
is the acceptor fluorescence signal, and Donor Em 460 is the donor luminescence signal. The normalized mBU
ratio values were expressed as a percent of the DMSO solvent control response using the following equation:
%DMSO = (mBU i/median mBUdmso) x 100, where mBUi denotes the well-level mBU ratio value for the test
chemical, and mBUdmso is the plate-level median mBU ratio value for the solvent controls. The baseline median
absolute deviation (BMAD) of solvent controls were calculated with the following equation: BMAD = 1.4826 x
median(|yi-median(y)|), where yi denotes each DMSO-normalized mBU value, and y is the median DMSO-
normalized mBU value for all the plate-based DMSO solvent controls. A threshold cutoff of 3 x BMAD was
established to define the noise band. A tested chemical with any single concentration value above or below the
statistical noise band was defined as an active hit.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

NA

Level 3: Endpoint-specific normalization include:

-------
NA

Level 4: Baseline and required tcplFit2 parameters defined by:

NA

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:
NA

Level 6: Cautionary flagging include:

NA

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: NA

Number of chemicals tested: NA

Active hit count: hitc>0.9
NA

ACTIVITY HIT CALLS

Inactive hit count: Oihitc 0.9
NA

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	NA

gain-loss (gnls) model:	NA

power(pow) model:	NA

linear-polynomial (polyl) model:	NA

quadratic-polynomial(poly2) model:	NA

exponential-2 (exp2) model:	NA

exponential-3 (exp3) model:	NA

exponential-4 (exp4) model:	NA

exponential-5 (exp5) model:	NA

NA hit count: hitc^O
NA

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3 Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.

-------
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

NA
NA
NA%

NA
NA

NA

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: NA.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Kleinstreuer NC, Browne P, Chang X, Judson R, Casey W, Ceger P, Deisenroth C, Baker N, Markey
K, Thomas RS. Evaluation of androgen assay results using a curated Hershberger database. Reprod Toxicol. 2018
Oct;81:272-280. doi: 10.1016/j.reprotox.2018.08.017. Epub 2018 Sep 8. PMID: 30205137; PMCID:
PMC7171594., Sanderson J. T. (2006). The steroid hormone biosynthesis pathway as a target for endocrine-
disrupting chemicals. Toxicological sciences : an official journal of the Society of Toxicology, 94(1), 3-21.
https://doi.org/10.1093/toxsci/kfl051, Sidorkiewicz, I., Zariba, K., Wotczynski, S., & Czerniecki, J. (2017).
Endocrine-disrupting chemicals-Mechanisms of action on male reproductive system. Toxicology and industrial
health, 33(7), 601-609. https://doi.org/10.1177/0748233717695160, Russell, D. W., & Wilson, J. D. (1994).
Steroid 5 alpha-reductase: two genes/two enzymes. Annual review of biochemistry, 63, 25-61.
https://doi.org/10.1146/annurev.bi.63.070194.000325, Thigpen, A. E., Silver, R. I., Guileyardo, J. M., Casey, M.
L., McConnell, J. D., & Russell, D. W. (1993). Tissue distribution and ontogeny of steroid 5 alpha-reductase
isozyme expression. The Journal of clinical investigation, 92(2), 903-910. https://doi.org/10.1172/JCI116665,
Mendonca, B. B., Batista, R. L, Domenice, S., Costa, E. M., Arnhold, I. J., Russell, D. W., & Wilson, J. D. (2016).

-------
Steroid 5a-reductase 2 deficiency. The Journal of steroid biochemistry and molecular biology, 163, 206-211.
https://doi.Org/10.1016/j.jsbmb.2016.05.020, Wilson, J. D., Griffin, J. E., & Russell, D. W. (1993). Steroid 5 alpha-
reductase 2 deficiency. Endocrine reviews, 14(5), 577-593. https://doi.org/10.1210/edrv-14-5-577, Lo, S., King,
I., Allera, A., & Klingmuller, D. (2007). Effects of various pesticides on human 5alpha-reductase activity in
prostate and LNCaP cells. Toxicology in vitro : an international journal published in association with BIBRA, 21(3),
502-508. https://doi.Org/10.1016/j.tiv.2006.10.016

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3074

CCTE_Deisen roth_5AR_N BTE_donor

1.	General Information

1.1	Assay Title: CCTE's 5alpha-reductase NanoBRET Target Engagement Assay - Acceptor Signal, Deisenroth Lab

1.2	Assay Summary: CCTE Deisenroth 5AR NBTE (5alpha-reductase NanoBRET Target Engagement Assay) is a cell-
based, single-readout assay that uses HEK293T, a human kidney cell line, with measurements taken at 5 hours
after chemical dosing in a 384 well plate. This assay includes a primary and secondary screen as well as an
autofluorscence screen. CCTE_Deisenroth_5AR NBTE_donor is one of four assay components measured from
the CCTE_Deisenroth_5AR NBTE assay. It is designed to make measurements of donor luminescence, as
detected by bioluminescence intensity emission signals using spectrometry technology. Data from the assay
component CCTE_Deisenroth_5AR NBTE_donor was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Deisenroth_5AR NBTE_donor, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain or loss-of-signal activity can be used to
understand changes in the donor luminescence as related to 5alpha-reductase catalytic function. Furthermore,
this assay endpoint can be referred to as a secondary readout, because this assay has produced multiple assay
endpoints where this one serves an artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The SRD5A2-NBTE assay has been developed and applied to screening the ToxCast
chemical library.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: The technology is based on the concept of Bioluminescence Resonance Energy Transfer BRET.
Target proteins fused to NanoLuc luciferase generate luminescence signal in the presence of furimazine
substrate. BRET occurs when custom fluorescent tracers, bound to the target protein, resonate luminescent
energy from the luciferase enzyme to the tracer molecule to yield a lower-energy fluorescent emission signal.

The SRD5A2-NBTE assay employs NanoBRET Target Engagement (NBTE) assay technology to dynamically
evaluate modulation of testosterone binding to SRD5A2 (5 alpha reductase enzyme) in living cells. Altered
steroid hormone binding to SRD5A2 indicates the potential to disrupt androgenic hormone homeostasis.

2.2	Scientific Principles: Altered steroid hormone biosynthesis and metabolism can disrupt sex hormone
homeostasis, leading to impaired reproductive and sexual development (Sanderson 2006, Sidorkiewicz 2017).
In males, incomplete masculinization, or virilization, directly stems from deficiencies in androgen
steroidogenesis, target-tissue metabolism and activity. Within some androgen sensitive tissues, steroid 5alpha-

-------
reductases 1 (SRD5A1) and 2 (SRD5A2) play an important role in androgen metabolism by catalyzing the
conversion of testosterone into the more potent androgen 5alpha-dihydrotestosterone (DHT) (Russell 1994).
The type 2 isozyme, SRD5A2, is the predominant enzyme detectable in fetal genital skin, accessory sex organs,
and the prostate (Thigpen, 1993). Genetic lesions in SRD5A2 manifests as a clinical syndrome consistent with
decreased virilization that includes hypospadias, undescended testes, underdeveloped prostate, and
pseudohermaphroditism (Mendonca 2016, Wilson 1993). Environmental chemical exposures are another factor
contributing to androgen disruption that can be attributed to a number of mechanisms including inhibition of
androgen synthesis, modulation of the androgen receptor or co-factor recruitment, or inhibition of androgen
metabolizing enzymes like 5 alpha- reductase (Sanderson 2006). Indeed, certain pesticides and industrial
compounds with anti-androgen activity have previously been evaluated for 5 alpha-reductase inhibition,
pointing to 5 alpha-reductase deficiency as a potentially important, but largely unexplored, mode-of-action
when evaluating the impact of environmental chemical exposures on androgen function during human
development (Lo 2007).

2.3	Experimental System: adherent HEK293T cell line used. NanoBRETTarget Engagement assay technology utilizes
bioluminescence resonance energy transfer (BRET) for quantitative analysis of small molecule pharmacology of
cellular proteins in both equilibrium and non-equilibrium conditions. The NanoBRET Target Engagement assay
for 5alpha-reductase inhibition was designed to directly evaluate the physical engagement of testosterone
substrate with SRD5A2 target enzyme within the context of a dynamic, intracellular environment. The assay
employs HEK293T human embryonic kidney cells transiently transfected with full-length human SRD5A2 fused
with a flexible linker to NanoLuc luciferase enzyme. NanoLuc enzyme converts furimazine substrate to
furimamide metabolite, resulting in the generation of C02 and bioluminescent light. A high-affinity, cell-
permeable tracer consisting of a testosterone backbone fused with a flexible linker to NanoBRET 590 SE acceptor
fluorophore binds to the ligand binding domain of SRD5A2. A bioluminescence resonance energy transfer (BRET)
signal is produced when tracer is directly bound to the SRD5A2 enzyme.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: For transient expression of the SRD5A2-NanoLuc fusion protein, HEK293T cells are reverse
transfected with pNLFl-C-SRD5A2 expression vector and incubated for 20 hours to facilitate fusion protein
expression. For assay setup, 384-well non-binding surface (NBS) microplates are pre-dispensed with test
chemical across an 8-point concentration series (10 nM - 29.9 uM) with two technical replicates per run. Cell
suspensions are dispensed into the microplates and incubated with test chemical for a duration of three hours.
Following pre-incubation with test chemical, the testosterone tracer is dispensed into each well and incubated
an additional two hours. The total incubation period with test chemical is five hours, with testosterone tracer
added the final two hours to reach target engagement equilibrium. For the NanoBRET endpoint measurements,
a solution of Nano-Glo substrate and extracellular NanoLuc inhibitor solution is added to each test well and
immediately scanned on a plate reader configured with multichromatic emission settings for donor
luminescence (460-20 nM) and acceptor fluorescence (650-100 nM). DMSO solvent controls are included as
baseline control. Plate-matched samples are run with mock tracer conditions or testosterone tracer to
discriminate any potential non-specific substrate interference.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
9

Standard minimum concentration tested:

0.000125 nM
Key positive control:

Dutasteride, Finasteride

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.448
Response cutoff threshold used to determine hit calls: 1.343
Detection technology used: multimode microplate reader (Luminescence)

-------
2.6	Response: Altered testosterone binding to the SRD5A2 enzyme as measured by a bioluminescence ('donor
signal1) resonance energy transfer fluorescence emission ('acceptor signal') generated in the SRD5A2-NanoBRET
Target Engagement Assay. The acceptor/donor ratio constitutes the 'ratio signal'.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For the NanoBRET endpoint measurements, plates were scanned on a multi-mode plate reader
configured with multichromatic emission settings for donor luminescence (460-20 nM) and acceptor
fluorescence (650-100 nM). Well-level normalization of raw data reads was calculated with the following
equation: mBU = (Acceptor Em(650)/Donor Em(460)) x 1,000, where mBU are milliBRET Units, Acceptor Em 650
is the acceptor fluorescence signal, and Donor Em 460 is the donor luminescence signal. The normalized mBU
ratio values were expressed as a percent of the DMSO solvent control response using the following equation:
%DMSO = (mBU i/median mBUdmso) x 100, where mBUi denotes the well-level mBU ratio value for the test
chemical, and mBUdmso is the plate-level median mBU ratio value for the solvent controls. The baseline median
absolute deviation (BMAD) of solvent controls were calculated with the following equation: BMAD = 1.4826 x
median(|yi-median(y)|), where yi denotes each DMSO-normalized mBU value, and y is the median DMSO-
normalized mBU value for all the plate-based DMSO solvent controls. A threshold cutoff of 3 x BMAD was
established to define the noise band. A tested chemical with any single concentration value above or below the
statistical noise band was defined as an active hit.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay
plate ID (apid), of the corrected values (cval) for neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 228	Number of chemicals tested: 228

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
61

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed

Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

((pmed - nmed) / sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)

Positive control signal-to-background: (pmed/nmed)

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed

Negative control well median absolute deviation value, by plate: mmad

Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

257984.5
87858.876
32.64%

350944
119715.502

NA

0.602

NA
NA

NA
NA
NA

NA
NA
NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 9.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Kleinstreuer NC, Browne P, Chang X, Judson R, Casey W, Ceger P, Deisenroth C, Baker N, Markey
K, Thomas RS. Evaluation of androgen assay results using a curated Hershberger database. Reprod Toxicol. 2018
Oct;81:272-280. doi: 10.1016/j.reprotox.2018.08.017. Epub 2018 Sep 8. PMID: 30205137; PMCID:
PMC7171594., Sanderson J. T. (2006). The steroid hormone biosynthesis pathway as a target for endocrine-
disrupting chemicals. Toxicological sciences : an official journal of the Society of Toxicology, 94(1), 3-21.
https://doi.org/10.1093/toxsci/kfl051, Sidorkiewicz, I., Zariba, K., Wotczynski, S., & Czerniecki, J. (2017).
Endocrine-disrupting chemicals-Mechanisms of action on male reproductive system. Toxicology and industrial
health, 33(7), 601-609. https://doi.org/10.1177/0748233717695160, Russell, D. W., & Wilson, J. D. (1994).
Steroid 5 alpha-reductase: two genes/two enzymes. Annual review of biochemistry, 63, 25-61.
https://doi.org/10.1146/annurev.bi.63.070194.000325, Thigpen, A. E., Silver, R. I., Guileyardo, J. M., Casey, M.
L., McConnell, J. D., & Russell, D. W. (1993). Tissue distribution and ontogeny of steroid 5 alpha-reductase
isozyme expression. The Journal of clinical investigation, 92(2), 903-910. https://doi.org/10.1172/JCI116665,
Mendonca, B. B., Batista, R. L, Domenice, S., Costa, E. M., Arnhold, I. J., Russell, D. W., & Wilson, J. D. (2016).
Steroid 5a-reductase 2 deficiency. The Journal of steroid biochemistry and molecular biology, 163, 206-211.
https://doi.Org/10.1016/j.jsbmb.2016.05.020, Wilson, J. D., Griffin, J. E., & Russell, D. W. (1993). Steroid 5 alpha-
reductase 2 deficiency. Endocrine reviews, 14(5), 577-593. https://doi.org/10.1210/edrv-14-5-577, Lo, S., King,
I., Allera, A., & Klingmuller, D. (2007). Effects of various pesticides on human 5alpha-reductase activity in
prostate and LNCaP cells. Toxicology in vitro : an international journal published in association with BIBRA, 21(3),
502-508. https://doi.Org/10.1016/j.tiv.2006.10.016

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3076

CCTE_Deisen roth_5AR_N BTE_acceptor

1.	General Information

1.1	Assay Title: CCTE's 5alpha-reductase NanoBRET Target Engagement Assay - Autofluorescence Screen,
Deisenroth Lab

1.2	Assay Summary: CCTE Deisenroth 5AR NBTE (5alpha-reductase NanoBRET Target Engagement Assay) is a cell-
based, single-readout assay that uses HEK293T, a human kidney cell line, with measurements taken at 5 hours
after chemical dosing in a 384 well plate. This assay includes a primary and secondary screen as well as an
autofluorscence screen. CCTE_Deisenroth_5AR NBTE_acceptor is one of four assay components measured from
the CCTE_Deisenroth_5AR NBTE assay. It is designed to make measurements of resonant fluorescence
generated by bioluminescent energy transfer to a target fluorophore, as detected by fluorescence intensity
emission signals using spectrometry technology. Data from the assay component CCTE_Deisenroth_5AR
NBTE_acceptor was analyzed into 1 assay endpoint. This assay endpoint, CCTE_Deisenroth_5AR
NBTE_acceptor, was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, gain or loss-of-signal activity can be used to understand changes in
the acceptor fluorescence as related to 5alpha-reductase catalytic function. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves an artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed,

l.llAbbreviations:

AIC: Akaike Information Criterion
AOP: Adverse Outcome Pathway
CV: Coefficient of Variation
DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: The technology is based on the concept of Bioluminescence Resonance Energy Transfer BRET.
Target proteins fused to NanoLuc luciferase generate luminescence signal in the presence of furimazine
substrate. BRET occurs when custom fluorescent tracers, bound to the target protein, resonate luminescent
energy from the luciferase enzyme to the tracer molecule to yield a lower-energy fluorescent emission signal.

The SRD5A2-NBTE assay employs NanoBRET Target Engagement (NBTE) assay technology to dynamically
evaluate modulation of testosterone binding to SRD5A2 (5 alpha reductase enzyme) in living cells. Altered
steroid hormone binding to SRD5A2 indicates the potential to disrupt androgenic hormone homeostasis.

2.2	Scientific Principles: Altered steroid hormone biosynthesis and metabolism can disrupt sex hormone
homeostasis, leading to impaired reproductive and sexual development (Sanderson 2006, Sidorkiewicz 2017).
In males, incomplete masculinization, or virilization, directly stems from deficiencies in androgen

and data are publicly available in ToxCast s invitroDB.

ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package
SSMD: Strictly Standardized Mean Difference

-------
steroidogenesis, target-tissue metabolism and activity. Within some androgen sensitive tissues, steroid 5alpha-
reductases 1 (SRD5A1) and 2 (SRD5A2) play an important role in androgen metabolism by catalyzing the
conversion of testosterone into the more potent androgen 5alpha-dihydrotestosterone (DHT) (Russell 1994).
The type 2 isozyme, SRD5A2, is the predominant enzyme detectable in fetal genital skin, accessory sex organs,
and the prostate (Thigpen, 1993). Genetic lesions in SRD5A2 manifests as a clinical syndrome consistent with
decreased virilization that includes hypospadias, undescended testes, underdeveloped prostate, and
pseudohermaphroditism (Mendonca 2016, Wilson 1993). Environmental chemical exposures are another factor
contributing to androgen disruption that can be attributed to a number of mechanisms including inhibition of
androgen synthesis, modulation of the androgen receptor or co-factor recruitment, or inhibition of androgen
metabolizing enzymes like 5 alpha- reductase (Sanderson 2006). Indeed, certain pesticides and industrial
compounds with anti-androgen activity have previously been evaluated for 5 alpha-reductase inhibition,
pointing to 5 alpha-reductase deficiency as a potentially important, but largely unexplored, mode-of-action
when evaluating the impact of environmental chemical exposures on androgen function during human
development (Lo 2007).

2.3	Experimental System: adherent HEK293T cell line used. NanoBRETTarget Engagement assay technology utilizes
bioluminescence resonance energy transfer (BRET) for quantitative analysis of small molecule pharmacology of
cellular proteins in both equilibrium and non-equilibrium conditions. The NanoBRET Target Engagement assay
for 5alpha-reductase inhibition was designed to directly evaluate the physical engagement of testosterone
substrate with SRD5A2 target enzyme within the context of a dynamic, intracellular environment. The assay
employs HEK293T human embryonic kidney cells transiently transfected with full-length human SRD5A2 fused
with a flexible linker to NanoLuc luciferase enzyme. NanoLuc enzyme converts furimazine substrate to
furimamide metabolite, resulting in the generation of C02 and bioluminescent light. A high-affinity, cell-
permeable tracer consisting of a testosterone backbone fused with a flexible linker to NanoBRET 590 SE acceptor
fluorophore binds to the ligand binding domain of SRD5A2. A bioluminescence resonance energy transfer (BRET)
signal is produced when tracer is directly bound to the SRD5A2 enzyme.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Chemicals from the blinded ToxCast library were diluted 1000-fold in DMSO to an assumed
nominal concentration of 100 nM. Samples were scanned in three independent runs (n=3) on a multi-mode
plate reader at a fixed excitation wavelength of 460 nM with emission filter set at 650-100 nM. For reference,
tracer was included at DMSO-solubilized concentrations of 100 nM (the top tested concentration for each test
chemical) and 600 nM (the final working concentration of the tracer probe).

Baseline median absolute deviation for the assay (bmad): 0.477
Response cutoff threshold used to determine hit calls: 1.43
Detection technology used: multimode microplate reader (Fluorescence)

2.6 Response: Autofluorescence interference of test substances can be a confounding issue in data interpretation
in fluorescence-based assays. This endpoint evaluates the fluorescence emission of test substances using the
peak excitation/emission spectra used in the SRD5A2-NanoBRET Target Engagement Assay Protocol.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
9

Standard minimum concentration tested:

0.000125 nM
Key positive control:

Dutasteride, Finasteride

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Statistical analysis of the plate reader data was performed with a robust version of SSMD*(MM).
A threshold cutoff of Log2 ± 2 was established to define the noise band. A tested chemical with a value above
the statistical noise band was defined as a hit and used to flag bioactivity hits for potential autofluorescence
interference.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay
plate ID (apid), of the corrected values (cval) for neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

-------
1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 228	Number of chemicals tested: 228

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
55

Inactive hit count: Oihitc 0.9
173

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

18
8

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

29

62

quadratic-polynomialfpoly2) model: 9

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:

82

6

1

-------
exponentials (exp5) model:

13

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

14257

Neutral control median absolute deviation, by plate: nmad

5014.153

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

33.24%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

9277.369

27647

-------
Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - rimed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	1.115

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 13.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.

5.1

Potential Regulatory Applications

Context of Use: Examples of end use scenarios could include, but are not limited to:

-------
•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Kleinstreuer NC, Browne P, Chang X, Judson R, Casey W, Ceger P, Deisenroth C, Baker N, Markey
K, Thomas RS. Evaluation of androgen assay results using a curated Hershberger database. Reprod Toxicol. 2018
Oct;81:272-280. doi: 10.1016/j.reprotox.2018.08.017. Epub 2018 Sep 8. PMID: 30205137; PMCID:
PMC7171594., Sanderson J. T. (2006). The steroid hormone biosynthesis pathway as a target for endocrine-
disrupting chemicals. Toxicological sciences : an official journal of the Society of Toxicology, 94(1), 3-21.
https://doi.org/10.1093/toxsci/kfl051, Sidorkiewicz, I., Zariba, K., Wotczynski, S., & Czerniecki, J. (2017).
Endocrine-disrupting chemicals-Mechanisms of action on male reproductive system. Toxicology and industrial
health, 33(7), 601-609. https://doi.org/10.1177/0748233717695160, Russell, D. W., & Wilson, J. D. (1994).
Steroid 5 alpha-reductase: two genes/two enzymes. Annual review of biochemistry, 63, 25-61.
https://doi.org/10.1146/annurev.bi.63.070194.000325, Thigpen, A. E., Silver, R. I., Guileyardo, J. M., Casey, M.
L., McConnell, J. D., & Russell, D. W. (1993). Tissue distribution and ontogeny of steroid 5 alpha-reductase
isozyme expression. The Journal of clinical investigation, 92(2), 903-910. https://doi.org/10.1172/JCI116665,
Mendonca, B. B., Batista, R. L, Domenice, S., Costa, E. M., Arnhold, I. J., Russell, D. W., & Wilson, J. D. (2016).
Steroid 5a-reductase 2 deficiency. The Journal of steroid biochemistry and molecular biology, 163, 206-211.
https://doi.Org/10.1016/j.jsbmb.2016.05.020, Wilson, J. D., Griffin, J. E., & Russell, D. W. (1993). Steroid 5 alpha-
reductase 2 deficiency. Endocrine reviews, 14(5), 577-593. https://doi.org/10.1210/edrv-14-5-577, Lo, S., King,
I., Allera, A., & Klingmuller, D. (2007). Effects of various pesticides on human 5alpha-reductase activity in
prostate and LNCaP cells. Toxicology in vitro : an international journal published in association with BIBRA, 21(3),
502-508. https://doi.Org/10.1016/j.tiv.2006.10.016

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3078

CCTE_Deisen rot h_5AR_N BTE_ratio

1.	General Information

1.1	Assay Title: CCTE's 5alpha-reductase NanoBRET Target Engagement Assay - Ratio, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_5AR NBTE (5alpha-reductase NanoBRET Target Engagement Assay) is a cell-
based, single-readout assay that uses HEK293T, a human kidney cell line, with measurements taken at 5 hours
after chemical dosing in a 384 well plate. This assay includes a primary and secondary screen as well as an
autofluorscence screen. CCTE_Deisenroth_5AR NBTE_ratio is one of four assay components calculated from the
CCTE_Deisenroth_5AR NBTE assay. It is designed to measure the ratio of target acceptor fluorophore resonance
to donor bioluminescent signals. Data from the assay component CCTE_Deisenroth_5AR NBTE_ratio was
analyzed into 1 assay endpoint. This assay endpoint, CCTE_Deisenroth_5AR NBTE_ratio, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, gain or loss-of-signal activity can be used to understand changes in 5alpha-reductasecatalyticfunction.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the misc protein intended target family, where the
subfamily is 5-alpha reductase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The SRD5A2-NBTE assay has been developed and applied to screening the ToxCast
chemical library.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: The ratio of acceptor signal fluorescence to donor signal luminescence yields a normalized
quantitative value in milliBRET Units mBU that can be used to evaluate direct disruption of small molecule target
protein interactions of 5 alpha reductase.

The SRD5A2-NBTE assay employs NanoBRET Target Engagement (NBTE) assay technology to dynamically
evaluate modulation of testosterone binding to SRD5A2 (5 alpha reductase enzyme) in living cells. Altered
steroid hormone binding to SRD5A2 indicates the potential to disrupt androgenic hormone homeostasis.

2.2	Scientific Principles: Altered steroid hormone biosynthesis and metabolism can disrupt sex hormone
homeostasis, leading to impaired reproductive and sexual development (Sanderson 2006, Sidorkiewicz 2017).
In males, incomplete masculinization, or virilization, directly stems from deficiencies in androgen
steroidogenesis, target-tissue metabolism and activity. Within some androgen sensitive tissues, steroid 5alpha-
reductases 1 (SRD5A1) and 2 (SRD5A2) play an important role in androgen metabolism by catalyzing the

-------
conversion of testosterone into the more potent androgen 5alpha-dihydrotestosterone (DHT) (Russell 1994).
The type 2 isozyme, SRD5A2, is the predominant enzyme detectable in fetal genital skin, accessory sex organs,
and the prostate (Thigpen, 1993). Genetic lesions in SRD5A2 manifests as a clinical syndrome consistent with
decreased virilization that includes hypospadias, undescended testes, underdeveloped prostate, and
pseudohermaphroditism (Mendonca 2016, Wilson 1993). Environmental chemical exposures are another factor
contributing to androgen disruption that can be attributed to a number of mechanisms including inhibition of
androgen synthesis, modulation of the androgen receptor or co-factor recruitment, or inhibition of androgen
metabolizing enzymes like 5 alpha- reductase (Sanderson 2006). Indeed, certain pesticides and industrial
compounds with anti-androgen activity have previously been evaluated for 5 alpha-reductase inhibition,
pointing to 5 alpha-reductase deficiency as a potentially important, but largely unexplored, mode-of-action
when evaluating the impact of environmental chemical exposures on androgen function during human
development (Lo 2007).

2.3	Experimental System: adherent HEK293T cell line used. NanoBRETTarget Engagement assay technology utilizes
bioluminescence resonance energy transfer (BRET) for quantitative analysis of small molecule pharmacology of
cellular proteins in both equilibrium and non-equilibrium conditions. The NanoBRET Target Engagement assay
for 5alpha-reductase inhibition was designed to directly evaluate the physical engagement of testosterone
substrate with SRD5A2 target enzyme within the context of a dynamic, intracellular environment. The assay
employs HEK293T human embryonic kidney cells transiently transfected with full-length human SRD5A2 fused
with a flexible linker to NanoLuc luciferase enzyme. NanoLuc enzyme converts furimazine substrate to
furimamide metabolite, resulting in the generation of C02 and bioluminescent light. A high-affinity, cell-
permeable tracer consisting of a testosterone backbone fused with a flexible linker to NanoBRET 590 SE acceptor
fluorophore binds to the ligand binding domain of SRD5A2. A bioluminescence resonance energy transfer (BRET)
signal is produced when tracer is directly bound to the SRD5A2 enzyme.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: For transient expression of the SRD5A2-NanoLuc fusion protein, HEK293T cells are reverse
transfected with pNLFl-C-SRD5A2 expression vector and incubated for 20 hours to facilitate fusion protein
expression. For assay setup, 384-well non-binding surface (NBS) microplates are pre-dispensed with test
chemical across an 8-point concentration series (10 nM - 29.9 uM) with two technical replicates per run. Cell
suspensions are dispensed into the microplates and incubated with test chemical for a duration of three hours.
Following pre-incubation with test chemical, the testosterone tracer is dispensed into each well and incubated
an additional two hours. The total incubation period with test chemical is five hours, with testosterone tracer
added the final two hours to reach target engagement equilibrium. For the NanoBRET endpoint measurements,
a solution of Nano-Glo substrate and extracellular NanoLuc inhibitor solution is added to each test well and
immediately scanned on a plate reader configured with multichromatic emission settings for donor
luminescence (460-20 nM) and acceptor fluorescence (650-100 nM). DMSO solvent controls are included as
baseline control. Plate-matched samples are run with mock tracer conditions or testosterone tracer to
discriminate any potential non-specific substrate interference.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
9

Standard minimum concentration tested:

0.000125 nM
Key positive control:

Dutasteride, Finasteride
Baseline median absolute deviation for the assay (bmad): 0.05

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

Response cutoff threshold used to determine hit calls: 0.15

Detection technology used: multimode microplate reader (Bioluminescence Resonance Energy Transfer)

-------
2.6	Response: Altered testosterone binding to the SRD5A2 enzyme as measured by a bioluminescence ('donor
signal1) resonance energy transfer fluorescence emission ('acceptor signal') generated in the SRD5A2-NanoBRET
Target Engagement Assay. The acceptor/donor ratio constitutes the 'ratio signal'.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of misc protein.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For the NanoBRET endpoint measurements, plates were scanned on a multi-mode plate reader
configured with multichromatic emission settings for donor luminescence (460-20 nM) and acceptor
fluorescence (650-100 nM). Well-level normalization of raw data reads was calculated with the following
equation: mBU = (Acceptor Em(650)/Donor Em(460)) x 1,000, where mBU are milliBRET Units, Acceptor Em 650
is the acceptor fluorescence signal, and Donor Em 460 is the donor luminescence signal. The normalized mBU
ratio values were expressed as a percent of the DMSO solvent control response using the following equation:
%DMSO = (mBU i/median mBUdmso) x 100, where mBUi denotes the well-level mBU ratio value for the test
chemical, and mBUdmso is the plate-level median mBU ratio value for the solvent controls. The baseline median
absolute deviation (BMAD) of solvent controls were calculated with the following equation: BMAD = 1.4826 x
median(|yi-median(y)|), where yi denotes each DMSO-normalized mBU value, and y is the median DMSO-
normalized mBU value for all the plate-based DMSO solvent controls. A threshold cutoff of 3 x BMAD was
established to define the noise band. A tested chemical with any single concentration value above or below the
statistical noise band was defined as an active hit.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 46: resp.incr.zerocenter.fc
(Calculate the normalized response (resp) as a zero centerfold change, i.e. the ratio of the the corrected
(cval) and baseline (bval) values minus 1; resp = cval/bval -1. Typically used for increasing responses.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 228	Number of chemicals tested: 228

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
110

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	56.14

Neutral control median absolute deviation, by plate: nmad	2.921

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.14%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	78.214

Positive control well median absolute deviation, by plate: pmad	6.353

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	3.373

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 15.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Kleinstreuer NC, Browne P, Chang X, Judson R, Casey W, Ceger P, Deisenroth C, Baker N, Markey
K, Thomas RS. Evaluation of androgen assay results using a curated Hershberger database. Reprod Toxicol. 2018
Oct;81:272-280. doi: 10.1016/j.reprotox.2018.08.017. Epub 2018 Sep 8. PMID: 30205137; PMCID:
PMC7171594., Sanderson J. T. (2006). The steroid hormone biosynthesis pathway as a target for endocrine-
disrupting chemicals. Toxicological sciences : an official journal of the Society of Toxicology, 94(1), 3-21.
https://doi.org/10.1093/toxsci/kfl051, Sidorkiewicz, I., Zariba, K., Wotczynski, S., & Czerniecki, J. (2017).
Endocrine-disrupting chemicals-Mechanisms of action on male reproductive system. Toxicology and industrial
health, 33(7), 601-609. https://doi.org/10.1177/0748233717695160, Russell, D. W., & Wilson, J. D. (1994).
Steroid 5 alpha-reductase: two genes/two enzymes. Annual review of biochemistry, 63, 25-61.
https://doi.org/10.1146/annurev.bi.63.070194.000325, Thigpen, A. E., Silver, R. I., Guileyardo, J. M., Casey, M.
L., McConnell, J. D., & Russell, D. W. (1993). Tissue distribution and ontogeny of steroid 5 alpha-reductase
isozyme expression. The Journal of clinical investigation, 92(2), 903-910. https://doi.org/10.1172/JCI116665,
Mendonca, B. B., Batista, R. L, Domenice, S., Costa, E. M., Arnhold, I. J., Russell, D. W., & Wilson, J. D. (2016).
Steroid 5a-reductase 2 deficiency. The Journal of steroid biochemistry and molecular biology, 163, 206-211.
https://doi.Org/10.1016/j.jsbmb.2016.05.020, Wilson, J. D., Griffin, J. E., & Russell, D. W. (1993). Steroid 5 alpha-
reductase 2 deficiency. Endocrine reviews, 14(5), 577-593. https://doi.org/10.1210/edrv-14-5-577, Lo, S., King,
I., Allera, A., & Klingmuller, D. (2007). Effects of various pesticides on human 5alpha-reductase activity in
prostate and LNCaP cells. Toxicology in vitro : an international journal published in association with BIBRA, 21(3),
502-508. https://doi.Org/10.1016/j.tiv.2006.10.016

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3094

CCTE_Deisen roth_DEVTOX-G LR_legacy_Soxl7

1. General Information

1.1	Assay Title: (Legacy) CCTE's DevTox GLR-Endo Assay Evaluation of SRY-box transcription factor 17 (SOX17)
Protein Expression, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_legacy assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_legacy_Soxl7 is one of 4 assay components calculated from
the CCTE_Deisenroth_DEVTOX-GLR_legacy assay and it measures an increase in the percentage of cells in an
endoderm state. It is designed to make measurements of inducible reporter a form of fluorescent protein
induction as detected with optical fluorescence microscopy. Percentage Soxl7+ cells are determined by
calculating the number of Soxl7+ cells divided by cell count. A cell is Soxl7+ if the cell Soxl7 mean intensity is
5x BMAD above the pluripotent control median cell Soxl7 mean. The assay endpoints were analyzed with
bidirectional fitting relative to 0.2% DMSO control in the CCTE_Deisenroth_DEVTOX-GLR_Endo assay. Data from
the assay component CCTE_Deisenroth_DEVTOX-GLR_legacy_Soxl7 was analyzed into 2 assay endpoint. This
assay endpoint, CCTE_Deisenroth_DEVTOX-GLR_legacy_Soxl7, was analyzed with bidirectional fitting relative
to DMSO as the negative control and baseline activity. Using a type of inducible reporter, gain or loss-of-signal
activity can be used to understand changes in gene expression. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves as a
measure of endoderm differentiation. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the transcription factor intended target family, where the subfamily is SRY-related
HMG-box.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

-------
l.llAbbreviations:

AIC: Akaike Information Criterion
AOP: Adverse Outcome Pathway
CV: Coefficient of Variation

ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package
SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_legacy_Soxl7 was designed to measure changes in cellular Soxl7
expression. Changes are indicative of changes in Soxl7 gene expression due to perturbations in endoderm
differentiation.

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:

9

Standard minimum concentration tested:

10	nM

Target (nominal) number of replicates:

5

Standard maximum concentration tested:
200 nM

-------
Key positive control:	Neutral vehicle control:

SB431542	DMSO

Baseline median absolute deviation for the assay (bmad): 6.752
Response cutoff threshold used to determine hit calls: 20.256
Detection technology used: microscopy (microscopy)

2.6	Response: The DevTox GLR-Endo assay evaluates SOX17 protein expression as a biomarker for early endoderm
lineage commitment and differentiation during the gastrulation phase of embryogenesis. Well-level biomarker
expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by the sum of
total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each respective
biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of transcription factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 11:
bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive
control value (pval) to 0; pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 67

Number of chemicals tested: 67

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
37

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	63.804

Neutral control median absolute deviation, by plate: nmad	4.533

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	6.9%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	69.457

Positive control well median absolute deviation, by plate: pmad	3.784

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	0.875

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 18.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and

-------
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3095

CCTE_Deisen roth_DEVTOX-G LR_legacy_Sox2

1. General Information

1.1	Assay Title: (Legacy) CCTE's DevTox GLR-Endo Assay Evaluation of SRY-box transcription factor 2 (SOX2) Protein
Expression, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_legacy assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_legacy_Sox2 is one of 4 assay components calculated from the
CCTE_Deisenroth_DEVTOX-GLR_legacy assay that measures an increase in the percentage of cells in an
ectodermal or pluripotent state. It is designed to make measurements of inducible reporter a form of
fluorescent protein induction as detected with optical fluorescence microscopy. Percentage Sox2+ cells are
determined by calculating the number of Sox2+ cells divided by cell count. A cell is Sox2+ if the cell Sox2 mean
intensity is 5x BMAD above the directed endoderm control median cell Sox2 mean. The assay endpoint was
analyzed with bidirectional fitting with the 0.2% DMSO control median percent Sox2 in the
CCTE_Deisenroth_DEVTOX-GLR_Endo assay subtracted. Data from the assay component
CCTE_Deisenroth_DEVTOX-GLR_legacy_Sox2 was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Deisenroth_DEVTOX-GLR_legacy_Sox2, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline activity. Using a type of inducible reporter, gain-of-signal activity can
be used to understand changes in gene expression. Furthermore, this assay endpoint can be referred to as a
secondary readout, because this assay has produced multiple assay endpoints where this one serves as a
measure of pluripotency or ectoderm differentiation. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the transcription factor intended target family, where the subfamily
is SRY-related HMG-box.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

-------
l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.
l.llAbbreviations:

AIC: Akaike Information Criterion
AOP: Adverse Outcome Pathway
CV: Coefficient of Variation

ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package
SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_legacy_Sox2 was designed to measure changes in cellular Sox2
expression. Changes are indicative of changes in Sox2 protein expression due to perturbations in endoderm
differentiation.

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
9

Standard minimum concentration tested:

Target (nominal) number of replicates:

5

Standard maximum concentration tested:

-------
10 nM
Key positive control:
NA

200 nM
Neutral vehicle control:
DMSO

Baseline median absolute deviation for the assay (bmad): 0.164
Response cutoff threshold used to determine hit calls: 0.491
Detection technology used: microscopy (microscopy)

2.6	Response: The DevTox GLR-Endo assay evaluates SOX17 protein expression as a biomarker for early endoderm
lineage commitment and differentiation during the gastrulation phase of embryogenesis. Well-level biomarker
expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by the sum of
total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each respective
biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of transcription factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 67

Number of chemicals tested: 67

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
31

Inactive hit count: Oihitc 0.9
36

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

8

9

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

4

2

quadratic-polynomialfpoly2) model: 4

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

24

1

1

13

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.387

Neutral control median absolute deviation, by plate: nmad	0.107

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	29.19%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 13.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3096

CCTE_Deisen roth_DEVTOX-G LR_legacy_Bra

1. General Information

1.1	Assay Title: (Legacy) CCTE's DevTox GLR-Endo Assay Evaluation of Brachyury (BRA) Protein Expression,
Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_legacy assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_legacy_Bra is one of 4 assay components calculated from the
CCTE_Deisenroth_DEVTOX-GLR_legacy assay that measures an increase in the percentage of cells in an
endoderm state. It is designed to make measurements of inducible reporter a form of fluorescent protein
induction as detected with optical fluorescence microscopy. Percentage Bra+ cells are determined by calculating
the number of Bra+ cells divided by cell count. A cell is Bra+ if the cell Bra mean intensity is 5x BMAD above the
pluripotent control median cell Bra mean. The assay endpoint was analyzed with bidirectional fitting with the
0.2% DMSO control median percent Bra in the DEVTOX-GLR assay subtracted. Data from the assay component
CCTE_Deisenroth_DEVTOX-GLR_legacy_Bra was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Deisenroth_DEVTOX-GLR_legacy_Bra, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline activity. Using a type of inducible reporter, gain-of-signal activity can
be used to understand changes in gene expression. Furthermore, this assay endpoint can be referred to as a
secondary readout, because this assay has produced multiple assay endpoints where this one serves as a
measure of mesoderm differentiation. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the transcription factor intended target family, where the subfamily is T-box protein.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

-------
AIC: Akaike Information Criterion
AOP: Adverse Outcome Pathway
CV: Coefficient of Variation

ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package
SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_legacy_Bra was designed to measure changes in cellular Bra
expression. Changes are indicative of changes in Bra gene expression due to perturbations in endoderm
differentiation

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:

9

Standard minimum concentration tested:

10	nM

Key positive control:

Target (nominal) number of replicates:

5

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 0.258
Response cutoff threshold used to determine hit calls: 0.774
Detection technology used: microscopy (microscopy)

2.6	Response: The DevTox GLR-Endo assay evaluates SOX17 protein expression as a biomarker for early endoderm
lineage commitment and differentiation during the gastrulation phase of embryogenesis. Well-level biomarker
expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by the sum of
total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each respective
biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of transcription factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval
component-specific corrections.)

= cval. No additional mc2 methods needed for

-------
Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 67	Number of chemicals tested: 67

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
37

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.834

Neutral control median absolute deviation, by plate: nmad	0.17

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	23.05%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 14.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3098

CCTE_Deisen roth_DEVTOX-G LR_legacy_Cel ICou nt

1. General Information

1.1	Assay Title: (Legacy) CCTE's DevTox GLR-Endo Assay Evaluation of Total Cell Counts, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_legacy assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_legacy_CellCount is one of 4 assay components calculated from
the CCTE_Deisenroth_DEVTOX-GLR_legacy assay that measures an increase in cell count. It is designed to make
measurements of nuclei count, a form of cell viability, as detected with optical fluorescence microscopy by
Perkin Elmer Harmony nuclei detection algorithm. The assay endpoints were analyzed with bidirectional fitting
relative to 0.2% DMSO control in the DEVTOX-GLR assay. Data from the assay component
CCTE_Deisenroth_DEVTOX-GLR_legacy_CellCount was analyzed into 2 assay endpoint. This assay endpoint,
CCTE_Deisenroth_DEVTOX-GLR_legacy_CellCount, was analyzed with bidirectional fitting relative to DMSO as
the negative control and baseline activity. Using nuclei counting, gain or loss-of-signal activity can be used to
understand changes in cell viability. Furthermore, this assay endpoint can be referred to as a primary readout,
because this assay has produced multiple assay endpoints where this one serves as a measure of cytotoxicity.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell cycle
intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	DMSO: Dimethyl Sulfoxide

AOP: Adverse Outcome Pathway	ToxCast: US EPA's Toxicity Forecaster Program

CV: Coefficient of Variation	tcpl: ToxCast Data Analysis Pipeline R Package

-------
SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_legacy_CellCount was designed to measure changes in the
number cells. Changes are indicative of cytotoxicity.

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:

9

Standard minimum concentration tested:

10	nM

Key positive control:

NA

Target (nominal) number of replicates:

5

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 6.292
Response cutoff threshold used to determine hit calls: 18.877

Detection technology used: Perkin Elmer Harmony nuclei detection algorithm (microscopy)

-------
2.6	Response: The DevTox GLR-Endo assay evaluates SOX17 protein expression as a biomarker for early endoderm
lineage commitment and differentiation during the gastrulation phase of embryogenesis. Well-level biomarker
expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by the sum of
total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each respective
biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 11:
bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID

-------
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive
control value (pval) to 0; pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 67	Number of chemicals tested: 67

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
38

29

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

8
2

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

6

9

quadratic-polynomialfpoly2) model: 8

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

3

1

15

14

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	4043

Neutral control median absolute deviation, by plate: nmad	223.502

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.49%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 14.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3161

CCTE_Deisenroth_H295R-HTRF_384WELL_ESTRADIOL

1.	General Information

1.1	Assay Title: CCTE's H295R-HTRF Assay Evaluation of Estradiol Concentration, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_H295R-HTRF_384WELL assay is a cell-based, multiplex-readout assay that
uses H295R, a human adrenocortical carcinoma cell line, with measurements taken at 48 hours after chemical
dosing in a 384-well microplate CCTE_Deisenroth_H295R-HTRF_384WELL_ESTRADIOL is one of 3 assay
components calculated in the CCTE_Deisenroth_H295R-HTRF_384WELL assay (2 hormones and 1 viability
components). It is designed to make measurements of hormone induction, a form of inducible reporter, as
detected with Homogenous Time Resolved Fluorescence immunoassay technology. This assay endpoint,
CCTE_Deisenroth_H295R-HTRF_384WELL_ESTRADIOL, was analyzed with bidirectional fitting relative to DMSO
as the negative control and baseline of activity. Using a type of inducible reporter, gain or loss-of-signal activity
using Homogenous Time Resolved Fluorescence technology was used to understand synthesis of estradiol in
the H295R cell line at 48hr of chemical exposure. To generalize the intended target to other relatable targets,
this assay endpoint is annotated to the steroid hormone intended target family, where the subfamily is
estrogens.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The H295R-HTRF assay (Garnovskaya 2023) was evaluated against a training set of 36
chemicals comprising reference chemicals from the OECD inter-laboratory validation study (Hecker 2011), U.S.
EPA Office of Chemical Safety and Pollution Prevention (OCSPP) Guideline 890.1200 aromatase assay (EPA
2011), and azole fungicides active in the HT-H295R assay (Karmaus 2011). The selected chemicals represent
diverse chemical structures and mechanisms amenable to evaluation of overall predictivity and performance.
Results were evaluated accordingly for quality control specifications, ability to detect proficiency chemicals, and
predictive performance across the chemical training set. Generally, all active chemicals in the training set
demonstrated equivalent or greater potency in the H295R-HTRF assay, supporting the sensitivity of the
platform. The adaptation of HTRF technology to the H295R model provides an efficient way to evaluate E2 and
T modulators in accordance with guideline specifications.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed,

l.llAbbreviations:

AIC: Akaike Information Criterion
AOP: Adverse Outcome Pathway
CV: Coefficient of Variation
DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in the fluorescence ratio are indicative of induction (gain of signal) or inhibition (loss of
signal) of steroid hormone biosynthesis.

and data are publicly available in ToxCast s invitroDB.

ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package
SSMD: Strictly Standardized Mean Difference

-------
This H295R-HTRF assay utilizes Homogenous Time Resolved Fluorescence (HTRF) immunoassay technology to
advance screening throughput in the H295R steroidogenesis assay by rapidly measuring E2 and T levels in a 384-
well format.

2.2	Scientific Principles: Changes in steroid hormone biosynthesis and metabolism (i.e., steroidogenesis) can alter
hormone levels, contributing to endocrine disruption that may result in impaired reproductive and sexual
development (Sanderson 2006, Sidorkiewicz 2017). Screening for altered steroidogenesis is an integral part of
the EDSP tiered testing paradigm where it is included in the Series 890 Tier 1 assay battery (EPA 2011). The assay
utilizes the NCI-H295R (H295R) adrenocortical carcinoma cell line which contains steroidogenic enzymes that
mediate the conversion of cholesterol to a series of steroid hormones and has been demonstrated to be a useful
in vitro model for steroidogenic pathways and processes (Hecker 2006, Gracia 2006, Hilscherova 2004,
Sanderson 2002). The cell line maintains features of zonally undifferentiated human fetal adrenal cells with the
capacity to produce steroid hormones found in both the adult adrenal cortex and the gonads including
androgens, estrogens, corticosteroids, and mineralocorticoids (Gazdar 1990). Both the Organization for
Economic Cooperation and Development (OECD) Test Guideline 456 (OECD TG 456) and the EPA series 890.1550
validated H295R assay evaluate the effect of test substances on the synthesis of E2 and T (OECD 20114, EPA
2017). In these protocols, cells seeded into 24-well plates are exposed for 48 hours to test chemical, and changes
in the concentration of E2 and T are measured in the culture medium. The EPA has recently developed a higher
throughput version of the H295R assay (HT-H295R) that is run in 96-well format and uses high-performance
liquid chromatography followed by tandem mass spectrometry (HPLC-MS/MS) analysis to quantify E2, T, and
nine other hormones (Karmaus 2016). The assay was used to screen 2,060 chemicals in the ToxCast inventory
for effects on steroid biosynthesis with 232 demonstrating concentration-dependent effects on E2 and T4. Novel
statistical methods have been employed to evaluate and prioritize compound effects based on the multi-
hormone outputs (Haggard 2019,2018), but there are still hundreds of additional data-poor chemicals predicted
to act as estrogen and androgen modulators that are potential candidates for steroidogenesis screening (Foster
2022). This H295R-HTRF assay utilizes Homogenous Time Resolved Fluorescence (HTRF) immunoassay
technology to further advance screening throughput by rapidly measuring E2 and T levels in a 384-well format.

2.3	Experimental System: adherent H295R cell-based used. The NCI-H295R adrenocortical carcinoma adherent cell
line is used in the steroidogenesis assay. Hormone analytes are measured with Homogenous Time Resolved
Fluorescence (HTRF); a type of competitive immunoassay technology that employs the principle of Fluorescence
Resonance Energy Transfer (FRET) using long-lived donor/acceptor fluorophore pairs that eliminate non-
specific, short-lived background fluorescence using time-resolved (TR) measurements (Degorce 2018). Anti-
hormone antibodies conjugated to a donor fluorophore (e.g., Europium Cryptate) bind to standardized hormone
conjugated to an acceptor fluorophore (e.g., XL665) to produce a TR-FRET emission signal. Disruption of the
interaction by non-labelled hormone present in the conditioned medium results in a loss-of-signal that is
inversely correlated to the concentration of hormone analyte in the medium. The technology is ideal for high-
throughput screening and eliminates the need for more labor-intensive analytical methodologies used in the
previous HT-H295R implementation.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Standard operating procedures for the cell culture medium, maintenance, expansion, and
cryopreservation of the NCI-H295R adrenocortical carcinoma adherent cell line were adopted directly from
guideline specifications7,13. The assay workflow follows a three-day protocol consistent with the paradigm
defined in OECD TG 456 and U.S. EPA OCSPP Guideline 890.1550. On day 0, cells are plated in 384-well
microplates and acclimated for 24 hours. On Day 1, test chemicals are dispensed into separate 384-well
microplates in a nine-point 2X dilution series (0.002 - 200 uM). Without aspirating the growth medium on the
assay plates, test chemical suspension is dispensed to achieve a final IX nominal concentration. Each chemical
is run in technical triplicate at a final dilution series of 0.001-100 uM. On Day 3, following a 48-hour incubation
period with test chemical, the conditioned medium samples are collected and transferred to 384-well sample
storage plates. HTRF direct immunoassay technology is employed to detect E2 and T analytes in H295R
conditioned medium. HTRF assays for E2 and T are performed with matched conditioned medium samples

-------
according to the manufacturer's protocol. Fluorescent endpoint measurements are conducted with a microplate
reader equipped with an HTRF-compatible filter set (ex: 320 (10) nM; em: 620 (10) and 665 (10) nM). A
multiplexed cell viability assay is implemented on the same plates used for chemical testing using the Promega
CellTiter-Glo 2.0 assay. In this homogenous assay format, cells are lysed in the presence of a thermostable
luciferase to generate a luminescent signal that is directly proportional to the amount of ATP present, providing
an indirect indication of cytotoxicity in the sample. After collecting conditioned medium samples on Day 3, the
viability assay is immediately performed and luminescence readings measured on a microplate reader. A total
of five experimental replicates is conducted for each chemical (n=5).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

9	17

Standard minimum concentration tested:	Standard maximum concentration tested:

0.00104 nM	100 nM

Key positive control:	Neutral vehicle control:

forskolin for gain; prochloraz for loss	DMSO

Baseline median absolute deviation for the assay (bmad): 0.067
Response cutoff threshold used to determine hit calls: 0.201

Detection technology used: Homogenous Time Resolved Fluorescence (Fluorescence)

2.6 Response: Homogenous Time Resolved Fluorescence (HTRF) immunoassay technology is used to measure 170-
estradiol (E2) and testosterone (T) synthesis. Cytotoxicity is evaluated with a viability assay that measures ATP
concentrations.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For steroid hormone measurements, well-level raw fluorescence unit (RFU) values are
normalized as a ratio according to the following equation: RFU Ratio = (Signal Em 665 nM/Signal Em 620 nM) X
10,000. RFU ratios are interpolated against a standard curve to derive the concentrations for E2 (ng/ml) and T
(nM), respectively. Testosterone concentrations are further converted to ng/ml to harmonize the measurement

-------
units between analytes and enable comparisons to historical H295R data. For cytotoxicity measurements, well-
level raw luminescence units (RLU) values are normalized to the median plate-based solvent control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 46: resp.incr.zerocenter.fc
(Calculate the normalized response (resp) as a zero centerfold change, i.e. the ratio of the the corrected
(cval) and baseline (bval) values minus 1; resp = cval/bval -1. Typically used for increasing responses.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical

-------
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 36

Number of chemicals tested: 36

Active hit count: hitc>0.9
17

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.526

Neutral control median absolute deviation, by plate: nmad	0.036

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	6.1%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	2.216

Negative control well median absolute deviation value, by plate: mmad	0.447

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	2.516

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 8.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Garnovskaya, M., Feshuk, M., Stewart, W., Friedman, K. P., Thomas, R. S., & Deisenroth, C. (2023).
Evaluation of a high-throughput H295R homogenous time resolved fluorescence assay for androgen and
estrogen steroidogenesis screening. Toxicology in vitro : an international journal published in association with
BIBRA, 92,105659. https://doi.Org/10.1016/j.tiv.2023.105659, Haggard DE, Karmaus AL, Martin MT, Judson RS,
Setzer RW, Paul Friedman K. High-Throughput H295R Steroidogenesis Assay: Utility as an Alternative and a
Statistical Approach to Characterize Effects on Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi:
10.1093/toxsci/kfx274. Erratum in: Toxicol Sci. 2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer
RW, Judson RS, Paul Friedman K. Development of a prioritization method for chemical-mediated effects on
steroidogenesis using an integrated statistical analysis of high-throughput H295R data. Regul Toxicol Pharmacol.
2019 Dec; 109:104510. doi: 10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319., Sanderson J. T.
(2006). The steroid hormone biosynthesis pathway as a target for endocrine-disrupting chemicals. Toxicological
sciences : an official journal of the Society of Toxicology, 94(1), 3-21. https://doi.org/10.1093/toxsci/kfl051,
Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical Effects on
Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016 Apr;150(2):323-32. doi:
10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454., Sidorkiewicz, I., Zariba, K.,

-------
Wotczyriski, S., & Czerniecki, J. (2017). Endocrine-disrupting chemicals-Mechanisms of action on male
reproductive system. Toxicology and industrial health, 33(7), 601-609.
https://doi.org/10.1177/0748233717695160, Degorce, F., Card, A., Soh, S., Trinquet, E., Knapik, G. P., & Xie, B.
(2009). HTRF: A technology tailored for drug discovery - a review of theoretical aspects and recent applications.
Current chemical genomics, 3, 22-32. https://doi.org/10.2174/1875397300903010022, Foster, M. J., Patlewicz,
G., Shah, I., Haggard, D. E., Judson, R. S., & Paul Friedman, K. (2022). Evaluating structure-based activity in a
high-throughput assay for steroid biosynthesis. Computational toxicology (Amsterdam, Netherlands), 24,1-23.
https://doi.Org/10.1016/j.comtox.2022.100245, EPA, U. S. Continuing development of alternative high-
throughput screens to determine endocrine disruption, focusing on androgen receptor, steroidogenesis, and
thyroid pathways. FIFRASAP, November 28-30 (2017)., Gazdar, A. F., Oie, H. K., Shackleton, C. H., Chen, T. R.,
Triche, T. J., Myers, C. E., Chrousos, G. P., Brennan, M. F., Stein, C. A., & La Rocca, R. V. (1990). Establishment
and characterization of a human adrenocortical carcinoma cell line that expresses multiple pathways of steroid
biosynthesis. Cancer research, 50(17), 5488-5496., OECD. Test No. 456: H295R Steroidogenesis Assay.
doi:https://doi.org/10.1787/9789264122642-en (2011)., Induction and inhibition of aromatase (CYP19) activity
by various classes of pesticides in H295R human adrenocortical carcinoma cells., Hilscherova, K., Jones, P. D.,
Gracia, T., Newsted, J. L., Zhang, X., Sanderson, J. T., Yu, R. M., Wu, R. S., & Giesy, J. P. (2004). Assessment of the
effects of chemicals on the expression often steroidogenic genes in the H295R cell line using real-time PCR.
Toxicological sciences an official journal of the Society of Toxicology, 81(1), 78-89.
https://doi.org/10.1093/toxsci/kfhl91, Gracia, T., Hilscherova, K., Jones, P. D., Newsted, J. L, Zhang, X., Hecker,
M., Higley, E. B., Sanderson, J. T., Yu, R. M., Wu, R. S., & Giesy, J. P. (2006). The H295R system for evaluation of
endocrine-disrupting effects. Ecotoxicology and environmental safety, 65(3), 293-305.
https://doi.Org/10.1016/j.ecoenv.2006.06.012, Hecker, M., Newsted, J. L, Murphy, M. B., Higley, E. B.,Jones, P.
D., Wu, R., & Giesy, J. P. (2006). Human adrenocarcinoma (H295R) cells for rapid in vitro determination of effects
on steroidogenesis: hormone production. Toxicology and applied pharmacology, 217(1), 114-124.
https://doi.Org/10.1016/j.taap.2006.07.007, EPA. Standard Evaluation Procedure (SEP) Steroidogenesis (Human
Cell Line - H295R) OCSPP Guideline 890.1550. (2011)., EPA. Standard Evaluation Procedure (SEP) Aromatase
Assay (Human Recombinant) OCSPP Guideline 890.1200. (2011)., Hecker, M., Hollert, H., Cooper, R., Vinggaard,
A. M., Akahori, Y., Murphy, M., Nellemann, C., Higley, E., Newsted, J., Laskey, J., Buckalew, A., Grund, S., Maletz,
S., Giesy, J., &Timm, G. (2011). The OECD validation program of the H295R steroidogenesis assay: Phase 3. Final
inter-laboratory validation study. Environmental science and pollution research international, 18(3), 503-515.
https://doi.org/10.1007/sll356-010-0396-x

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3162

CCTE_Deisenroth_H295R-HTRF_384WELL_TESTOSTERONE

1.	General Information

1.1	Assay Title: CCTE's H295R-HTRF Assay Evaluation of Testosterone Concentration, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_H295R-HTRF_384WELL assay is a cell-based, multiplex-readout assay that
uses H295R, a human adrenocortical carcinoma cell line, with measurements taken at 48 hours after chemical
dosing in a 384-well microplate CCTE_Deisenroth_H295R-HTRF_384WELL_TESTOSTERONE is one of 3 assay
components calculated in the CCTE_Deisenroth_H295R-HTRF_384WELL assay (2 hormones and 1 viability
components). It is designed to make measurements of hormone induction, a form of inducible reporter, as
detected with Homogenous Time Resolved Fluorescence immunoassay technology. This assay endpoint,
CCTE_Deisenroth_H295R-HTRF_384WELL_TESTOSTERONE, was analyzed with bidirectional fitting relative to
DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain or loss-of-signal
activity using Homogenous Time Resolved Fluorescence technology was used to understand synthesis of
testosterone in the H295R cell line at 48hr of chemical exposure. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the steroid hormone intended target family, where the
subfamily is androgens.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The H295R-HTRF assay (Garnovskaya 2023) was evaluated against a training set of 36
chemicals comprising reference chemicals from the OECD inter-laboratory validation study (Hecker 2011), U.S.
EPA Office of Chemical Safety and Pollution Prevention (OCSPP) Guideline 890.1200 aromatase assay (EPA
2011), and azole fungicides active in the HT-H295R assay (Karmaus 2011). The selected chemicals represent
diverse chemical structures and mechanisms amenable to evaluation of overall predictivity and performance.
Results were evaluated accordingly for quality control specifications, ability to detect proficiency chemicals, and
predictive performance across the chemical training set. Generally, all active chemicals in the training set
demonstrated equivalent or greater potency in the H295R-HTRF assay, supporting the sensitivity of the
platform. The adaptation of HTRF technology to the H295R model provides an efficient way to evaluate E2 and
T modulators in accordance with guideline specifications.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed,

l.llAbbreviations:

AIC: Akaike Information Criterion
AOP: Adverse Outcome Pathway
CV: Coefficient of Variation
DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in the fluorescence ratio are indicative of induction (gain of signal) or inhibition (loss of
signal) of steroid hormone biosynthesis.

and data are publicly available in ToxCast s invitroDB.

ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package
SSMD: Strictly Standardized Mean Difference

-------
This H295R-HTRF assay utilizes Homogenous Time Resolved Fluorescence (HTRF) immunoassay technology to
advance screening throughput in the H295R steroidogenesis assay by rapidly measuring E2 and T levels in a 384-
well format.

2.2	Scientific Principles: Changes in steroid hormone biosynthesis and metabolism (i.e., steroidogenesis) can alter
hormone levels, contributing to endocrine disruption that may result in impaired reproductive and sexual
development (Sanderson 2006, Sidorkiewicz 2017). Screening for altered steroidogenesis is an integral part of
the EDSP tiered testing paradigm where it is included in the Series 890 Tier 1 assay battery (EPA 2011). The assay
utilizes the NCI-H295R (H295R) adrenocortical carcinoma cell line which contains steroidogenic enzymes that
mediate the conversion of cholesterol to a series of steroid hormones and has been demonstrated to be a useful
in vitro model for steroidogenic pathways and processes (Hecker 2006, Gracia 2006, Hilscherova 2004,
Sanderson 2002). The cell line maintains features of zonally undifferentiated human fetal adrenal cells with the
capacity to produce steroid hormones found in both the adult adrenal cortex and the gonads including
androgens, estrogens, corticosteroids, and mineralocorticoids (Gazdar 1990). Both the Organization for
Economic Cooperation and Development (OECD) Test Guideline 456 (OECD TG 456) and the EPA series 890.1550
validated H295R assay evaluate the effect of test substances on the synthesis of E2 and T (OECD 20114, EPA
2017). In these protocols, cells seeded into 24-well plates are exposed for 48 hours to test chemical, and changes
in the concentration of E2 and T are measured in the culture medium. The EPA has recently developed a higher
throughput version of the H295R assay (HT-H295R) that is run in 96-well format and uses high-performance
liquid chromatography followed by tandem mass spectrometry (HPLC-MS/MS) analysis to quantify E2, T, and
nine other hormones (Karmaus 2016). The assay was used to screen 2,060 chemicals in the ToxCast inventory
for effects on steroid biosynthesis with 232 demonstrating concentration-dependent effects on E2 and T4. Novel
statistical methods have been employed to evaluate and prioritize compound effects based on the multi-
hormone outputs (Haggard 2019,2018), but there are still hundreds of additional data-poor chemicals predicted
to act as estrogen and androgen modulators that are potential candidates for steroidogenesis screening (Foster
2022). This H295R-HTRF assay utilizes Homogenous Time Resolved Fluorescence (HTRF) immunoassay
technology to further advance screening throughput by rapidly measuring E2 and T levels in a 384-well format.

2.3	Experimental System: adherent H295R cell-based used. The NCI-H295R adrenocortical carcinoma adherent cell
line is used in the steroidogenesis assay. Hormone analytes are measured with Homogenous Time Resolved
Fluorescence (HTRF); a type of competitive immunoassay technology that employs the principle of Fluorescence
Resonance Energy Transfer (FRET) using long-lived donor/acceptor fluorophore pairs that eliminate non-
specific, short-lived background fluorescence using time-resolved (TR) measurements (Degorce 2018). Anti-
hormone antibodies conjugated to a donor fluorophore (e.g., Europium Cryptate) bind to standardized hormone
conjugated to an acceptor fluorophore (e.g., XL665) to produce a TR-FRET emission signal. Disruption of the
interaction by non-labelled hormone present in the conditioned medium results in a loss-of-signal that is
inversely correlated to the concentration of hormone analyte in the medium. The technology is ideal for high-
throughput screening and eliminates the need for more labor-intensive analytical methodologies used in the
previous HT-H295R implementation.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Standard operating procedures for the cell culture medium, maintenance, expansion, and
cryopreservation of the NCI-H295R adrenocortical carcinoma adherent cell line were adopted directly from
guideline specifications7,13. The assay workflow follows a three-day protocol consistent with the paradigm
defined in OECD TG 456 and U.S. EPA OCSPP Guideline 890.1550. On day 0, cells are plated in 384-well
microplates and acclimated for 24 hours. On Day 1, test chemicals are dispensed into separate 384-well
microplates in a nine-point 2X dilution series (0.002 - 200 uM). Without aspirating the growth medium on the
assay plates, test chemical suspension is dispensed to achieve a final IX nominal concentration. Each chemical
is run in technical triplicate at a final dilution series of 0.001-100 uM. On Day 3, following a 48-hour incubation
period with test chemical, the conditioned medium samples are collected and transferred to 384-well sample
storage plates. HTRF direct immunoassay technology is employed to detect E2 and T analytes in H295R
conditioned medium. HTRF assays for E2 and T are performed with matched conditioned medium samples

-------
according to the manufacturer's protocol. Fluorescent endpoint measurements are conducted with a microplate
reader equipped with an HTRF-compatible filter set (ex: 320 (10) nM; em: 620 (10) and 665 (10) nM). A
multiplexed cell viability assay is implemented on the same plates used for chemical testing using the Promega
CellTiter-Glo 2.0 assay. In this homogenous assay format, cells are lysed in the presence of a thermostable
luciferase to generate a luminescent signal that is directly proportional to the amount of ATP present, providing
an indirect indication of cytotoxicity in the sample. After collecting conditioned medium samples on Day 3, the
viability assay is immediately performed and luminescence readings measured on a microplate reader. A total
of five experimental replicates is conducted for each chemical (n=5).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

9	17

Standard minimum concentration tested:	Standard maximum concentration tested:

0.00104 nM	100 nM

Key positive control:	Neutral vehicle control:

forskolin for gain; prochloraz for loss	DMSO

Baseline median absolute deviation for the assay (bmad): 0.131
Response cutoff threshold used to determine hit calls: 0.394

Detection technology used: Homogenous Time Resolved Fluorescence (Fluorescence)

2.6 Response: Homogenous Time Resolved Fluorescence (HTRF) immunoassay technology is used to measure 170-
estradiol (E2) and testosterone (T) synthesis. Cytotoxicity is evaluated with a viability assay that measures ATP
concentrations.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For steroid hormone measurements, well-level raw fluorescence unit (RFU) values are
normalized as a ratio according to the following equation: RFU Ratio = (Signal Em 665 nM/Signal Em 620 nM) X
10,000. RFU ratios are interpolated against a standard curve to derive the concentrations for E2 (ng/ml) and T
(nM), respectively. Testosterone concentrations are further converted to ng/ml to harmonize the measurement

-------
units between analytes and enable comparisons to historical H295R data. For cytotoxicity measurements, well-
level raw luminescence units (RLU) values are normalized to the median plate-based solvent control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 46: resp.incr.zerocenter.fc
(Calculate the normalized response (resp) as a zero centerfold change, i.e. the ratio of the the corrected
(cval) and baseline (bval) values minus 1; resp = cval/bval -1. Typically used for increasing responses.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical

-------
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 36

Number of chemicals tested: 36

Active hit count: hitc>0.9
16

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	3.514

Neutral control median absolute deviation, by plate: nmad	0.487

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.27%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	2.798

Negative control well median absolute deviation value, by plate: mmad	1.357

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.401

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 7.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Garnovskaya, M., Feshuk, M., Stewart, W., Friedman, K. P., Thomas, R. S., & Deisenroth, C. (2023).
Evaluation of a high-throughput H295R homogenous time resolved fluorescence assay for androgen and
estrogen steroidogenesis screening. Toxicology in vitro : an international journal published in association with
BIBRA, 92,105659. https://doi.Org/10.1016/j.tiv.2023.105659, Haggard DE, Karmaus AL, Martin MT, Judson RS,
Setzer RW, Paul Friedman K. High-Throughput H295R Steroidogenesis Assay: Utility as an Alternative and a
Statistical Approach to Characterize Effects on Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi:
10.1093/toxsci/kfx274. Erratum in: Toxicol Sci. 2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer
RW, Judson RS, Paul Friedman K. Development of a prioritization method for chemical-mediated effects on
steroidogenesis using an integrated statistical analysis of high-throughput H295R data. Regul Toxicol Pharmacol.
2019 Dec; 109:104510. doi: 10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319., Sanderson J. T.
(2006). The steroid hormone biosynthesis pathway as a target for endocrine-disrupting chemicals. Toxicological
sciences : an official journal of the Society of Toxicology, 94(1), 3-21. https://doi.org/10.1093/toxsci/kfl051,
Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical Effects on
Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016 Apr;150(2):323-32. doi:
10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454., Sidorkiewicz, I., Zariba, K.,

-------
Wotczyriski, S., & Czerniecki, J. (2017). Endocrine-disrupting chemicals-Mechanisms of action on male
reproductive system. Toxicology and industrial health, 33(7), 601-609.
https://doi.org/10.1177/0748233717695160, Degorce, F., Card, A., Soh, S., Trinquet, E., Knapik, G. P., & Xie, B.
(2009). HTRF: A technology tailored for drug discovery - a review of theoretical aspects and recent applications.
Current chemical genomics, 3, 22-32. https://doi.org/10.2174/1875397300903010022, Foster, M. J., Patlewicz,
G., Shah, I., Haggard, D. E., Judson, R. S., & Paul Friedman, K. (2022). Evaluating structure-based activity in a
high-throughput assay for steroid biosynthesis. Computational toxicology (Amsterdam, Netherlands), 24,1-23.
https://doi.Org/10.1016/j.comtox.2022.100245, EPA, U. S. Continuing development of alternative high-
throughput screens to determine endocrine disruption, focusing on androgen receptor, steroidogenesis, and
thyroid pathways. FIFRASAP, November 28-30 (2017)., Gazdar, A. F., Oie, H. K., Shackleton, C. H., Chen, T. R.,
Triche, T. J., Myers, C. E., Chrousos, G. P., Brennan, M. F., Stein, C. A., & La Rocca, R. V. (1990). Establishment
and characterization of a human adrenocortical carcinoma cell line that expresses multiple pathways of steroid
biosynthesis. Cancer research, 50(17), 5488-5496., OECD. Test No. 456: H295R Steroidogenesis Assay.
doi:https://doi.org/10.1787/9789264122642-en (2011)., Induction and inhibition of aromatase (CYP19) activity
by various classes of pesticides in H295R human adrenocortical carcinoma cells., Hilscherova, K., Jones, P. D.,
Gracia, T., Newsted, J. L., Zhang, X., Sanderson, J. T., Yu, R. M., Wu, R. S., & Giesy, J. P. (2004). Assessment of the
effects of chemicals on the expression often steroidogenic genes in the H295R cell line using real-time PCR.
Toxicological sciences an official journal of the Society of Toxicology, 81(1), 78-89.
https://doi.org/10.1093/toxsci/kfhl91, Gracia, T., Hilscherova, K., Jones, P. D., Newsted, J. L, Zhang, X., Hecker,
M., Higley, E. B., Sanderson, J. T., Yu, R. M., Wu, R. S., & Giesy, J. P. (2006). The H295R system for evaluation of
endocrine-disrupting effects. Ecotoxicology and environmental safety, 65(3), 293-305.
https://doi.Org/10.1016/j.ecoenv.2006.06.012, Hecker, M., Newsted, J. L, Murphy, M. B., Higley, E. B.,Jones, P.
D., Wu, R., & Giesy, J. P. (2006). Human adrenocarcinoma (H295R) cells for rapid in vitro determination of effects
on steroidogenesis: hormone production. Toxicology and applied pharmacology, 217(1), 114-124.
https://doi.Org/10.1016/j.taap.2006.07.007, EPA. Standard Evaluation Procedure (SEP) Steroidogenesis (Human
Cell Line - H295R) OCSPP Guideline 890.1550. (2011)., EPA. Standard Evaluation Procedure (SEP) Aromatase
Assay (Human Recombinant) OCSPP Guideline 890.1200. (2011)., Hecker, M., Hollert, H., Cooper, R., Vinggaard,
A. M., Akahori, Y., Murphy, M., Nellemann, C., Higley, E., Newsted, J., Laskey, J., Buckalew, A., Grund, S., Maletz,
S., Giesy, J., &Timm, G. (2011). The OECD validation program of the H295R steroidogenesis assay: Phase 3. Final
inter-laboratory validation study. Environmental science and pollution research international, 18(3), 503-515.
https://doi.org/10.1007/sll356-010-0396-x

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3163

CCTE_Deisenroth_H295R-HTRF_384WELL_CTOX

1.	General Information

1.1	Assay Title: Cytotoxicity Assessment in CCTE's H295R-HTRF Assay, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_H295R-HTRF_384WELL assay is a cell-based, multiplex-readout assay that
uses H295R, a human adrenocortical carcinoma cell line, with measurements taken at 48 hours after chemical
dosing in a 384-well microplate CCTE_Deisenroth_H295R-HTRF_384WELL_CTOX is one of 3 assay components
measured in the CCTE_Deisenroth_H295R-HTRF_384WELL assay (2 hormones and 1 viability components). It is
designed to make measurements of adenosine triphosphate (ATP), a form of viability reporter, as detected with
luminescence intensity signals by CellTiter-Glo 2.0 cytotoxicity assay technology. This assay endpoint,
CCTE_Deisenroth_H295R-HTRF_384WELL_CTOX, was analyzed in the positive fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of viability reporter, loss-of-signal activity using
CellTiter-Glo 2.0 technology was used to understand viability in the H295Rcell line at 48hr of chemical exposure.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell cycle
intended target family, where the subfamily is cytotoxicity

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The H295R-HTRF assay (Garnovskaya 2023) was evaluated against a training set of 36
chemicals comprising reference chemicals from the OECD inter-laboratory validation study (Hecker 2011), U.S.
EPA Office of Chemical Safety and Pollution Prevention (OCSPP) Guideline 890.1200 aromatase assay (EPA
2011), and azole fungicides active in the HT-H295R assay (Karmaus 2011). The selected chemicals represent
diverse chemical structures and mechanisms amenable to evaluation of overall predictivity and performance.
Results were evaluated accordingly for quality control specifications, ability to detect proficiency chemicals, and
predictive performance across the chemical training set. Generally, all active chemicals in the training set
demonstrated equivalent or greater potency in the H295R-HTRF assay, supporting the sensitivity of the
platform. The adaptation of HTRF technology to the H295R model provides an efficient way to evaluate E2 and
T modulators in accordance with guideline specifications.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in luminescence intensity, proportional to the amount of ATP present, are indicative of
cytotoxicity.

-------
This H295R-HTRF assay utilizes Homogenous Time Resolved Fluorescence (HTRF) immunoassay technology to
advance screening throughput in the H295R steroidogenesis assay by rapidly measuring E2 and T levels in a 384-
well format.

2.2	Scientific Principles: Changes in steroid hormone biosynthesis and metabolism (i.e., steroidogenesis) can alter
hormone levels, contributing to endocrine disruption that may result in impaired reproductive and sexual
development (Sanderson 2006, Sidorkiewicz 2017). Screening for altered steroidogenesis is an integral part of
the EDSP tiered testing paradigm where it is included in the Series 890 Tier 1 assay battery (EPA 2011). The assay
utilizes the NCI-H295R (H295R) adrenocortical carcinoma cell line which contains steroidogenic enzymes that
mediate the conversion of cholesterol to a series of steroid hormones and has been demonstrated to be a useful
in vitro model for steroidogenic pathways and processes (Hecker 2006, Gracia 2006, Hilscherova 2004,
Sanderson 2002). The cell line maintains features of zonally undifferentiated human fetal adrenal cells with the
capacity to produce steroid hormones found in both the adult adrenal cortex and the gonads including
androgens, estrogens, corticosteroids, and mineralocorticoids (Gazdar 1990). Both the Organization for
Economic Cooperation and Development (OECD) Test Guideline 456 (OECD TG 456) and the EPA series 890.1550
validated H295R assay evaluate the effect of test substances on the synthesis of E2 and T (OECD 20114, EPA
2017). In these protocols, cells seeded into 24-well plates are exposed for 48 hours to test chemical, and changes
in the concentration of E2 and T are measured in the culture medium. The EPA has recently developed a higher
throughput version of the H295R assay (HT-H295R) that is run in 96-well format and uses high-performance
liquid chromatography followed by tandem mass spectrometry (HPLC-MS/MS) analysis to quantify E2, T, and
nine other hormones (Karmaus 2016). The assay was used to screen 2,060 chemicals in the ToxCast inventory
for effects on steroid biosynthesis with 232 demonstrating concentration-dependent effects on E2 and T4. Novel
statistical methods have been employed to evaluate and prioritize compound effects based on the multi-
hormone outputs (Haggard 2019,2018), but there are still hundreds of additional data-poor chemicals predicted
to act as estrogen and androgen modulators that are potential candidates for steroidogenesis screening (Foster
2022). This H295R-HTRF assay utilizes Homogenous Time Resolved Fluorescence (HTRF) immunoassay
technology to further advance screening throughput by rapidly measuring E2 and T levels in a 384-well format.

2.3	Experimental System: adherent H295R cell-based used. The NCI-H295R adrenocortical carcinoma adherent cell
line is used in the steroidogenesis assay. Hormone analytes are measured with Homogenous Time Resolved
Fluorescence (HTRF); a type of competitive immunoassay technology that employs the principle of Fluorescence
Resonance Energy Transfer (FRET) using long-lived donor/acceptor fluorophore pairs that eliminate non-
specific, short-lived background fluorescence using time-resolved (TR) measurements (Degorce 2018). Anti-
hormone antibodies conjugated to a donor fluorophore (e.g., Europium Cryptate) bind to standardized hormone
conjugated to an acceptor fluorophore (e.g., XL665) to produce a TR-FRET emission signal. Disruption of the
interaction by non-labelled hormone present in the conditioned medium results in a loss-of-signal that is
inversely correlated to the concentration of hormone analyte in the medium. The technology is ideal for high-
throughput screening and eliminates the need for more labor-intensive analytical methodologies used in the
previous HT-H295R implementation.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Standard operating procedures for the cell culture medium, maintenance, expansion, and
cryopreservation of the NCI-H295R adrenocortical carcinoma adherent cell line were adopted directly from
guideline specifications (EPA 2011, OECD 2011). The assay workflow follows a three-day protocol consistent
with the paradigm defined in OECD TG 456 and U.S. EPA OCSPP Guideline 890.1550. On day 0, cells are plated
in 384-well microplates and acclimated for 24 hours. On Day 1, test chemicals are dispensed into separate 384-
well microplates in a nine-point 2X dilution series (0.002 - 200 uM). Without aspirating the growth medium on
the assay plates, test chemical suspension is dispensed to achieve a final IX nominal concentration. Each
chemical is run in technical triplicate at a final dilution series of 0.001 - 100 uM. On Day 3, following a 48-hour
incubation period with test chemical, the conditioned medium samples are collected and transferred to 384-
well sample storage plates. HTRF direct immunoassay technology is employed to detect E2 and T analytes in
H295R conditioned medium. HTRF assays for E2 and T are performed with matched conditioned medium

-------
samples according to the manufacturer's protocol. Fluorescent endpoint measurements are conducted with a
microplate reader equipped with an HTRF-compatible filter set (ex: 320 (10) nM; em: 620 (10) and 665 (10) nM).
A multiplexed cell viability assay is implemented on the same plates used for chemical testing using the Promega
CellTiter-Glo 2.0 assay. In this homogenous assay format, cells are lysed in the presence of a thermostable
luciferase to generate a luminescent signal that is directly proportional to the amount of ATP present, providing
an indirect indication of cytotoxicity in the sample. After collecting conditioned medium samples on Day 3, the
viability assay is immediately performed and luminescence readings measured on a microplate reader. A total
of five experimental replicates is conducted for each chemical (n=5).

Baseline median absolute deviation for the assay (bmad): 0.092
Response cutoff threshold used to determine hit calls: 0.277
Detection technology used: CellTiter-Glo 2.0 (Luminescence)

2.6	Response: Homogenous Time Resolved Fluorescence (HTRF) immunoassay technology is used to measure 170-
estradiol (E2) and testosterone (T) synthesis. Cytotoxicity is evaluated with a viability assay that measures ATP
concentrations.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For steroid hormone measurements, well-level raw fluorescence unit (RFU) values are
normalized as a ratio according to the following equation: RFU Ratio = (Signal Em 665 nM/Signal Em 620 nM) X
10,000. RFU ratios are interpolated against a standard curve to derive the concentrations for E2 (ng/ml) and T
(nM), respectively. Testosterone concentrations are further converted to ng/ml to harmonize the measurement

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
9

Standard minimum concentration tested:

0.00104 nM
Key positive control:

NA

Target (nominal) number of replicates:

16

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
units between analytes and enable comparisons to historical H295R data. For cytotoxicity measurements, well-
level raw luminescence units (RLU) values are normalized to the median plate-based solvent control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 46: resp.incr.zerocenter.fc
(Calculate the normalized response (resp) as a zero centerfold change, i.e. the ratio of the the corrected
(cval) and baseline (bval) values minus 1; resp = cval/bval -1. Typically used for increasing responses.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested

-------
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 36

Number of chemicals tested: 36

Active hit count: hitc>0.9
5

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed

82446.5
9071.288
10.21%

NA
NA

Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	83163.25

Negative control well median absolute deviation value, by plate: mmad	2894.035

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.011

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

-------
(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 6.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Garnovskaya, M., Feshuk, M., Stewart, W., Friedman, K. P., Thomas, R. S., & Deisenroth, C. (2023).
Evaluation of a high-throughput H295R homogenous time resolved fluorescence assay for androgen and
estrogen steroidogenesis screening. Toxicology in vitro : an international journal published in association with
BIBRA, 92,105659. https://doi.Org/10.1016/j.tiv.2023.105659, Haggard DE, Karmaus AL, Martin MT, Judson RS,
Setzer RW, Paul Friedman K. High-Throughput H295R Steroidogenesis Assay: Utility as an Alternative and a
Statistical Approach to Characterize Effects on Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi:
10.1093/toxsci/kfx274. Erratum in: Toxicol Sci. 2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer
RW, Judson RS, Paul Friedman K. Development of a prioritization method for chemical-mediated effects on
steroidogenesis using an integrated statistical analysis of high-throughput H295R data. Regul Toxicol Pharmacol.
2019 Dec; 109:104510. doi: 10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319., Sanderson J. T.
(2006). The steroid hormone biosynthesis pathway as a target for endocrine-disrupting chemicals. Toxicological

-------
sciences : an official journal of the Society of Toxicology, 94(1), 3-21. https://doi.org/10.1093/toxsci/kfl051,
Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical Effects on
Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016 Apr;150(2):323-32. doi:
10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454., Sidorkiewicz, I., Zariba, K.,
Wotczynski, S., & Czerniecki, J. (2017). Endocrine-disrupting chemicals-Mechanisms of action on male
reproductive system. Toxicology and industrial health, 33(7), 601-609.
https://doi.org/10.1177/0748233717695160, Degorce, F., Card, A., Soh, S., Trinquet, E., Knapik, G. P., & Xie, B.
(2009). HTRF: A technology tailored for drug discovery - a review of theoretical aspects and recent applications.
Current chemical genomics, 3, 22-32. https://doi.org/10.2174/1875397300903010022, Foster, M. J., Patlewicz,
G., Shah, I., Haggard, D. E., Judson, R. S., & Paul Friedman, K. (2022). Evaluating structure-based activity in a
high-throughput assay for steroid biosynthesis. Computational toxicology (Amsterdam, Netherlands), 24,1-23.
https://doi.Org/10.1016/j.comtox.2022.100245, EPA, U. S. Continuing development of alternative high-
throughput screens to determine endocrine disruption, focusing on androgen receptor, steroidogenesis, and
thyroid pathways. FIFRASAP, November 28-30 (2017)., Gazdar, A. F., Oie, H. K., Shackleton, C. H., Chen, T. R.,
Triche, T. J., Myers, C. E., Chrousos, G. P., Brennan, M. F., Stein, C. A., & La Rocca, R. V. (1990). Establishment
and characterization of a human adrenocortical carcinoma cell line that expresses multiple pathways of steroid
biosynthesis. Cancer research, 50(17), 5488-5496., OECD. Test No. 456: H295R Steroidogenesis Assay.
doi:https://doi.org/10.1787/9789264122642-en (2011)., Induction and inhibition of aromatase (CYP19) activity
by various classes of pesticides in H295R human adrenocortical carcinoma cells., Hilscherova, K., Jones, P. D.,
Gracia, T., Newsted, J. L, Zhang, X., Sanderson, J. T., Yu, R. M., Wu, R. S., & Giesy, J. P. (2004). Assessment of the
effects of chemicals on the expression often steroidogenic genes in the H295R cell line using real-time PCR.
Toxicological sciences an official journal of the Society of Toxicology, 81(1), 78-89.
https://doi.org/10.1093/toxsci/kfhl91, Gracia, T., Hilscherova, K., Jones, P. D., Newsted, J. L, Zhang, X., Hecker,
M., Higley, E. B., Sanderson, J. T., Yu, R. M., Wu, R. S., & Giesy, J. P. (2006). The H295R system for evaluation of
endocrine-disrupting effects. Ecotoxicology and environmental safety, 65(3), 293-305.
https://doi.Org/10.1016/j.ecoenv.2006.06.012, Hecker, M., Newsted, J. L, Murphy, M. B., Higley, E. B., Jones, P.
D., Wu, R., & Giesy, J. P. (2006). Human adrenocarcinoma (H295R) cells for rapid in vitro determination of effects
on steroidogenesis: hormone production. Toxicology and applied pharmacology, 217(1), 114-124.
https://doi.Org/10.1016/j.taap.2006.07.007, EPA. Standard Evaluation Procedure (SEP) Steroidogenesis (Human
Cell Line - H295R) OCSPP Guideline 890.1550. (2011)., EPA. Standard Evaluation Procedure (SEP) Aromatase
Assay (Human Recombinant) OCSPP Guideline 890.1200. (2011)., Hecker, M., Hollert, H., Cooper, R., Vinggaard,
A. M., Akahori, Y., Murphy, M., Nellemann, C., Higley, E., Newsted, J., Laskey, J., Buckalew, A., Grund, S., Maletz,
S., Giesy, J., &Timm, G. (2011). The OECD validation program of the H295R steroidogenesis assay: Phase 3. Final
inter-laboratory validation study. Environmental science and pollution research international, 18(3), 503-515.
https://doi.org/10.1007/sll356-010-0396-x

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3223

CCTE_Deisenroth_DEVTOX-GLR_Endo_SOX17

1. General Information

1.1	Assay Title: CCTE's DevTox GLR-Endo Assay Evaluation of SRY-box transcription factor 17 (SOX17) Protein
Expression, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_Endo assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_Endo_SOX17 is 1 of 4 assay components calculated from the
CCTE_Deisenroth_DEVTOX-GLR_Endo assay and it measures the percentage of cells in an endoderm state. Using
a type of inducible fluorescent protein reporter, gain-of-signal activity can be used to understand changes in
gene expression with fluorescent microscopy. The percentage of SOX17+ cells are determined by calculating the
number of SOX17+ cells divided by the cell count. To generalize the intended target to other relatable targets,
this assay component is annotated to the transcription factor intended target family, where the subfamily is
SRY-related HMG-box. Data from the assay component CCTE_Deisenroth_DEVTOX-GLR_Endo_SOX17 was
analyzed as 1 assay endpoint. The endpoint is the percent change in SOX17+ cell population relative to the 0.2%
DMSO control in the CCTE_Deisenroth_DEVTOX-GLR_Endo assay. This assay endpoint can be referred to as a
primary readout since this assay has produced multiple assay endpoints where this one serves as a measure of
endoderm differentiation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	DMSO: Dimethyl Sulfoxide

AOP: Adverse Outcome Pathway	ToxCast: US EPA's Toxicity Forecaster Program

CV: Coefficient of Variation	tcpl: ToxCast Data Analysis Pipeline R Package

-------
SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_Endo_SOX17 was designed to measure changes in cellular SOX17
expression. Changes are indicative of changes in SOX17 gene expression due to perturbations in endoderm
differentiation.

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
9

Standard minimum concentration tested:

9.98e-07 nM
Key positive control:

NA

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 6.191
Response cutoff threshold used to determine hit calls: 18.572

-------
Detection technology used: microscopy (microscopy)

2.6	Response: The DevTox GLR-Endo assay evaluates SOX17 protein expression as a biomarker for early endoderm
lineage commitment and differentiation during the gastrulation phase of embryogenesis. Well-level biomarker
expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by the sum of
total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each respective
biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of transcription factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

21: agg.median.rep.apid (Aggregate technical replicates by taking the plate-wise median per sample id
and concentration index.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive

-------
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 66	Number of chemicals tested: 66

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
39

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	62.546

Neutral control median absolute deviation, by plate: nmad	4.076

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	6.6%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	67.334

Positive control well median absolute deviation, by plate: pmad	4.118

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	1.025

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 13.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3224

CCTE_Deisenroth_DEVTOX-GLR_Endo_SOX2

1. General Information

1.1	Assay Title: CCTE's DevTox GLR-Endo Assay Evaluation of SRY-box transcription factor 2 (SOX2) Protein
Expression, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_Endo assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_Endo_SOX2 is 1 of 4 assay components calculated from the
CCTE_Deisenroth_DEVTOX-GLR_Endo assay and it measures the percentage of cells in an ectoderm state. Using
a type of inducible fluorescent protein reporter, gain-of-signal activity can be used to understand changes in
gene expression with fluorescent microscopy. The percentage SOX2+ cells are determined by calculating the
number of SOX2+ cells divided by the cell count. To generalize the intended target to other relatable targets,
this assay component is annotated to the transcription factor intended target family, where the subfamily is
SRY-related HMG-box. Data from the assay component CCTE_Deisenroth_DEVTOX-GLR_Endo_SOX2 was
analyzed as 1 assay endpoint. The endpoint is the percent SOX2+ cell population minus the 0.2% DMSO control
in the CCTE_Deisenroth_DEVTOX-GLR_Endo assay. Furthermore, this assay endpoint can be referred to as a
secondary readout, because this assay has produced multiple assay endpoints where this one serves as a
measure of ectoderm differentiation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	DMSO: Dimethyl Sulfoxide

AOP: Adverse Outcome Pathway	ToxCast: US EPA's Toxicity Forecaster Program

CV: Coefficient of Variation	tcpl: ToxCast Data Analysis Pipeline R Package

-------
SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_Endo_SOX2 was designed to measure changes in cellular SOX2
expression. Changes are indicative of changes in SOX2 protein expression due to perturbations in endoderm
differentiation.

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
9

Standard minimum concentration tested:

9.98e-07 nM
Key positive control:

NA

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.444
Response cutoff threshold used to determine hit calls: 1.331

-------
Detection technology used: microscopy (microscopy)

2.6	Response: The DevTox GLR-Endo assay evaluates SOX17 protein expression as a biomarker for early endoderm
lineage commitment and differentiation during the gastrulation phase of embryogenesis. Well-level biomarker
expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by the sum of
total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each respective
biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of transcription factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

21: agg.median.rep.apid (Aggregate technical replicates by taking the plate-wise median per sample id
and concentration index.)

Level 3: Endpoint-specific normalization include:

11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 35: resp.logfc (Calculate the

-------
normalized response (resp) as the fold change of logged, i.e. the difference between corrected (cval) and
baseline (bval) log-scale values.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 66	Number of chemicals tested: 66

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
30

10

26

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

6

15

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

8

1

quadratic-polynomialfpoly2) model: 6

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

18

12

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	3.189

Neutral control median absolute deviation, by plate: nmad	0.433

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	14.06%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	5.759

Positive control well median absolute deviation, by plate: pmad	0.773

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	2.721

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 12.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3225

CCTE_Deisenroth_DEVTOX-GLR_Endo_BRA

1. General Information

1.1	Assay Title: CCTE's DevTox GLR-Endo Assay Evaluation of Brachyury (BRA) Protein Expression, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_Endo assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_Endo_BRA is 1 of 4 assay components calculated from the
CCTE_Deisenroth_DEVTOX-GLR_Endo assay and it measures the percentage of cells in an mesoderm state.
Using a type of inducible fluorescent protein reporter, gain-of-signal activity can be used to understand changes
in gene expression with fluorescent microscopy. The percentage of BRA+ cells are determined by calculating the
number of BRA+ cells divided by the cell count. To generalize the intended target to other relatable targets, this
assay component is annotated to the transcription factor intended target family, where the subfamily is T-box
protein. Data from the assay component CCTE_Deisenroth_DEVTOX-GLR_Endo_BRA was analyzed as 1 assay
endpoint. The endpoint is the percent BRA+ cell population minus the 0.2% DMSO control in the
CCTE_Deisenroth_DEVTOX-GLR_Endo assay. Furthermore, this assay endpoint can be referred to as a secondary
readout, because this assay has produced multiple assay endpoints where this one serves as a measure of
mesoderm differentiation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_Endo_BRA was designed to measure changes in cellular Bra
expression. Changes are indicative of changes in Bra gene expression due to perturbations in endoderm
differentiation

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
9

Standard minimum concentration tested:

9.98e-07 nM
Key positive control:

NA

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.328
Response cutoff threshold used to determine hit calls: 0.984
Detection technology used: microscopy (microscopy)

-------
2.6	Response: The DevTox GLR-Endo assay evaluates SOX17 protein expression as a biomarker for early endoderm
lineage commitment and differentiation during the gastrulation phase of embryogenesis. Well-level biomarker
expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by the sum of
total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each respective
biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of transcription factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

21: agg.median.rep.apid (Aggregate technical replicates by taking the plate-wise median per sample id
and concentration index.)

Level 3: Endpoint-specific normalization include:

11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 35: resp.logfc (Calculate the
normalized response (resp) as the fold change of logged, i.e. the difference between corrected (cval) and
baseline (bval) log-scale values.)

-------
Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 66	Number of chemicals tested: 66

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
41

Inactive hit count: 0
-------
Number of sample-assay endpoints with winning hill model:

6

11

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

4

4

quadratic-polynomialfpoly2) model: 5

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

2

18

16

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

-------
NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.325

Neutral control median absolute deviation, by plate: nmad	0.281

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	28.16%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	1.434

Positive control well median absolute deviation, by plate: pmad	0.442

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	0.356

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 16.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)

-------
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

-------
researcli/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3226

CCTE_Deisen roth_DEVTOX-G LR_Endo_CellCou nt

1. General Information

1.1	Assay Title: CCTE's DevTox GLR-Endo Assay Evaluation of Total Cell Counts, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_Endo assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_Endo_CellCount is 1 of 4 assay components calculated from the
CCTE_Deisenroth_DEVTOX-GLR_Endo assay and it measures cell count. Using fluorescent nuclear staining, loss-
of-signal activity can be used to understand changes in cell viability with fluorescent microscopy. It is designed
to make measurements of nuclei counts as a surrogate for cell number, as detected with optical fluorescence
microscopy by Perkin Elmer Harmony nuclei detection algorthm. To generalize the intended target to other
relatable targets, this assay component is annotated to the cell cycle intended target family, where the
subfamily is cytotoxicity. Data from the assay component CCTE_Deisenroth_DEVTOX-GLR_Endo_CellCount was
analyzed as 1 assay endpoint. The endpoint the percent change in cell count relative to the 0.2% DMSO control
in the CCTE_Deisenroth_DEVTOX-GLR_Endo assay. Furthermore, this assay endpoint can be referred to as a
primary readout, because this assay has produced multiple assay endpoints where this one serves as a measure
of cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_Endo_CellCount was designed to measure changes in the number
of cells where a decrease is indicative of cytotoxicity.

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
9

Standard minimum concentration tested:

9.98e-07 nM
Key positive control:

NA

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 6.512
Response cutoff threshold used to determine hit calls: 19.535

Detection technology used: Perkin Elmer Harmony nuclei detection algorthm (microscopy)

-------
2.6	Response: The DevTox GLR-Endo assay evaluates SOX17 protein expression as a biomarker for early endoderm
lineage commitment and differentiation during the gastrulation phase of embryogenesis. Well-level biomarker
expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by the sum of
total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each respective
biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

21: agg.median.rep.apid (Aggregate technical replicates by taking the plate-wise median per sample id
and concentration index.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med

-------
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 66	Number of chemicals tested: 66

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
38

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	4108.75

Neutral control median absolute deviation, by plate: nmad	249.818

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	6.14%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	3966

Positive control well median absolute deviation, by plate: pmad	305.786

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.152

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 16.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3227

CCTE_Deisenroth_DEVTOX-GLR_Meso_SOX17

1. General Information

1.1	Assay Title: CCTE's DevTox GLR-Meso Assay Evaluation of SRY-box transcription factor 17 (SOX17) Protein
Expression, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_Meso assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_Meso_SOX17 is 1 of 4 assay components calculated from the
CCTE_Deisenroth_DEVTOX-GLR_Meso assay and it measures an the percentage of cells in an endoderm state.
Using a type of inducible fluorescent protein reporter, gain-of-signal activity can be used to understand changes
in gene expression with fluorescent microscopy. The percentage of SOX17+ cells are determined by calculating
the number of SOX17+ cells divided by cell count. To generalize the intended target to other relatable targets,
this assay component is annotated to the transcription factor intended target family, where the subfamily is
SRY-related HMG-box. Data from the assay component CCTE_Deisenroth_DEVTOX-GLR_Meso_SOX17 was
analyzed as 1 assay endpoint. The endpoint is the percent SOX17+ cell population minus the 0.2% DMSO control
in the CCTE_Deisenroth_DEVTOX-GLR_Meso assay. This assay endpoint can be referred to as a secondary
readout since this assay has produced multiple assay endpoints where this one serves as a measure of endoderm
differentiation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	DMSO: Dimethyl Sulfoxide

AOP: Adverse Outcome Pathway	ToxCast: US EPA's Toxicity Forecaster Program

CV: Coefficient of Variation	tcpl: ToxCast Data Analysis Pipeline R Package

-------
SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_Meso_SOX17 was designed to measure changes in cellular SOX17
expression. Changes are indicative of changes in SOX17 gene expression due to perturbations in mesoderm
differentiation.

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

9.98e-05 nM
Key positive control:

NA

Target (nominal) number of replicates:

8

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.077
Response cutoff threshold used to determine hit calls: 0.232

-------
Detection technology used: microscopy (microscopy)

2.6	Response: The DevTox GLR-Meso assay evaluates BRA protein expression as a biomarker for early mesoderm
lineage commitment and differentiation during the gastrulation phase of embryogenesis. Well-level biomarker
expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by the sum of
total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each respective
biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of transcription factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

21: agg.median.rep.apid (Aggregate technical replicates by taking the plate-wise median per sample id
and concentration index.)

Level 3: Endpoint-specific normalization include:

11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 35: resp.logfc (Calculate the

-------
normalized response (resp) as the fold change of logged, i.e. the difference between corrected (cval) and
baseline (bval) log-scale values.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 16	Number of chemicals tested: 16

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
9

5

2

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

1

2

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

4

quadratic-polynomialfpoly2) model: 3

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

4

2

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.205

Neutral control median absolute deviation, by plate: nmad	0.097

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	51.06%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	0.23

Positive control well median absolute deviation, by plate: pmad	0.091

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	0.197

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 4.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3228

CCTE_Deisenroth_DEVTOX-GLR_Meso_SOX2

1. General Information

1.1	Assay Title: CCTE's DevTox GLR-Meso Assay Evaluation of SRY-box transcription factor 2 (SOX2) Protein
Expression, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_Meso assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_Meso_SOX2 is 1 of 4 assay components calculated from the
CCTE_Deisenroth_DEVTOX-GLR_Meso assay and it measures an the percentage of cells in an ectoderm state.
Using a type of inducible fluorescent protein reporter, gain-of-signal activity can be used to understand changes
in gene expression with fluorescent microscopy. The percentage of SOX2+ cells are determined by calculating
the number of SOX2+ cells divided by the cell count. To generalize the intended target to other relatable targets,
this assay component is annotated to the transcription factor intended target family, where the subfamily is
SRY-related HMG-box. Data from the assay component CCTE_Deisenroth_DEVTOX-GLR_Meso_SOX2 was
analyzed as 1 assay endpoint. The endpoint is the percent SOX2+ cell population minus the 0.2% DMSO control
in the CCTE_Deisenroth_DEVTOX-GLR_Meso assay. Furthermore, this assay endpoint can be referred to as a
secondary readout, because this assay has produced multiple assay endpoints where this one serves as a
measure of ectoderm differentiation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	DMSO: Dimethyl Sulfoxide

AOP: Adverse Outcome Pathway	ToxCast: US EPA's Toxicity Forecaster Program

CV: Coefficient of Variation	tcpl: ToxCast Data Analysis Pipeline R Package

-------
SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_Meso_SOX2 was designed to measure changes in cellular SOX2
expression. Changes are indicative of changes in SOX2 protein expression due to perturbations in mesoderm
differentiation.

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

9.98e-05 nM
Key positive control:

NA

Target (nominal) number of replicates:

8

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.25
Response cutoff threshold used to determine hit calls: 0.751

-------
Detection technology used: microscopy (microscopy)

2.6	Response: The DevTox GLR-Meso assay evaluates BRA protein expression as a biomarker for early mesoderm
lineage commitment and differentiation during the gastrulation phase of embryogenesis. Well-level biomarker
expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by the sum of
total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each respective
biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of transcription factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

21: agg.median.rep.apid (Aggregate technical replicates by taking the plate-wise median per sample id
and concentration index.)

Level 3: Endpoint-specific normalization include:

11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 35: resp.logfc (Calculate the

-------
normalized response (resp) as the fold change of logged, i.e. the difference between corrected (cval) and
baseline (bval) log-scale values.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 16	Number of chemicals tested: 16

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
9

6

1

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

2
1

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

2

quadratic-polynomialfpoly2) model:	1

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

5

3

1

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.637

Neutral control median absolute deviation, by plate: nmad	0.249

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	42.98%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	0.702

Positive control well median absolute deviation, by plate: pmad	0.203

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.129

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 3.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3229

CCTE_Deisen roth_DEVTOX-G LR_Meso_BRA

1. General Information

1.1	Assay Title: CCTE's DevTox GLR-Meso Assay Evaluation of Brachyury (BRA) Protein Expression, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_Meso assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_Meso_BRA is 1 of 4 assay components calculated from the
CCTE_Deisenroth_DEVTOX-GLR_Meso assay and it measures an the percentage of cells in an mesoderm state.
Using a type of inducible fluorescent protein reporter, gain-of-signal activity can be used to understand changes
in gene expression with fluorescent microscopy. The percentage of BRA+ cells are determined by calculating the
number of BRA+ cells divided by the cell count. To generalize the intended target to other relatable targets, this
assay component is annotated to the transcription factor intended target family, where the subfamily is T-box
protein. Data from the assay component CCTE_Deisenroth_DEVTOX-GLR_Meso_BRA was analyzed as 1 assay
endpoint. The endpoint is the percent change in BRA+ cell population relative to the 0.2% DMSO control in the
CCTE_Deisenroth_DEVTOX-GLR_Meso assay. Furthermore, this assay endpoint can be referred to as a primary
readout, because this assay has produced multiple assay endpoints where this one serves as a measure of
mesoderm differentiation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_Meso_BRA was designed to measure changes in cellular Bra
expression. Changes are indicative of changes in Bra gene expression due to perturbations in mesoderm
differentiation

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

9.98e-05 nM
Key positive control:

NA

Target (nominal) number of replicates:

8

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 6.436
Response cutoff threshold used to determine hit calls: 19.307
Detection technology used: microscopy (microscopy)

-------
2.6	Response: The DevTox GLR-Meso assay evaluates BRA protein expression as a biomarker for early mesoderm
lineage commitment and differentiation during the gastrulation phase of embryogenesis. Well-level biomarker
expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by the sum of
total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each respective
biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of transcription factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

21: agg.median.rep.apid (Aggregate technical replicates by taking the plate-wise median per sample id
and concentration index.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med

-------
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 16	Number of chemicals tested: 16

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1

Inactive hit count: 0
-------
WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

1
3

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

2

3

quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

5

2

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

-------
4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	87.501

Neutral control median absolute deviation, by plate: nmad	5.739

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	6.41%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	87.501

Positive control well median absolute deviation, by plate: pmad	4.943

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	0.11

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity

-------
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3230

CCTE_Deisen roth_DEVTOX-G LR_Meso_CellCou nt

1. General Information

1.1	Assay Title: CCTE's DevTox GLR-Meso Assay Evaluation of Total Cell Counts, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_Meso assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_Meso_CellCount is 1 of 4 assay components calculated from
the CCTE_Deisenroth_DEVTOX-GLR_Meso assay and it measures cell count. Using fluorescent nuclei staining,
gain-of-signal activity can be used to understand changes in cell viability with fluorescent microscopy. It is
designed to make measurements of nuclei counts as a surrogate for cell number, as detected with optical
fluorescence microscopy by Perkin Elmer Harmony nuclei detection algorthm. To generalize the intended target
to other relatable targets, this assay component is annotated to the cell cycle intended target family, where the
subfamily is cytotoxicity. Data from the assay component CCTE_Deisenroth_DEVTOX-GLR_Meso_CellCount was
analyzed as 1 assay endpoint. The endpoint the percent change in cell count relative to the 0.2% DMSO control
in the CCTE_Deisenroth_DEVTOX-GLR_Meso assay. Furthermore, this assay endpoint can be referred to as a
primary readout, because this assay has produced multiple assay endpoints where this one serves as a measure
of cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_Meso_CellCount was designed to measure changes in the
number of cells where a decrease is indicative of cytotoxicity.

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

9.98e-05 nM
Key positive control:

NA

Target (nominal) number of replicates:

8

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 7.395
Response cutoff threshold used to determine hit calls: 22.186

Detection technology used: Perkin Elmer Harmony nuclei detection algorthm (microscopy)

-------
2.6	Response: The DevTox GLR-Meso assay evaluates BRA protein expression as a biomarker for early mesoderm
lineage commitment and differentiation during the gastrulation phase of embryogenesis. Well-level biomarker
expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by the sum of
total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each respective
biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

21: agg.median.rep.apid (Aggregate technical replicates by taking the plate-wise median per sample id
and concentration index.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med

-------
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 16	Number of chemicals tested: 16

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
8

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	5929

Neutral control median absolute deviation, by plate: nmad	402.897

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	6.9%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	5788.75

Positive control well median absolute deviation, by plate: pmad	452.564

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.35

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3231

CCTE_Deisenroth_DEVTOX-GLR_Ecto_SOX17

1. General Information

1.1	Assay Title: CCTE's DevTox GLR-Ecto Assay Evaluation of SRY-box transcription factor 17 (SOX17) Protein
Expression, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_Ecto assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_Ecto_SOX17 is 1 of 4 assay components calculated from the
CCTE_Deisenroth_DEVTOX-GLR_Ecto assay and it measures an the percentage of cells in an endoderm state.
Using a type of inducible fluorescent protein reporter, gain-of-signal activity can be used to understand changes
in gene expression with fluorescent microscopy. The percentage of SOX17+ cells are determined by calculating
the number of SOX17+ cells divided by the cell count. To generalize the intended target to other relatable
targets, this assay component is annotated to the transcription factor intended target family, where the
subfamily is SRY-related HMG-box. Data from the assay component CCTE_Deisenroth_DEVTOX-
GLR_Ecto_SOX17 was analyzed as 1 assay endpoint. The endpoint is the percent SOX17+ cell population minus
the 0.2% DMSO control in the CCTE_Deisenroth_DEVTOX-GLR_Ecto assay. This assay endpoint can be referred
to as a secondary readout since this assay has produced multiple assay endpoints where this one serves as a
measure of endoderm differentiation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	DMSO: Dimethyl Sulfoxide

AOP: Adverse Outcome Pathway	ToxCast: US EPA's Toxicity Forecaster Program

CV: Coefficient of Variation	tcpl: ToxCast Data Analysis Pipeline R Package

-------
SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_Ecto_SOX17 was designed to measure changes in cellular SOX17
expression. Changes are indicative of changes in SOX17 gene expression due to perturbations in ectoderm
differentiation.

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

9.98e-05 nM
Key positive control:

NA

Target (nominal) number of replicates:

8

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.07
Response cutoff threshold used to determine hit calls: 0.211

-------
Detection technology used: microscopy (microscopy)

2.6	Response: The DevTox GLR-Ecto assay evaluates SOX2 protein expression as a biomarker for early ectoderm
lineage commitment and differentiation during the gastrulation phase of embryogenesis. Well-level biomarker
expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by the sum of
total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each respective
biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of transcription factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

21: agg.median.rep.apid (Aggregate technical replicates by taking the plate-wise median per sample id
and concentration index.)

Level 3: Endpoint-specific normalization include:

11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 35: resp.logfc (Calculate the

-------
normalized response (resp) as the fold change of logged, i.e. the difference between corrected (cval) and
baseline (bval) log-scale values.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 16	Number of chemicals tested: 16

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
5

7

4

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

1
0

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

5

1

quadratic-polynomialfpoly2) model:	1

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

7

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.024

Neutral control median absolute deviation, by plate: nmad	0.035

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	105.32%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	0.019

Positive control well median absolute deviation, by plate: pmad	0.028

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.123

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3232

CCTE_Deisenroth_DEVTOX-GLR_Ecto_SOX2

1. General Information

1.1	Assay Title: CCTE's DevTox GLR-Ecto Assay Evaluation of SRY-box transcription factor 2 (SOX2) Protein
Expression, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_Ecto assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_Ecto_SOX2 is 1 of 4 assay components calculated from the
CCTE_Deisenroth_DEVTOX-GLR_Ecto assay and it measures an the percentage of cells in an ectoderm state.
Using a type of inducible fluorescent protein reporter, gain-of-signal activity can be used to understand changes
in gene expression with fluorescent microscopy. The percentage of SOX2+ cells are determined by calculating
the number of SOX2+ cells divided by the cell count. To generalize the intended target to other relatable targets,
this assay component is annotated to the transcription factor intended target family, where the subfamily is
SRY-related HMG-box. Data from the assay component CCTE_Deisenroth_DEVTOX-GLR_Ecto_SOX2 was
analyzed as 1 assay endpoint. The endpoint is the percent change in SOX2+ cell population relative to the 0.2%
DMSO control in the CCTE_Deisenroth_DEVTOX-GLR_Ecto assay. Furthermore, this assay endpoint can be
referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves as a measure of ectoderm differentiation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	DMSO: Dimethyl Sulfoxide

AOP: Adverse Outcome Pathway	ToxCast: US EPA's Toxicity Forecaster Program

CV: Coefficient of Variation	tcpl: ToxCast Data Analysis Pipeline R Package

-------
SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_Ecto_SOX2 was designed to measure changes in cellular SOX2
expression. Changes are indicative of changes in SOX2 protein expression due to perturbations in ectoderm
differentiation.

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

9.98e-05 nM
Key positive control:

NA

Target (nominal) number of replicates:

8

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 3.918
Response cutoff threshold used to determine hit calls: 11.755

-------
Detection technology used: microscopy (microscopy)

2.6	Response: The DevTox GLR-Ecto assay evaluates SOX2 protein expression as a biomarker for early ectoderm
lineage commitment and differentiation during the gastrulation phase of embryogenesis. Well-level biomarker
expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by the sum of
total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each respective
biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of transcription factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

21: agg.median.rep.apid (Aggregate technical replicates by taking the plate-wise median per sample id
and concentration index.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive

-------
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 16	Number of chemicals tested: 16

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
5

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	62.09

Neutral control median absolute deviation, by plate: nmad	2.625

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	4.23%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	62.813

Positive control well median absolute deviation, by plate: pmad	2.655

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	0.401

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3233

CCTE_Deisenroth_DEVTOX-GLR_Ecto_BRA

1. General Information

1.1	Assay Title: CCTE's DevTox GLR-Ecto Assay Evaluation of Brachyury (BRA) Protein Expression, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_Ecto assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_Ecto_BRA is 1 of 4 assay components calculated from the
CCTE_Deisenroth_DEVTOX-GLR_Ecto assay and it measures an the percentage of cells in an mesoderm state.
Using a type of inducible fluorescent protein reporter, gain-of-signal activity can be used to understand changes
in gene expression with fluorescent microscopy. The percentage of BRA+ cells are determined by calculating the
number of BRA+ cells divided by the cell count. To generalize the intended target to other relatable targets, this
assay component is annotated to the transcription factor intended target family, where the subfamily is T-box
protein. Data from the assay component CCTE_Deisenroth_DEVTOX-GLR_Ecto_BRA was analyzed as 1 assay
endpoint. The endpoint is the percent BRA+ cell population minus the 0.2% DMSO control in the
CCTE_Deisenroth_DEVTOX-GLR_Ecto assay. Furthermore, this assay endpoint can be referred to as a secondary
readout, because this assay has produced multiple assay endpoints where this one serves as a measure of
mesoderm differentiation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_Ecto_BRA was designed to measure changes in cellular Bra
expression. Changes are indicative of changes in Bra gene expression due to perturbations in ectoderm
differentiation

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow (Gamble 2022). Cells
are seeded into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total
duration of 48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-
hour differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

9.98e-05 nM
Key positive control:

NA

Target (nominal) number of replicates:

8

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.372
Response cutoff threshold used to determine hit calls: 1.116
Detection technology used: microscopy (microscopy)

-------
2.6	Response: The DevTox GLR-Ecto assay evaluates SOX2 protein expression as a biomarker for early ectoderm
lineage commitment and differentiation during the gastrulation phase of embryogenesis. Well-level biomarker
expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by the sum of
total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each respective
biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of transcription factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

21: agg.median.rep.apid (Aggregate technical replicates by taking the plate-wise median per sample id
and concentration index.)

Level 3: Endpoint-specific normalization include:

11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 35: resp.logfc (Calculate the
normalized response (resp) as the fold change of logged, i.e. the difference between corrected (cval) and
baseline (bval) log-scale values.)

-------
Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 16	Number of chemicals tested: 16

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
5

Inactive hit count: 0
-------
Number of sample-assay endpoints with winning hill model:

0
0

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

6

1

quadratic-polynomialfpoly2) model: 4

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

3

2

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

-------
NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.21

Neutral control median absolute deviation, by plate: nmad	0.487

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	45.25%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	0.936

Positive control well median absolute deviation, by plate: pmad	0.312

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.067

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)

-------
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

-------
researcli/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3234

CCTE_Deisen roth_DEVTOX-G LR_Ecto_CellCou nt

1. General Information

1.1	Assay Title: CCTE's DevTox GLR-Ecto Assay Evaluation of Total Cell Counts, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_Ecto assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_Ecto_CellCount is 1 of 4 assay components calculated from the
CCTE_Deisenroth_DEVTOX-GLR_Ecto assay and it measures cell count. Using fluorescent nuclei staining, gain-
of-signal activity can be used to understand changes in cell viability with fluorescent microscopy. It is designed
to make measurements of nuclei counts as a surrogate for cell number, as detected with optical fluorescence
microscopy by Perkin Elmer Harmony nuclei detection algorthm. To generalize the intended target to other
relatable targets, this assay component is annotated to the cell cycle intended target family, where the
subfamily is cytotoxicity. Data from the assay component CCTE_Deisenroth_DEVTOX-GLR_Ecto_CellCount was
analyzed as 1 assay endpoint. The endpoint the percent change in the cell count relative to the 0.2% DMSO
control in the CCTE_Deisenroth_DEVTOX-GLR_Ecto assay. Furthermore, this assay endpoint can be referred to
as a primary readout, because this assay has produced multiple assay endpoints where this one serves as a
measure of cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_Ecto_CellCount was designed to measure changes in the number
of cells where a decrease is indicative of cytotoxicity.

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

9.98e-05 nM
Key positive control:

NA

Target (nominal) number of replicates:

8

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 15.683
Response cutoff threshold used to determine hit calls: 47.049

Detection technology used: Perkin Elmer Harmony nuclei detection algorthm (microscopy)

-------
2.6	Response: The DevTox GLR-Ecto assay evaluates SOX2 protein expression as a biomarker for early ectoderm
lineage commitment and differentiation during the gastrulation phase of embryogenesis. Well-level biomarker
expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by the sum of
total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each respective
biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

21: agg.median.rep.apid (Aggregate technical replicates by taking the plate-wise median per sample id
and concentration index.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med

-------
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 16	Number of chemicals tested: 16

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
5

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1701

Neutral control median absolute deviation, by plate: nmad	266.868

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	15.76%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	1942

Positive control well median absolute deviation, by plate: pmad	289.107

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	0.659

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3235

CCTE_Deisenroth_DEVTOX-GLR_Pluri_SOX17

1. General Information

1.1	Assay Title: CCTE's DevTox GLR-Pluri Assay Evaluation of SRY-box transcription factor 17 (SOX17) Protein
Expression, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_Pluri assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_Pluri_SOX17 is 1 of 4 assay components calculated from the
CCTE_Deisenroth_DEVTOX-GLR_Pluri assay and it measures an the percentage of cells in an endoderm state.
Using a type of inducible fluorescent protein reporter, gain-of-signal activity can be used to understand changes
in gene expression with fluorescent microscopy. The percentage of SOX17+ cells are determined by calculating
the number of SOX17+ cells divided by the cell count. To generalize the intended target to other relatable
targets, this assay component is annotated to the transcription factor intended target family, where the
subfamily is SRY-related HMG-box. Data from the assay component CCTE_Deisenroth_DEVTOX-
GLR_Pluri_SOX17 was analyzed as 1 assay endpoint. The endpoint is the percent SOX17+ cell population minus
the 0.2% DMSO control in the CCTE_Deisenroth_DEVTOX-GLR_Pluri assay. This assay endpoint can be referred
to as a secondary readout since this assay has produced multiple assay endpoints where this one serves as a
measure of endoderm differentiation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	DMSO: Dimethyl Sulfoxide

AOP: Adverse Outcome Pathway	ToxCast: US EPA's Toxicity Forecaster Program

CV: Coefficient of Variation	tcpl: ToxCast Data Analysis Pipeline R Package

-------
SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_Pluri_SOX17 was designed to measure changes in cellular SOX17
expression. Changes are indicative of changes in SOX17 gene expression due to perturbations in pluripotency.

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.000998 nM
Key positive control:

NA

Target (nominal) number of replicates:

5

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.779
Response cutoff threshold used to determine hit calls: 2.338
Detection technology used: microscopy (microscopy)

-------
2.6	Response: The DevTox GLR-Pluri assay evaluates SOX2 protein expression as a biomarker of the pluripotent
state that precedes germ layer development during the gastrulation phase of embryogenesis. Well-level
biomarker expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by
the sum of total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each
respective biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of transcription factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

21: agg.median.rep.apid (Aggregate technical replicates by taking the plate-wise median per sample id
and concentration index.)

Level 3: Endpoint-specific normalization include:

11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 35: resp.logfc (Calculate the
normalized response (resp) as the fold change of logged, i.e. the difference between corrected (cval) and
baseline (bval) log-scale values.)

-------
Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 65	Number of chemicals tested: 65

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
25

Inactive hit count: 0
-------
Number of sample-assay endpoints with winning hill model:

5
2

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

3

22

quadratic-polynomialfpoly2) model: 3

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

2

7

20

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

-------
NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	2.352

Neutral control median absolute deviation, by plate: nmad	0.79

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	36.71%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	2.332

Positive control well median absolute deviation, by plate: pmad	0.862

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.097

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 7.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)

-------
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

-------
researcli/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3236

CCTE_Deisenroth_DEVTOX-GLR_Pluri_SOX2

1. General Information

1.1	Assay Title: CCTE's DevTox GLR-Pluri Assay Evaluation of SRY-box transcription factor 2 (SOX2) Protein
Expression, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_Pluri assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_Pluri_SOX2 is 1 of 4 assay components calculated from the
CCTE_Deisenroth_DEVTOX-GLR_Pluri assay and it measures an the percentage of cells in an pluripotent state.
Using a type of inducible fluorescent protein reporter, gain-of-signal activity can be used to understand changes
in gene expression with fluorescent microscopy. The percentage of SOX2+ cells are determined by calculating
the number of SOX2+ cells divided by the cell count. To generalize the intended target to other relatable targets,
this assay component is annotated to the transcription factor intended target family, where the subfamily is
SRY-related HMG-box. Data from the assay component CCTE_Deisenroth_DEVTOX-GLR_Pluri_SOX2 was
analyzed as 1 assay endpoint. The endpoint is the percent change in the SOX2+ cell population relative to the
0.2% DMSO control in the CCTE_Deisenroth_DEVTOX-GLR_Pluri assay. Furthermore, this assay endpoint can be
referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves as a measure of ectoderm differentiation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	DMSO: Dimethyl Sulfoxide

AOP: Adverse Outcome Pathway	ToxCast: US EPA's Toxicity Forecaster Program

CV: Coefficient of Variation	tcpl: ToxCast Data Analysis Pipeline R Package

-------
SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_Pluri_SOX2 was designed to measure changes in cellular SOX2
expression. Changes are indicative of changes in SOX2 protein expression due to perturbations in pluripotency.

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.000998 nM
Key positive control:

NA

Target (nominal) number of replicates:

5

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 4.178
Response cutoff threshold used to determine hit calls: 12.533
Detection technology used: microscopy (microscopy)

-------
2.6	Response: The DevTox GLR-Pluri assay evaluates SOX2 protein expression as a biomarker of the pluripotent
state that precedes germ layer development during the gastrulation phase of embryogenesis. Well-level
biomarker expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by
the sum of total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each
respective biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of transcription factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

21: agg.median.rep.apid (Aggregate technical replicates by taking the plate-wise median per sample id
and concentration index.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med

-------
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 65	Number of chemicals tested: 65

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
36

Inactive hit count: 0
-------
WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

11
6

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

7

12

quadratic-polynomialfpoly2) model: 5

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

7

1

1

15

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

-------
4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	81.737

Neutral control median absolute deviation, by plate: nmad	3.572

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	4.27%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	83.425
Positive control well median absolute deviation, by plate: pmad 2.47

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	0.413

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 7.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity

-------
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3237

CCTE_Deisenroth_DEVTOX-GLR_Pluri_BRA

1. General Information

1.1	Assay Title: CCTE's DevTox GLR-Pluri Assay Evaluation of Brachyury (BRA) Protein Expression, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_Pluri assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_Pluri_BRA is 1 of 4 assay components calculated from the
CCTE_Deisenroth_DEVTOX-GLR_Pluri assay and it measures an the percentage of cells in an mesoderm state.
Using a type of inducible fluorescent protein reporter, gain-of-signal activity can be used to understand changes
in gene expression with fluorescent microscopy. The percentage of BRA+ cells are determined by calculating the
number of BRA+ cells divided by the cell count. To generalize the intended target to other relatable targets, this
assay component is annotated to the transcription factor intended target family, where the subfamily is T-box
protein. Data from the assay component CCTE_Deisenroth_DEVTOX-GLR_Pluri_BRA was analyzed as 1 assay
endpoint. The endpoint is the percent BRA+ cell population minus the 0.2% DMSO control in the
CCTE_Deisenroth_DEVTOX-GLR_Pluri assay. Furthermore, this assay endpoint can be referred to as a secondary
readout, because this assay has produced multiple assay endpoints where this one serves as a measure of
mesoderm differentiation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_Pluri_BRA was designed to measure changes in cellular Bra
expression. Changes are indicative of changes in Bra gene expression due to perturbations in pluripotency.

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.000998 nM
Key positive control:

NA

Target (nominal) number of replicates:

5

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 1.886
Response cutoff threshold used to determine hit calls: 5.658
Detection technology used: microscopy (microscopy)

-------
2.6	Response: The DevTox GLR-Pluri assay evaluates SOX2 protein expression as a biomarker of the pluripotent
state that precedes germ layer development during the gastrulation phase of embryogenesis. Well-level
biomarker expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by
the sum of total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each
respective biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of transcription factor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

21: agg.median.rep.apid (Aggregate technical replicates by taking the plate-wise median per sample id
and concentration index.)

Level 3: Endpoint-specific normalization include:

11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 35: resp.logfc (Calculate the
normalized response (resp) as the fold change of logged, i.e. the difference between corrected (cval) and
baseline (bval) log-scale values.)

-------
Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 65	Number of chemicals tested: 65

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
34

Inactive hit count: 0
-------
Number of sample-assay endpoints with winning hill model:

5
8

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

5

13

quadratic-polynomialfpoly2) model: 4

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

2

1

14

13

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

-------
NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	4.162

Neutral control median absolute deviation, by plate: nmad	1.704

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	38.65%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	3.653

Positive control well median absolute deviation, by plate: pmad	1.254

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.191

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 13.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)

-------
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

-------
researcli/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3238

CCTE_Deisen roth_DEVTOX-G LR_PI uri_Cel ICou nt

1. General Information

1.1	Assay Title: CCTE's DevTox GLR-Pluri Assay Evaluation of Total Cell Counts, Deisenroth Lab

1.2	Assay Summary: CCTE_Deisenroth_DEVTOX-GLR_Pluri assay is a cell-based, multiplex-readout assay that uses
RUES2-GLR, a human pluripotent cell line, with measurements taken at 48 hours after chemical dosing in a 384-
well microplate CCTE_Deisenroth_DEVTOX-GLR_Pluri_CellCount is 1 of 4 assay components calculated from the
CCTE_Deisenroth_DEVTOX-GLR_Pluri assay and it measures cell count. Using fluorescent nuclei staining, gain-
of-signal activity can be used to understand changes in cell viability with fluorescent microscopy. It is designed
to make measurements of nuclei counts as a surrogate for cell number, as detected with optical fluorescence
microscopy by Perkin Elmer Harmony nuclei detection algorthm. To generalize the intended target to other
relatable targets, this assay component is annotated to the cell cycle intended target family, where the
subfamily is cytotoxicity. Data from the assay component CCTE_Deisenroth_DEVTOX-GLR_Pluri_CellCount was
analyzed as 1 assay endpoint. The endpoint the percent change in the cell count relative to the 0.2% DMSO
control in the CCTE_Deisenroth_DEVTOX-GLR_Pluri assay. Furthermore, this assay endpoint can be referred to
as a primary readout, because this assay has produced multiple assay endpoints where this one serves as a
measure of cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA Center for Computational Toxicology and Exposure labs focus on developing and
implementing in vitro methods to identify potential environmental toxicants. Principal investigators include
Steve Simmons, Joshua Harrill, and Chad Deisenroth.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human pluripotent stem cell test (hPST) is an in vitro test method for rapidly
identifying potential human developmental toxicants that employs directed differentiation of embryonic stem
cells to measure reductions in SOX17 biomarker expression and nuclear localizationl. The multi-lineage DevTox
GLR platform expands on the hPST principles by utilizing a transgenic pluripotent stem cell line expressing
fluorescent reporter fusion protein biomarkers for SOX17 (endoderm marker), BRA (mesoderm marker), and
SOX2 (ectoderm and pluripotency marker). Initial assay adaption to definitive endoderm (DevTox GLR-Endo)
was performed to emulate the hPSTSOX17 endpoint and enable comparative evaluation of concordant chemical
effects. Assay duration was reduced to two days and screening throughput scaled to 384-well format for
enhanced speed and efficiency. Assay performance for 66 chemicals derived from reference and training set
data resulted in a balanced accuracy of 72% (79% sensitivity and 65% specificity). The DevTox GLR-Endo assay
demonstrated successful adaptation of the hPST concept with increased throughput, shorter assay duration,
and minimal endpoint processing (Gamble 2022). Method development and performance evaluation has also
been completed for the DevTox GLR-Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri
(Pluripotent) assays.

1.9	Assay Throughput: 384-well plate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: CCTE_Deisenroth_DEVTOX-GLR_Pluri_CellCount was designed to measure changes in the number
of cells where a decrease is indicative of cytotoxicity.

The DevTox GLR model platform is comprised of four assay modes: DevTox GLR-Endo (Endoderm), DevTox GLR-
Ecto (Ectoderm), DevTox GLR-Meso (Mesoderm), and DevTox GLR-Pluri (Pluripotent) which assess perturbations
to specific gastrulation-associated cellular states that may be indicative of developmental toxicity.

2.2	Scientific Principles: Birth defects impact approximately 3% of births in the United States annually and are a
major contributor to infant morbidity and mortality (Yoon 1997, Hoyert 2006). The majority of developmental
anomalies are of unknown etiology but there is increasing evidence that exposure to certain environmental
chemicals is a contributing factor (Stillerman 2008). Identifying prenatal developmental toxicants is challenging
since target effects can span critical stages of fetal development (e.g. conception through organogenesis),
resulting in adverse outcomes including low birth weight, congenital defects, functional deficits, and pregnancy
loss (Stillerman 2008). Further complicating hazard identification are fetal-maternal interactions where altered
chemical toxicokinetic and toxicodynamic parameters can be ascribed to xenobiotic metabolism (Fantel 1982,
Webster 2002), placental transport functions (Caserta 2013, Birks 2016, Grindler 2018), or general adverse
effects on maternal physiology. The DevTox GLR model platform has the capacity to rapidly screen and identify
potential chemical hazards during the gastrulation phase of early embryogenesis.

2.3	Experimental System: adherent RUES2-GLR cell-based used. The platform utilizes the RUES2-GLR (Rockefeller
University Embryonic Stem cell line 2 - Germ Layer Reporter) pluripotent stem cell reporter line which expresses
fluorescent fusion protein biomarkers for endoderm (SOX17-tdTomato), mesoderm (Brachyury (BRA)-
mCerulean), and ectoderm or pluripotency (SOX2-mCitriline) to enable a multi-lineage, high-throughput
readout of gastrulation (Martyn 2018).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: All assay protocols are aligned to the DevTox GLR-Endo assay workflow2. Cells are seeded
into 384-well microplates for 24 hours prior to initiating differentiation in defined media for a total duration of
48 hours. A total of two chemical exposures are administered in 24-hour intervals during the 48-hour
differentiation period. At the assay termination point, cells are fixed and stained with a fluorescent dye
demarcating the nucleus. High-content imaging is used to acquire brightfield and fluorescent channel images
across 5 fields per well. Cell identification and mean biomarker fluorescence intensities are calculated with
image analysis software to determine the total cell counts for viability assessment and the biomarker expression
frequency for each lineage-dependent assay. Atypical experimental design tests a 10-point concentration series
(1 nM - 200 uM) for 28 compounds in single technical replicate. Assay plate controls include non-differentiated
pluripotent cells (baseline control), lineage-specific differentiated cells (differentiation control), and DMSO
solvent exposed differentiated cells (solvent control). Experiments typically comprise a minimum of four
experimental replicates (n=4).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.000998 nM
Key positive control:

NA

Target (nominal) number of replicates:

5

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 15.862
Response cutoff threshold used to determine hit calls: 47.587

Detection technology used: Perkin Elmer Harmony nuclei detection algorthm (microscopy)

-------
2.6	Response: The DevTox GLR-Pluri assay evaluates SOX2 protein expression as a biomarker of the pluripotent
state that precedes germ layer development during the gastrulation phase of embryogenesis. Well-level
biomarker expression frequency is measured as the 'percent responders'. Well-level 'cell counts', reflected by
the sum of total counted nuclei from all imaged fields, are used to calculate the 'percent responders' for each
respective biomarker and evaluate cytotoxicity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Concentration-response modeling of the 'percent responders' and 'cell counts' endpoints is used
to derive quantitative potency and efficacy values. Normalized values reflect the percent response of the
lineage-specific differentiated cells (differentiation control) set at 100 percent.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

21: agg.median.rep.apid (Aggregate technical replicates by taking the plate-wise median per sample id
and concentration index.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med

-------
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 65	Number of chemicals tested: 65

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
26

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	2634.5

Neutral control median absolute deviation, by plate: nmad	365.09

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	17.67%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	3153.75

Positive control well median absolute deviation, by plate: pmad	431.066

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	0.18

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 9.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Gamble, J. T., Hopperstad, K., & Deisenroth, C. (2022). The DevTox Germ Layer Reporter Platform:
An Assay Adaptation of the Human Pluripotent Stem Cell Test. Toxics, 10(7), 392.
https://doi.org/10.3390/toxicsl0070392, Kameoka, S., Babiarz, J., Kolaja, K., & Chiao, E. (2014). A high-
throughput screen for teratogens using human pluripotent stem cells. Toxicological sciences : an official journal
of the Society of Toxicology, 137(1), 76-90. https://doi.org/10.1093/toxsci/kft239, Yoon, P. W., Olney, R. S.,
Khoury, M. J., Sappenfield, W. M., Chavez, G. F., & Taylor, D. (1997). Contribution of birth defects and genetic
diseases to pediatric hospitalizations. A population-based study. Archives of pediatrics & adolescent medicine,
151(11), 1096-1103., Hoyert, D. L., Mathews, T. J., Menacker, F., Strobino, D. M., & Guyer, B. (2006). Annual
summary of vital statistics: 2004. Pediatrics, 117(1), 168-183. https://doi.org/10.1542/peds.2005-2587,
Stillerman, K. P., Mattison, D. R., Giudice, L. C., & Woodruff, T. J. (2008). Environmental exposures and adverse
pregnancy outcomes: a review of the science. Reproductive sciences (Thousand Oaks, Calif.), 15(7), 631-650.
https://doi.org/10.1177/1933719108322436, Fantel A. G. (1982). Culture of whole rodent embryos in teratogen
screening. Teratogenesis, carcinogenesis, and mutagenesis, 2(3-4), 231-242. https://doi.org/10.1002/1520-
6866(1990)2:3/4<231::aid-tcml770020305>3.0.co;2-l, Webster, W. S., Brown-Woodman, P. D., & Ritchie, H. E.
(1997). A review of the contribution of whole embryo culture to the determination of hazard and risk in
teratogenicity testing. The International journal of developmental biology, 41(2), 329-335., Caserta, D.,
Graziano, A., Lo Monte, G., Bordi, G., & Moscarini, M. (2013). Heavy metals and placental fetal-maternal barrier:
a mini-review on the major concerns. European review for medical and pharmacological sciences, 17(16), 2198-
2206., Birks, L., Casas, M., Garcia, A. M., Alexander, J., Barros, H., Bergstrom, A., Bonde, J. P., Burdorf, A., Costet,
N., Danileviciute, A., Eggesb0, M., Fernandez, M. F., Gonzalez-Galarzo, M. C., Regina Grazuleviciene, Hanke, W.,
Jaddoe, V., Kogevinas, M., Kull, I., Lertxundi, A., Melaki, V., ... Vrijheid, M. (2016). Occupational Exposure to
Endocrine-Disrupting Chemicals and Birth Weight and Length of Gestation: A European Meta-Analysis.
Environmental health perspectives, 124(11), 1785-1793. https://doi.org/10.1289/EHP208, Grindler, N. M.,
Vanderlinden, L, Karthikraj, R., Kannan, K., Teal, S., Polotsky, A. J., Powell, T. L, Yang, I. V., & Jansson, T. (2018).
Exposure to Phthalate, an Endocrine Disrupting Chemical, Alters the First Trimester Placental Methylome and
Transcriptome in Women. Scientific reports, 8(1), 6086. https://doi.org/10.1038/s41598-018-24505-w, Martyn,
I., Kanno, T. Y., Ruzo, A., Siggia, E. D., & Brivanlou, A. H. (2018). Self-organization of a human organizer by
combined Wnt and Nodal signalling. Nature, 558(7708), 132-135. https://doi.org/10.1038/s41586-018-0150-y

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2309

CCTE_GLTED_hDI01

1.	General Information

1.1	Assay Title: CCTE's Human Deiodinase 1 (DIOl) Inhibition Assay, Great Lakes Toxicology and Ecology Division
(GLTED) Lab

1.2	Assay Summary: CCTE_GLTED_hDI01 is a cell-free, single-readout assay designed to test the inhibitory activity
of chemicals toward human iodothyronine Deiodinase type 1 (DIOl) enzyme. Enzyme is incubated in presence
of test chemical for 3 hours during which uninhibited enzyme liberates free iodide from substrate. Free iodide
is measured in a second step using the Sandell-Kolthoff method. CCTE_GLTED_hDI01 is the assay component
measured from the CCTE_GLTED_hDI01 assay. It is designed to make measurements of enzyme activity, a form
of enzyme reporter, as detected with absorbance signals by spectrophotometry. Data from the assay
component CCTE_GLTED_hDI01 was analyzed into 1 assay endpoint. This assay endpoint, CCTE_GLTED_hDI01,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of enzyme reporter, loss-of-signal activity can be used to understand changes in the
enzymatic activity as they relate to the gene DIOl. Furthermore, this assay endpoint can be referred to as a
primary readout, because the performed assay has only produced 1 assay endpoint. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the 'deiodinase' intended target family,
where the subfamily is 'deiodinase Type 1'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA CCTE Great Lakes Toxicology and Ecology Division focuses on ecotoxicology and stressors
of water resources, including devleopment and implementation of in vitro and in vivo assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is non-proprietary.

1.9	Assay Throughput: 96-well plate. Assay is conducted in 96-well plate format. Throughput is such that 3 replicate
plates can be run in one day, usually with 2 people, using 96-well pipettors. In this format 76 chemicals can be
tested in single concentration mode in triplicate in one day.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Deiodinase enzymes play an essential role in converting thyroid hormones between active and
inactive forms by deiodinating the pro-hormone thyroxine (T4) to the active hormone triiodothyronine (T3) and
modifying T4 and T3 to inactive forms. DIOl targets both the outer and inner rings, and thus can convert T4 to
T3 or inactivate either of these thyroid hormones.

This assay is designed to test the inhibitory activity of chemicals toward human iodothyronine Deiodinase type
1 (DIOl) enzyme. Enzyme is incubated in presence of test chemical for 3 hours during which uninhibited enzyme
liberates free iodide from substrate. Free iodide is measured in a second step using the Sandell-Kolthoff method.

2.2	Scientific Principles: Inhibition of iodothyronine deiodinase enzymes has been identified as a molecular
initiating event in seven AOPs currently under development, primarily in fish and amphibians. They are

-------
specifically AOP 155,156, 157, 158,189, 190, and 191 (http://aopkb.org). The most likely effect is reduction of
the active hormone T3 in vertebrate tissue at critical developmental times. Alteration of thyroid hormone status
through this mechanism could have potential harmful effects on vertebrates similar to interference with other
thyroid MIEs like TPO (thyroid peroxidase) or NIS (sodium iodide symporter).

2.3 Experimental System: NA NA cell-free used. Human deiodinase 1 (DI01) was cloned and expressed in HEK293
cells with adenoviral expression vector. The cell homogenate was harvested as the source of human DI01 for

2.4	Metabolic Competence: This assay is done in a cell-homogenate, not live cells. Metabolic activity of the
homogenate has not been determined.

2.5	Exposure Regime: Human deiodinase type 1 (hDIOl) assay, developed at Great Lakes Toxicology and Ecology
Division (GLTED) within the EPA's Center for Computational Toxicology and Exposure (CCTE), is designed to test
the chemicals' ability to inhibit DI01. Assay is based on iodide release measured by the Sandell-Kolthoff method
and is run in 96-well plate format with assay plates run in triplicate (n=3). The method used for the deiodinase
assay was largely based on Renko et al. (2012, 2015). These works describe the utilization of the SK reaction to
measure deiodinase-liberated iodide in a 96-well-plate format. The SK reaction is based on the reduction of the
yellow-colored cerium IV to the non-colored cerium III by arsenic, with the rate of this reaction increased in the
presence of iodide in a concentration-dependent manner (Sandell and Kolthoff, 1937). lodinated test chemicals
can interfere as competitive inhibitors, or if the stock chemical has contaminating free-iodide present. Other
chemicals, e.g.: nitrite, iron, and thiocyanate can interfere with the Sandell-Kolthoff reaction (see Sandell and
Kolthoff, 1937).

Baseline median absolute deviation for the assay (bmad): 5.075
Response cutoff threshold used to determine hit calls: 20
Detection technology used: Spectrophotometry (Absorbance )

2.6	Response: The assay measures the deiodination activity of human DIOl on the substrate thyroxine (T4). After
3h incubation of substrate, enzyme, and test chemical, the components are eluted through a 96-well Dowex
column to separate the free iodide released from the reaction from the other assay components. The free
iodide is measured by the Sandell-Kolthoff reaction in which the rate of reduction of Cerium (Ce+4 to 2Ce+3) by
arsenic (As+3 to As+5) is increased by the presence of iodide which can be observed as the loss of yellow color
in the reaction and the quantified by the change in absorbance at 420 nm. The change in reaction rate is
proportional to the amount of iodide present. Inhibition of deiodinase enzyme activity has the potential for
altering the ratio of T3/T4 thyroid hormones in all vertebrates. Insufficient active hormone can lead to delayed
or asynchronous amphibian metamorphosis, impaired swim bladder formation in fish, and potentially to
adverse neurodevelopmental outcomes in mammals.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.

the assay

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.0336 nM
Key positive control:

6-propyl-thiouracil (PTU)

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

1000 nM
Neutral vehicle control:

DMSO

-------
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Thyroid Bioactivity: Assays related to the thyroid adverse outcome pathway network

Additionally, this assay was annotated to the intended target family of deiodinase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw data is measured as absorbance units at 420 nM. Absorbance at the 10 minute read time is
subtracted from absorbance at 1 minute read time to obtain the change in absorbance. The net change in
absorbance is determined by taking this measured change in absorbance and subtracting the mean background
change in absorbance defined by the completely inhibited reaction in the six wells in that plate containing 200
HM XTH. Data are normalized to percent of control by dividing the net change in absorbance by the mean net
change in absorbance of the seven uninhibited reactions (DMSO control wells) representing the maximum DIOl
activity. Test chemical results are reported as percent inhibition, which is calculated as 100% minus percent of
control. Decrease in signal means the activity of the deiodinase enzyme is being inhibited in some way. Range
is defined by the absorbance of the seven DMSO control wells and the six high concentration XTH wells on each
plate. DMSO control wells contain uninhibited enzyme, thus define maximum activity, defined as 100%. Wells
with high concentration XTH inhibitor contain fully inhibited enzyme, thus are defined as 0% activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

-------
Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 175	Number of chemicals tested: 160

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
168

Inactive hit count: 0
-------
exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

6

20

51

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-0.012

Neutral control median absolute deviation, by plate: nmad

4.324

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-79.95%

-------
POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	100.366

Positive control well median absolute deviation, by plate: pmad	5.283

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	12.985

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 51.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Hornung MW, Korte JJ, Olker JH, Denny JS, Knutsen C, Hartig PC, Cardon MC, Degitz SJ. Screening
the ToxCast Phase 1 Chemical Library for Inhibition of Deiodinase Type 1 Activity. Toxicol Sci. 2018 Apr
1;162(2):570-581. doi: 10.1093/toxsci/kfx279. PMID: 29228274; PMCID: PMC6639810., OlkerJH, Korte JJ, Denny
JS, Hartig PC, Cardon MC, Knutsen CN, Kent PM, Christensen JP, Degitz SJ, Hornung MW. Screening the ToxCast
Phase 1, Phase 2, and elk Chemical Libraries for Inhibitors of lodothyronine Deiodinases. Toxicol Sci. 2019 Apr
1; 168(2):430-442. doi: 10.1093/toxsci/kfy302. PMID: 30561685; PMCID: PMC6520049., Degitz, S. J., Olker, J. H.,
Denny, J. S., Degoey, P. P., Hartig, P. C., Cardon, M. C., Eytcheson, S. A., Haselman, J. T., Mayasich, S. A., &
Hornung, M. W. (2024). In vitro screening of per- and polyfluorinated substances (PFAS) for interference with
seven thyroid hormone system targets across nine assays. Toxicology in vitro : an international journal published
in association with BIBRA, 95,105762. https://doi.Org/10.1016/j.tiv.2023.105762

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2532

CCTE_GLTED_hDI02

1.	General Information

1.1	Assay Title: CCTE's Human Deiodinase 2 (DI02) Inhibition Assay, Great Lakes Toxicology and Ecology Division
(GLTED) Lab

1.2	Assay Summary: CCTE_GLTED_hDI02 is a cell-free, single-readout assay designed to test the inhibitory activity
of chemicals toward human iodothyronine Deiodinase type 2 (DI02) enzyme. Enzyme is incubated in presence
of test chemical for 3 hours during which uninhibited enzyme liberates free iodide from substrate. Free iodide
is measured in a second step using the Sandell-Kolthoff method. CCTE_GLTED_hDI02 is the assay component
measured from the CCTE_GLTED_hDI02 assay. It is designed to make measurements of enzyme activity, a form
of enzyme reporter, as detected with absorbance signals by spectrophotometry. Data from the assay
component CCTE_GLTED_hDI02 was analyzed into 1 assay endpoint. This assay endpoint, CCTE_GLTED_hDI02,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of enzyme reporter, loss-of-signal activity can be used to understand changes in the
enzymatic activity as they relate to the gene DI02. Furthermore, this assay endpoint can be referred to as a
primary readout, because the performed assay has only produced 1 assay endpoint. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the 'deiodinase' intended target family,
where the subfamily is 'deiodinase Type 2'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA CCTE Great Lakes Toxicology and Ecology Division focuses on ecotoxicology and stressors
of water resources, including devleopment and implementation of in vitro and in vivo assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is non-proprietary.

1.9	Assay Throughput: 96-well plate. Assay is conducted in 96-well plate format. Throughput is such that 3 replicate
plates can be run in one day, usually with 2 people, using 96-well pipettors. In this format 76 chemicals can be
tested in single concentration mode in triplicate in one day.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Deiodinase enzymes play an essential role in converting thyroid hormones between active and
inactive forms by deiodinating the pro-hormone thyroxine (T4) to the active hormone triiodothyronine (T3) and
modifying T4 and T3 to inactive forms. DI02 is important for converting T4 to T3 though the removal of the
5a€™ outer ring iodine.

This assay is designed to test the inhibitory activity of chemicals toward human iodothyronine Deiodinase type
2 (DI02) enzyme. Enzyme is incubated in presence of test chemical for 3 hours during which uninhibited enzyme
liberates free iodide from substrate. Free iodide is measured in a second step using the Sandell-Kolthoff method.

2.2

Scientific Principles: Inhibition of iodothyronine deiodinase enzymes has been identified as a molecular
initiating event in seven AOPs currently under development, primarily in fish and amphibians. They are

-------
specifically AOP 155,156, 157, 158,189, 190, and 191 (http://aopkb.org). The most likely effect is reduction of
the active hormone T3 in vertebrate tissue at critical developmental times. Alteration of thyroid hormone status
through this mechanism could have potential harmful effects on vertebrates similar to interference with other
thyroid MIEs like TPO (thyroid peroxidase) or NIS (sodium iodide symporter).

2.3 Experimental System: NA NA cell-free used. Human deiodinase 2 (DI02) was cloned and expressed in HEK293
cells with adenoviral expression vector. The cell homogenate was harvested as the source of human DI02 for

2.4	Metabolic Competence: This assay is done in a cell-homogenate, not live cells. Metabolic activity of the
homogenate has not been determined.

2.5	Exposure Regime: Human deiodinase type 2 (hDI02) assay, developed at Great Lakes Toxicology and Ecology
Division (GLTED) within the EPA's Center for Computational Toxicology and Exposure (CCTE), is designed to test
the chemicals' ability to inhibit DI02. Assay is based on iodide release measured by the Sandell-Kolthoff method
and is run in 96-well plate format with assay plates run in triplicate (n=3). The method used for the deiodinase
assay was largely based on Renko et al. (2012, 2015). These works describe the utilization of the SK reaction to
measure deiodinase-liberated iodide in a 96-well-plate format. The SK reaction is based on the reduction of the
yellow-colored cerium IV to the non-colored cerium III by arsenic, with the rate of this reaction increased in the
presence of iodide in a concentration-dependent manner (Sandell and Kolthoff, 1937). lodinated test chemicals
can interfere as competitive inhibitors, or if the stock chemical has contaminating free-iodide present. Other
chemicals, e.g.: nitrite, iron, and thiocyanate can interfere with the Sandell-Kolthoff reaction (see Sandell and
Kolthoff, 1937).

Baseline median absolute deviation for the assay (bmad): 4.347
Response cutoff threshold used to determine hit calls: 20
Detection technology used: Spectrophotometry (Absorbance )

2.6	Response: The assay measures the deiodination activity of human DI02 on the substrate thyroxine (T4). After
3h incubation of substrate, enzyme, and test chemical, the components are eluted through a 96-well Dowex
column to separate the free iodide released from the reaction from the other assay components. The free
iodide is measured by the Sandell-Kolthoff reaction in which the rate of reduction of Cerium (Ce+4 to 2Ce+3) by
arsenic (As+3 to As+5) is increased by the presence of iodide which can be observed as the loss of yellow color
in the reaction and the quantified by the change in absorbance at 420 nm. The change in reaction rate is
proportional to the amount of iodide present. Inhibition of deiodinase enzyme activity has the potential for
altering the ratio of T3/T4 thyroid hormones in all vertebrates. Insufficient active hormone can lead to delayed
or asynchronous amphibian metamorphosis, impaired swim bladder formation in fish, and potentially to
adverse neurodevelopmental outcomes in mammals.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.

the assay

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.032 nM
Key positive control:
xanthohumol

Target (nominal) number of replicates:

5

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

-------
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Thyroid Bioactivity: Assays related to the thyroid adverse outcome pathway network

Additionally, this assay was annotated to the intended target family of deiodinase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw data is measured as absorbance units at 420 nM. Absorbance at the 10 minute read time is
subtracted from absorbance at 1 minute read time to obtain the change in absorbance. The net change in
absorbance is determined by taking this measured change in absorbance and subtracting the mean background
change in absorbance defined by the completely inhibited reaction in the six wells in that plate containing 200
HM XTH. Data are normalized to percent of control by dividing the net change in absorbance by the mean net
change in absorbance of the seven uninhibited reactions (DMSO control wells) representing the maximum DI02
activity. Test chemical results are reported as percent inhibition, which is calculated as 100% minus percent of
control. Decrease in signal means the activity of the deiodinase enzyme is being inhibited in some way. Range
is defined by the absorbance of the seven DMSO control wells and the six high concentration XTH wells on each
plate. DMSO control wells contain uninhibited enzyme, thus define maximum activity, defined as 100%. Wells
with high concentration XTH inhibitor contain fully inhibited enzyme, thus are defined as 0% activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

-------
Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 235	Number of chemicals tested: 230

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
211

Inactive hit count: 0
-------
exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

6

3

35

37

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-0.372

Neutral control median absolute deviation, by plate: nmad

3.935

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-284.02%

-------
POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	100

Positive control well median absolute deviation, by plate: pmad	2.491

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	20.356

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 37.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Hornung MW, Korte JJ, Olker JH, Denny JS, Knutsen C, Hartig PC, Cardon MC, Degitz SJ. Screening
the ToxCast Phase 1 Chemical Library for Inhibition of Deiodinase Type 1 Activity. Toxicol Sci. 2018 Apr
1;162(2):570-581. doi: 10.1093/toxsci/kfx279. PMID: 29228274; PMCID: PMC6639810., OlkerJH, Korte JJ, Denny
JS, Hartig PC, Cardon MC, Knutsen CN, Kent PM, Christensen JP, Degitz SJ, Hornung MW. Screening the ToxCast
Phase 1, Phase 2, and elk Chemical Libraries for Inhibitors of lodothyronine Deiodinases. Toxicol Sci. 2019 Apr
1; 168(2):430-442. doi: 10.1093/toxsci/kfy302. PMID: 30561685; PMCID: PMC6520049., Degitz, S. J., Olker, J. H.,
Denny, J. S., Degoey, P. P., Hartig, P. C., Cardon, M. C., Eytcheson, S. A., Haselman, J. T., Mayasich, S. A., &
Hornung, M. W. (2024). In vitro screening of per- and polyfluorinated substances (PFAS) for interference with
seven thyroid hormone system targets across nine assays. Toxicology in vitro : an international journal published
in association with BIBRA, 95,105762. https://doi.Org/10.1016/j.tiv.2023.105762

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2533

CCTE_GLTED_hDI03

1.	General Information

1.1	Assay Title: CCTE's Human Deiodinase 3 (DI03) Inhibition Assay, Great Lakes Toxicology and Ecology Division
(GLTED) Lab

1.2	Assay Summary: CCTE_GLTED_hDI03 is a cell-free, single-readout assay designed to test the inhibitory activity
of chemicals toward human iodothyronine Deiodinase type 3 (DI03) enzyme. Enzyme is incubated in presence
of test chemical for 3 hours during which uninhibited enzyme liberates free iodide from substrate. Free iodide
is measured in a second step using the Sandell-Kolthoff method. CCTE_GLTED_hDI03 is the assay component
measured from the CCTE_GLTED_hDI03 assay. It is designed to make measurements of enzyme activity, a form
of enzyme reporter, as detected with absorbance signals by spectrophotometry. Data from the assay
component CCTE_GLTED_hDI03 was analyzed into 1 assay endpoint. This assay endpoint, CCTE_GLTED_hDI03,
was analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of enzyme reporter, loss-of-signal activity can be used to understand changes in the
enzymatic activity as they relate to the gene DI03. Furthermore, this assay endpoint can be referred to as a
primary readout, because the performed assay has only produced 1 assay endpoint. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the 'deiodinase' intended target family,
where the subfamily is 'deiodinase Type 3'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA CCTE Great Lakes Toxicology and Ecology Division focuses on ecotoxicology and stressors
of water resources, including devleopment and implementation of in vitro and in vivo assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is non-proprietary.

1.9	Assay Throughput: 96-well plate. Assay is conducted in 96-well plate format. Throughput is such that 3 replicate
plates can be run in one day, usually with 2 people, using 96-well pipettors. In this format 76 chemicals can be
tested in single concentration mode in triplicate in one day.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Deiodinase enzymes play an essential role in converting thyroid hormones between active and
inactive forms by deiodinating the pro-hormone thyroxine (T4) to the active hormone triiodothyronine (T3) and
modifying T4 and T3 to inactive forms. DI03 inactivates both T4 and T3 by removing an inner ring iodine,
producing reverse T3 (rT3) and diiodotyrosine (T2), respectively.

This assay is designed to test the inhibitory activity of chemicals toward human iodothyronine Deiodinase type
3 (DI03) enzyme. Enzyme is incubated in presence of test chemical for 3 hours during which uninhibited enzyme
liberates free iodide from substrate. Free iodide is measured in a second step using the Sandell-Kolthoff method.

2.2	Scientific Principles: Inhibition of iodothyronine deiodinase enzymes has been identified as a molecular
initiating event in seven AOPs currently under development, primarily in fish and amphibians. They are

-------
specifically AOP 155,156, 157, 158,189, 190, and 191 (http://aopkb.org). The most likely effect is reduction of
the active hormone T3 in vertebrate tissue at critical developmental times. Alteration of thyroid hormone status
through this mechanism could have potential harmful effects on vertebrates similar to interference with other
thyroid MIEs like TPO (thyroid peroxidase) or NIS (sodium iodide symporter).

2.3 Experimental System: NA NA cell-free used. Human deiodinase 3 (DI03) was cloned and expressed in HEK293
cells with adenoviral expression vector. The cell homogenate was harvested as the source of human DI03 for

2.4	Metabolic Competence: This assay is done in a cell-homogenate, not live cells. Metabolic activity of the
homogenate has not been determined.

2.5	Exposure Regime: Human deiodinase type 1 (hDIOl) assay, developed at Great Lakes Toxicology and Ecology
Division (GLTED) within the EPA's Center for Computational Toxicology and Exposure (CCTE), is designed to test
the chemicals' ability to inhibit DI03. Assay uses human deiodinase type 3 (DI03) enzyme produced with
adenovirus expression system. Assay is based on iodide release measured by the Sandell-Kolthoff method and
is run in 96-well plate format with assay plates run in triplicate (n=3). The method used for the deiodinase assay
was largely based on Renko et al. (2012, 2015). These works describe the utilization of the SK reaction to
measure deiodinase-liberated iodide in a 96-well-plate format. The SK reaction is based on the reduction of the
yellow-colored cerium IV to the non-colored cerium III by arsenic, with the rate of this reaction increased in the
presence of iodide in a concentration-dependent manner (Sandell and Kolthoff, 1937). lodinated test chemicals
can interfere as competitive inhibitors, or if the stock chemical has contaminating free-iodide present. Other
chemicals, e.g.: nitrite, iron, and thiocyanate can interfere with the Sandell-Kolthoff reaction (see Sandell and
Kolthoff, 1937).

Baseline median absolute deviation for the assay (bmad): 4.524
Response cutoff threshold used to determine hit calls: 20
Detection technology used: Spectrophotometry (Absorbance )

2.6	Response: The assay measures the deiodination activity of human DI03 on the substrate thyroxine (T4). After
3h incubation of substrate, enzyme, and test chemical, the components are eluted through a 96-well Dowex
column to separate the free iodide released from the reaction from the other assay components. The free
iodide is measured by the Sandell-Kolthoff reaction in which the rate of reduction of Cerium (Ce+4 to 2Ce+3) by
arsenic (As+3 to As+5) is increased by the presence of iodide which can be observed as the loss of yellow color
in the reaction and the quantified by the change in absorbance at 420 nm. The change in reaction rate is
proportional to the amount of iodide present. Inhibition of deiodinase enzyme activity has the potential for
altering the ratio of T3/T4 thyroid hormones in all vertebrates. Insufficient active hormone can lead to delayed
or asynchronous amphibian metamorphosis, impaired swim bladder formation in fish, and potentially to
adverse neurodevelopmental outcomes in mammals.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

the assay

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.032 nM
Key positive control:
xanthohumol

Target (nominal) number of replicates:

5

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

-------
2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Thyroid Bioactivity: Assays related to the thyroid adverse outcome pathway network

Additionally, this assay was annotated to the intended target family of deiodinase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw data is measured as absorbance units at 420 nM. Absorbance at the 10 minute read time is
subtracted from absorbance at 1 minute read time to obtain the change in absorbance. The net change in
absorbance is determined by taking this measured change in absorbance and subtracting the mean background
change in absorbance defined by the completely inhibited reaction in the six wells in that plate containing 200
HM XTH. Data are normalized to percent of control by dividing the net change in absorbance by the mean net
change in absorbance of the seven uninhibited reactions (DMSO control wells) representing the maximum DI03
activity. Test chemical results are reported as percent inhibition, which is calculated as 100% minus percent of
control. Decrease in signal means the activity of the deiodinase enzyme is being inhibited in some way. Range
is defined by the absorbance of the seven DMSO control wells and the six high concentration XTH wells on each
plate. DMSO control wells contain uninhibited enzyme, thus define maximum activity, defined as 100%. Wells
with high concentration XTH inhibitor contain fully inhibited enzyme, thus are defined as 0% activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp

-------
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 230	Number of chemicals tested: 227

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
210

Inactive hit count: Oihitc 0.9
16

WINING MODEL SELECTION

NA hit count: hitc^O
4

Number of sample-assay endpoints with winning hill model:

90
5

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

15

15

-------
quadratic-polynomialfpoly2) model: 30

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

44

29

1

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-0.163

Neutral control median absolute deviation, by plate: nmad

4.455

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-40.85%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	99.819

Positive control well median absolute deviation, by plate: pmad	2.136

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	20.305

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 44.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Hornung MW, Korte JJ, Olker JH, Denny JS, Knutsen C, Hartig PC, Cardon MC, Degitz SJ. Screening
the ToxCast Phase 1 Chemical Library for Inhibition of Deiodinase Type 1 Activity. Toxicol Sci. 2018 Apr
1;162(2):570-581. doi: 10.1093/toxsci/kfx279. PMID: 29228274; PMCID: PMC6639810., OlkerJH, Korte JJ, Denny
JS, Hartig PC, Cardon MC, Knutsen CN, Kent PM, Christensen JP, Degitz SJ, Hornung MW. Screening the ToxCast
Phase 1, Phase 2, and elk Chemical Libraries for Inhibitors of lodothyronine Deiodinases. Toxicol Sci. 2019 Apr
1; 168(2):430-442. doi: 10.1093/toxsci/kfy302. PMID: 30561685; PMCID: PMC6520049., Degitz, S. J., Olker, J. H.,
Denny, J. S., Degoey, P. P., Hartig, P. C., Cardon, M. C., Eytcheson, S. A., Haselman, J. T., Mayasich, S. A., &
Hornung, M. W. (2024). In vitro screening of per- and polyfluorinated substances (PFAS) for interference with
seven thyroid hormone system targets across nine assays. Toxicology in vitro : an international journal published
in association with BIBRA, 95, 105762. https://doi.Org/10.1016/j.tiv.2023.105762, Mayasich, S. A., Korte, J. J.,
Denny, J. S., Hartig, P. C., Olker, J. H., DeGoey, P., O'Flanagan, J., Degitz, S. J., & Hornung, M. W. (2021). Xenopus
laevis and human type 3 iodothyronine deiodinase enzyme cross-species sensitivity to inhibition by ToxCast
chemicals. Toxicology in vitro : an international journal published in association with BIBRA, 73, 105141.
https://doi.Org/10.1016/j.tiv.2021.105141

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3032

CCTE_GLTED_hlYD

1.	General Information

1.1	Assay Title: CCTE's Human lodotyrosine Deiodinase (xlYD) Inhibition Assay, Great Lakes Toxicology and Ecology
Division (GLTED) Lab

1.2	Assay Summary: CCTE_GLTED_hlYD is a cell-free, single-readout assay that uses human iodotyrosine deiodinase
(hlYD) enzyme produced with baculovirus-insect cell system. The assay is based on iodide release measured by
the Sandell-Kolthoff method and is run in 96-well plate format with assay plates run in triplicate (n=3) with
measurements taken at 3 hours after chemical dosing. Platewise-normalization was used based on positive
control (3-Nitro-L-tyrosine, MNT) and negative/solvent controls (DMSO, NaOH). CCTE_GLTED_hlYD is the assay
component measured from the CCTE_GLTED_hlYD assay. It measures substrate involved in regulation of
catalytic activity using spectrophotometry. The assay measures the deiodination activity of human IYD on the
substrate monoiodotyrosine (MIT). After 3h incubation of substrate, enzyme, and test chemical, the
components are applied to a 96-well Dowex column. Free iodide is not retained by the column and is separated
from interfering assay components. The free iodide is measured by the Sandell-Kolthoff reaction in which the
rate of reduction of cerium (Ce+4 to Ce+3) by arsenic (As+3 to As+5) is increased by the presence of iodide. This
is observed as the loss of yellow color in the reaction and the quantified by the change in absorbance at 420
nm. The change in reaction rate is proportional to the amount of iodide present. Data from the assay component
CCTE_GLTED_hlYD was analyzed at the assay endpoint CCTE_GLTED_hlYD in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of enzyme reporter,
loss-of-signal activity can be used to understand iodotyrosine deiodinase activity. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the 'dehalogenase' intended target family,
where the subfamily is 'iodotyrosine deiodinase1.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA CCTE Great Lakes Toxicology and Ecology Division focuses on ecotoxicology and stressors
of water resources, including devleopment and implementation of in vitro and in vivo assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is non-proprietary.

1.9	Assay Throughput: 96-well plate. Assay is conducted in 96-well plate format. Throughput is such that 12
replicate plates can be run in one day, usually with 2 people, using 96-well pipettors. In this format 304 chemicals
can be tested in single concentration mode in triplicate in one day.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: The iodide recycling enzyme, iodotyrosine deiodinase (IYD), is one conserved putative molecular
target that plays an essential role in maintaining adequate levels of free iodide in the thyroid gland for hormone
synthesis.

-------
This assay is designed to test the binding activity of chemicals toward human thyroxine-binding globulin (TBG).
TBG and the fluorescent probe 8-anilino-l-naphthalenesulfonic acid (ANSA) are incubated in the presence of
test chemical for 2 hours. Chemicals with binding affinity for TBG displace ANSA resulting in a loss of
fluorescence, which is measured at the end of the incubation period.

2.2	Scientific Principles: Thyroid hormone serum binding proteins promote uniform distribution of circulating
thyroid hormones and facilitate delivery of thyroid hormone to target tissues. Chemical binding to TBG and
subsequent displacement of thyroid hormone may result in altered thyroid hormone homeostasis.

2.3	Experimental System: NA SF-21 cell-free used. Human thyroxine-binding globulin (TBG), manufactured by
Calbiochem, was purchased from Sigma Aldrich. Human TBG was delivered in 140 mM NaCI, 10 mM Tris, 0.1%
NaN3, pH 8.0, sterile filtered.

2.4	Metabolic Competence: This assay is done with pure protein in buffer, not live cells; thus, this assay is not
metabolically competent.

2.5	Exposure Regime: Human thyroxine binding globulin (TBG) assay, developed at Great Lakes Toxicology and
Ecology Division (GLTED) within the EPA's Center for Computational Toxicology and Exposure (CCTE), is designed
to test the chemicals' ability to displace T4 from the TBG protein. The method used was largely based on
Montano et al. (2012). This work describes the utilization of ANSA fluorescence when bound to the protein to
approximate binding affinity of chemicals through a reduction of fluorescence when ANSA is displaced. This
assay was conducted in 96-well plates with an incubation at 4C for 2 hours in a final volume of 200 uL. Final
assay conditions included 0.0625 nM TBG, 0.6 nM ANSA, and 0.5% DMSO in 0.1 M phosphate buffer, pH=7.5.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

7	5

Standard minimum concentration tested:	Standard maximum concentration tested:

0.1600736 nM	1000.46 nM

Key positive control:	Neutral vehicle control:

3-Nitro-L-tyrosine (0.05 M NaOH for model	DMSO

inhibitor (MNT))

Baseline median absolute deviation for the assay (bmad): 6.806
Response cutoff threshold used to determine hit calls: 20
Detection technology used: Spectrophotometry (Absorbance )

2.6	Response: The assay measures chemical binding to human TBG through displacement of ANSA. After 2h
incubation of the protein, ANSA, and test chemical, fluorescence is measured in a 96-well plate reader using
excitation and emission wavelengths of 380 and 475 nm, respectively. When chemicals bind to TBG, ANSA is
displaced and relative fluorescence units (RFUs) are decreased.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

-------
2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Thyroid Bioactivity: Assays related to the thyroid adverse outcome pathway network

Additionally, this assay was annotated to the intended target family of dehalogenase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw data is measured as relative fluorescence units at 475 nM. Platewise-normalization was
used based on highest (0.3645 nM) and lowest (0.0013 nM) concentrations of the positive control (Thyroxine,
T4), and each plate also included wells with a negative control (DMSO). Data were processed by: a) correcting
for background fluorescence [background corrected: value - mean(value for TBG wells)]; b) calculating the net
fluorescence [background corrected value - mean(background corrected value for 0.3645 nM T4 wells)]; and c)
normalizing to % of control [net fluorescence/mean(net fluorescence for 0.0013 nM T4 wells) *100%]. Percent
activity was calculated as 100% - % of control (uninhibited) reaction.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning

-------
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 176	Number of chemicals tested: 166

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
122

Inactive hit count: Oihitc 0.9
49

WINING MODEL SELECTION

NA hit count: hitc^O
5

Number of sample-assay endpoints with winning hill model:

18
1

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

64

18

quadratic-polynomialfpoly2) model: 28

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

2

27

1

17

-------
For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-0.386

Neutral control median absolute deviation, by plate: nmad

5.89

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-177.28%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

NA

NA

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

NA

-------
Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 17.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

• Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

-------
•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Olker JH, Korte JJ, Denny JS, Haselman JT, Hartig PC, Cardon MC, Hornung MW, Degitz SJ. In vitro
screening for chemical inhibition of the iodide recycling enzyme, iodotyrosine deiodinase. Toxicol In Vitro. 2021
Mar;71:105073. doi: 10.1016/j.tiv.2020.105073. Epub 2020 Dec 29. PMID: 33352258; PMCID: PMC8130633.,
Degitz, S. J., Olker, J. H., Denny, J. S., Degoey, P. P., Hartig, P. C., Cardon, M. C., Eytcheson, S. A., Haselman, J. T.,
Mayasich, S. A., & Hornung, M. W. (2024). In vitro screening of per- and polyfluorinated substances (PFAS) for
interference with seven thyroid hormone system targets across nine assays. Toxicology in vitro : an international
journal published in association with BIBRA, 95,105762. https://doi.Org/10.1016/j.tiv.2023.105762

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3088

CCTE_GLTED_hTBG

1.	General Information

1.1	Assay Title: CCTE's Human Thyroxine-binding Globulin (TBG) Inhibition Assay, Great Lakes Toxicology and
Ecology Division (GLTED) Lab

1.2	Assay Summary: CCTE_GLTED_hTBG is a cell-free, single-readout assay designed to test the chemicals' ability to
displace T4 from the human thyroxine-binding globulin (TBG) protein. TBG is incubated in presence of test
chemical for 2 hours during which disrupting chemicals displace ANSA from the TBG protein. Results read
fluorescence at 380 excitation and 475 emission. CCTE_GLTED_hTBG is the assay component measured from
the CCTE_GLTED_hTBG assay. It is designed to make measurements of fluorescent polarization, using a form of
binding reporter, as detected with Fluorescence signals by Fluorescence technology. Data from the assay
component CCTE_GLTED_hTBG was analyzed at the assay endpoint CCTE_GLTED_hTBG in the positive analysis
fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of binding
reporter, loss-of-signal activity can be used to understand a chemical's ability to displace T4 from thyroxine-
binding globulin. To generalize the intended target to other relatable targets, this assay endpoint is annotated
to the 'transporter' intended target family, where the subfamily is 'hormone carrier protein'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA CCTE Great Lakes Toxicology and Ecology Division focuses on ecotoxicology and stressors
of water resources, including devleopment and implementation of in vitro and in vivo assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is non-proprietary.

1.9	Assay Throughput: 96-well plate. Assay is conducted in 96-well plate format. Throughput is such that 12
replicate plates can be run in one day, usually with 2 people, using 96-well pipettors. In this format 304 chemicals
can be tested in single concentration mode in triplicate in one day.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Thyroxine-binding globulin (TBG) is one the major transport proteins responsible for binding to
and transporting thyroid hormones to the necessary tissues.

This assay is designed to test the binding activity of chemicals toward human transthyretin (TTR). TTR and the
fluorescent probe 8-anilino-l-naphthalenesulfonic acid ammonium salt (ANSA) are incubated in the presence
of test chemical for 2 hours. Chemicals with binding affinity for TTR displace ANSA resulting in a loss of
fluorescence, which is measured at the end of the incubation period.

2.2	Scientific Principles: Thyroid hormone serum binding proteins promote uniform distribution of circulating
thyroid hormones and facilitate delivery of thyroid hormone to target tissues. Chemical binding to TTR and
subsequent displacement of thyroid hormone may result in altered thyroid hormone homeostasis.

-------
2.3

Experimental System: NA NA cell-free used. Human transthyretin (>95% purity) was purchased from Sigma
Aldrich. Human TTR was delivered as lyophilized powder and reconstituted in 0.1 M phosphate buffer, pH=7.5
to prepare a 5nM stock solution.

2.4	Metabolic Competence: This assay is done with pure protein in buffer, not live cells; thus, this assay is not
metabolically competent.

2.5	Exposure Regime: Human transthyretin (TTR) assay, developed at Great Lakes Toxicology and Ecology Division
(GLTED) within the EPA's Center for Computational Toxicology and Exposure (CCTE), is designed to test the
chemicals' ability to displace T4 from the TTR protein. TTR (0.5uM) is incubated in presence of test chemical for
2 hours during which disrupting chemicals displace ANSA from the TTR protein. Results read fluorescence at 380
excitation and 475 emission.

Baseline median absolute deviation for the assay (bmad): 6.774
Response cutoff threshold used to determine hit calls: 20
Detection technology used: Spectrophotometry (Fluorescence)

2.6	Response: The assay measures chemical binding to human TTR through displacement of ANSA. After 2h
incubation of the protein, ANSA, and test chemical, fluorescence is measured in a 96-well plate reader using
excitation and emission wavelengths of 380 and 475 nm, respectively. When chemicals bind to TTR, ANSA is
displaced and relative fluorescence units (RFUs) are decreased.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Thyroid Bioactivity: Assays related to the thyroid adverse outcome pathway network

Additionally, this assay was annotated to the intended target family of transporter.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

45 nM
Key positive control:

3,5,3,5-tetraiodothyronine (T4)

Target (nominal) number of replicates:

5

Standard maximum concentration tested:

150 nM
Neutral vehicle control:

DMSO

-------
3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw data is measured as relative fluorescence units at 475 nM. Platewise-normalization was
used based on highest (1.8 nM) and lowest (0.0067 nM) concentrations of the positive control (Thyroxine, T4),
and each plate also included wells with a negative control (DMSO). Data were processed by: a) correcting for
background fluorescence [background corrected: value - mean(value for TTR wells)]; b) calculating the net
fluorescence [background corrected value - mean(background corrected value for 1.8 nM T4 wells)]; and c)
normalizing to % of control [net fluorescence/mean(net fluorescence for 0.0067 nM T4 wells) *100%]. Percent
activity was calculated as 100% - % of control (uninhibited) reaction.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates

-------
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 313	Number of chemicals tested: 312

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.608

Neutral control median absolute deviation, by plate: nmad	1.323

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-7.54%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

-------
Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 41.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Degitz, S. J., Olker, J. H., Denny, J. S., Degoey, P. P., Hartig, P. C., Cardon, M. C., Eytcheson, S. A.,
Haselman, J. T., Mayasich, S. A., & Hornung, M. W. (2024). In vitro screening of per- and polyfluorinated
substances (PFAS) for interference with seven thyroid hormone system targets across nine assays. Toxicology in
vitro	an international journal published in association with BIBRA, 95, 105762.
https://doi.Org/10.1016/j.tiv.2023.105762

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3089

CCTE_G LTED_hTTR_0.5u M

1.	General Information

1.1	Assay Title: CCTE's Human Transthyretin (hTTR) Inhibition Assay, with 0.5uM TTR, Great Lakes Toxicology and
Ecology Division (GLTED) Lab

1.2	Assay Summary: CCTE_GLTED_hTTR is a cell-free, single-readout assay designed to test the chemicals' ability to
displace T4 from the human transthyretin (TTR) protein. TTR is incubated in presence of test chemical for 2
hours during which disrupting chemicals displace ANSA from the TTR protein. Results read fluorescence at 380
excitation and 475 emission. CCTE_GLTED_hTTR_0.5uM is the assay component measured from the
CCTE_GLTED_hTTR assay. It is designed to make measurements of fluorescent polarization, using a form of
binding reporter, as detected with Fluorescence signals by Fluorescence technology. Data from the assay
component CCTE_GLTED_hTTR_0.5uM was analyzed at the assay endpoint CCTE_GLTED_hTTR_0.5uM in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of binding reporter, loss-of-signal activity can be used to understand a chemical's ability to displace T4 from
transthyretin. To generalize the intended target to other relatable targets, this assay endpoint is annotated to
the 'transporter' intended target family, where the subfamily is 'hormone carrier protein'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA CCTE Great Lakes Toxicology and Ecology Division focuses on ecotoxicology and stressors
of water resources, including devleopment and implementation of in vitro and in vivo assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is non-proprietary.

1.9	Assay Throughput: 96-well plate. Assay is conducted in 96-well plate format. Throughput is such that 3 replicate
plates can be run in one day, usually with 2 people, using 96-well pipettors. In this format 76 chemicals can be
tested in single concentration mode in triplicate in one day.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Transthyretin (TTR) is one the major transport proteins responsible for binding to and transporting
thyroid hormones to the necessary tissues.

This assay is designed to test the inhibitory activity of chemicals toward human thyroperoxidase (hTPO) enzyme.
Enzyme is incubated in presence of excess hydrogen peroxide and test chemical for 30 minutes during which
AUR is converted to Amplex UltroxRed by uninhibited TPO. End-point fluorescence was measured at 544/590
nm excitation/emission.

2.2	Scientific Principles: Inhibition of thyroperoxidase has been identified as a molecular initiating event in several
AOPs currently under development, primarily in fish and amphibians. These specifically include AOP 42, 159,
175, 363, 364, 365, and 402 (http://aopwiki.org). The most likely effect of TPO inhibition is reduced synthesis
of thyroid hormone. Alteration of thyroid hormone status through this mechanism could have potential harmful

-------
effects on vertebrates similar to interference with other thyroid MIEs like deiodinases (diol/dio2/dio3) or NIS
(sodium iodide symporter).

2.3	Experimental System: NA NA cell-free used. Human thyroperoxidase (hTPO) was cloned and expressed in
HEK293 cells with adenoviral expression vector. The cell homogenate was harvested as the source of human
TPO for the assay

2.4	Metabolic Competence: This assay is done in a cell-homogenate, not live cells. Metabolic activity of the
homogenate has not been determined.

2.5	Exposure Regime: Human thyroperoxidase protein (TPO) assay, developed at Great Lakes Toxicology and
Ecology Division (GLTED) within the EPA's Center for Computational Toxicology and Exposure (CCTE, in which
uninhibited enzyme converts Amplex UltraRed to a fluorescent product. Measurements were taken 0.5 hour
after chemical dosing in a 96-well plate format with assay plates run in triplicate (n=3). 4 chemicals
[EPAPLT230F07, EPAPLT228D06, EPAPLT228H07, EPAPLT230D12] on the PFAS plates were found to interfere
with the assay and thus were excluded from this dataset.

Baseline median absolute deviation for the assay (bmad): 2.992
Response cutoff threshold used to determine hit calls: 20
Detection technology used: Spectrophotometry (Fluorescence)

2.6	Response: The assay measures the peroxidase activity of human TPO on the substrate Amplex UltraRed (AUR).
After 30 minutes incubation of substrate, enzyme, excess hydrogen peroxide, and test chemical, fluorescence
was read using a plate reader at 544 nm excitation and 590 nm emission wavelength. Uninhibited TPO results
in the conversion of AUR to Amplex UltroxRed, which is brightly fluorescent. Chemical inhibition of TPO would
prevent production of Amplex UltroxRed and subsequent fluorescence. In vivo, inhibition of TPO could result
in reduced synthesis of thyroid hormone in vertebrates. Insufficient thyroid hormone can lead to delayed or
asynchronous amphibian metamorphosis, impaired swim bladder formation in fish, and potentially adverse
neurodevelopment in mammals.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Thyroid Bioactivity: Assays related to the thyroid adverse outcome pathway network

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
12

Standard minimum concentration tested:

0.051075943 nM
Key positive control:

3,5,3,5-tetraiodothyronine (T4)

Target (nominal) number of replicates:

5

Standard maximum concentration tested:

150 nM
Neutral vehicle control:

DMSO

Additionally, this assay was annotated to the intended target family of transporter.

-------
3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Platewise-normalization was used based on highest (1000 nM) and lowest (0.0001 nM)
concentrations of the positive control (Methimazole, MMI), and each plate also included wells with a negative
control (DMSO). Data were processed by: a) correcting for background fluorescence [background corrected:
value - mean(value for TPO wells)]; b) calculating the net fluorescence [background corrected value -
mean(background corrected value for 1000 nM MMI wells)]; and c) normalizing to % of control [net
fluorescence/mean(net fluorescence for 0.0001 nM MMI wells) *100%]. Percent activity was calculated as 100%
- % of control (uninhibited) reaction.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one

-------
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 90	Number of chemicals tested: 89

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

-2.54
1.239
-29.33%

NA
NA

NA

Positive control signal-to-background: (pmed/nmed)

NA

-------
NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 17.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

-------
6.	Bibliography: Degitz, S. J., Olker, J. H., Denny, J. S., Degoey, P. P., Hartig, P. C., Cardon, M. C., Eytcheson, S. A.,
Haselman, J. T., Mayasich, S. A., & Hornung, M. W. (2024). In vitro screening of per- and polyfluorinated
substances (PFAS) for interference with seven thyroid hormone system targets across nine assays. Toxicology in
vitro	an international journal published in association with BIBRA, 95, 105762.
https://doi.Org/10.1016/j.tiv.2023.105762

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3090

CCTE_GLTED_hTPO

1.	General Information

1.1	Assay Title: CCTE's Human Thyroperoxidase (hTPO) Inhibition Assay, Great Lakes Toxicology and Ecology Division
(GLTED) Lab

1.2	Assay Summary: CCTE_GLTED_hTPO is a cell-free, single-readout assay designed to test the inhibitory activity of
chemicals toward human thyroperoxidase (TPO) protein. Uninhibited enzyme converts Amplex UltraRed to a
fluorescent product. Measurements were taken 0.5 hour after chemical dosing in a 96-well
plate.AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA CCTE_GLTED
_hTPO is the assay component measured from the CCTE_GLTED_hTPO assay. It is designed to make
measurements of enzyme activity, a form of enzyme reporter, as detected with absorbance signals by
Fluorescence technology. Data from the assay component CCTE_GLTED_hTPO was analyzed at the assay
endpoint CCTE_GLTED_hTPO in the positive analysis fitting direction relative to DMSO as the negative control
and baseline of activity. Using a type of enzyme reporter (the Amplex UltraRed assay), loss-of-signal activity can
be used to understand decreased peroxidase activity in the presence of excess hydrogen peroxidase. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the
'oxidoreductase' intended target family, where the subfamily is 'peroxidase1.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA CCTE Great Lakes Toxicology and Ecology Division focuses on ecotoxicology and stressors
of water resources, including devleopment and implementation of in vitro and in vivo assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is non-proprietary.

1.9	Assay Throughput: 96-well plate. Assay is conducted in 96-well plate format. Throughput is such that 3 replicate
plates can be run in one day, usually with 2 people, using 96-well pipettors. In this format 76 chemicals can be
tested in single concentration mode in triplicate in one day.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: A well-characterized MIE for adverse thyroid-mediated outcomes is inhibition of TPO. TPO
catalyzes iodine oxidation in the presence of hydrogen peroxide, regulates nonspecific iodination of tyrosyl
residues of thyroglobulin to form TH precursors, monoiodotyrosine (MIT), and diiodotyrosine (DIT), and
modulates coupling of these iodotyrosyl residues.

This assay is designed to test the inhibitory activity of chemicals toward Xenopus iodothyronine Deiodinase type
3 (DI03) enzyme. Enzyme is incubated in presence of test chemical for 3 hours during which uninhibited enzyme
liberates free iodide from substrate. Free iodide is measured in a second step using the Sandell-Kolthoff method.

2.2	Scientific Principles: Inhibition of iodothyronine deiodinase enzymes has been identified as a molecular
initiating event in seven AOPs currently under development, primarily in fish and amphibians. They are
specifically AOP 155,156, 157, 158,189, 190, and 191 (http://aopkb.org). The most likely effect is reduction of

-------
the active hormone T3 in vertebrate tissue at critical developmental times. Alteration of thyroid hormone status
through this mechanism could have potential harmful effects on vertebrates similar to interference with other
thyroid MIEs like TPO (thyroid peroxidase) or NIS (sodium iodide symporter).

2.3	Experimental System: NA NA cell-free used. Xenopus deiodinase 3 (DI03) was cloned and expressed in HEK293
cells transfected with Xldio3 enzyme in a recombinant pcDNA3.1 (+) plasmid. The X. laevis 3'-UTR was replaced
by the human DI03 (hDI03) 3'-UTR to enhance translation of Xldio3 in human cells. The cell homogenate was
harvested as the source of Xenopus DI03 for the assay

2.4	Metabolic Competence: This assay is done in a cell-homogenate, not live cells. Metabolic activity of the
homogenate has not been determined.

2.5	Exposure Regime: Xenopus deiodinase type 3 (hDI03) assay, developed at Great Lakes Toxicology and Ecology
Division (GLTED) within the EPA's Center for Computational Toxicology and Exposure (CCTE), is designed to test
the chemicals' ability to inhibit DI03. Assay uses Xenopus deiodinase type 1 (DIOl) enzyme produced with
adenovirus expression system. Assay is based on iodide release measured by the Sandell-Kolthoff method and
is run in 96-well plate format with assay plates run in triplicate (n=3). The method used for the deiodinase assay
was largely based on Renko et al. (2012, 2015). These works describe the utilization of the SK reaction to
measure deiodinase-liberated iodide in a 96-well-plate format. The SK reaction is based on the reduction of the
yellow-colored cerium IV to the non-colored cerium III by arsenic, with the rate of this reaction increased in the
presence of iodide in a concentration-dependent manner (Sandell and Kolthoff, 1937).

Baseline median absolute deviation for the assay (bmad): 5.465
Response cutoff threshold used to determine hit calls: 20
Detection technology used: Spectrophotometry (Fluorescence)

2.6	Response: The assay measures the deiodination activity of Xenopus DI03 on the substrate 3,5,3'-triiodo-L-
thyronine (T3). After 3h incubation of substrate, enzyme, and test chemical, the components are eluted through
a 96-well Dowex column to separate the free iodide released from the reaction from the other assay
components. The free iodide is measured by the Sandell-Kolthoff reaction in which the rate of reduction of
Cerium (Ce+4 to 2Ce+3) by arsenic (As+3 to As+5) is increased by the presence of iodide which can be observed
as the loss of yellow color in the reaction and the quantified by the change in absorbance at 420 nm. The change
in reaction rate is proportional to the amount of iodide present. Inhibition of deiodinase enzyme activity has the
potential for altering the ratio of T3/T4 thyroid hormones in all vertebrates. Insufficient active hormone can
lead to delayed or asynchronous amphibian metamorphosis, impaired swim bladder formation in fish, and
potentially to adverse neurodevelopmental outcomes in mammals.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

le-04 nM
Key positive control:

Methimizole

I arget (nominal) number of replicates:

4

Standard maximum concentration tested:

1000 nM
Neutral vehicle control:

DMSO

-------
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Thyroid Bioactivity: Assays related to the thyroid adverse outcome pathway network

Additionally, this assay was annotated to the intended target family of oxidoreductase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw data is measured as absorbance units at 420 nM. Absorbance at the 20 minute read time is
subtracted from absorbance at 1 minute read time to obtain the change in absorbance. The net change in
absorbance is determined by taking this measured change in absorbance and subtracting the mean background
change in absorbance defined by the completely inhibited reaction in the six wells in that plate containing 200
HM XTH. Data are normalized to percent of control by dividing the net change in absorbance by the mean net
change in absorbance of the seven uninhibited reactions (DMSO control wells) representing the maximum DI03
activity. Test chemical results are reported as percent inhibition, which is calculated as 100% minus percent of
control. Decrease in signal means the activity of the deiodinase enzyme is being inhibited in some way. Range
is defined by the absorbance of the seven DMSO control wells and the six high concentration XTH wells on each
plate. DMSO control wells contain uninhibited enzyme, thus define maximum activity, defined as 100%. Wells
with high concentration XTH inhibitor contain fully inhibited enzyme, thus are defined as 0% activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

-------
2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 19	Number of chemicals tested: 19

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
14

Inactive hit count: 0
-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

2

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-3.719

Neutral control median absolute deviation, by plate: nmad

3.924

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-68.07%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

NA
NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Degitz, S. J., Olker, J. H., Denny, J. S., Degoey, P. P., Hartig, P. C., Cardon, M. C., Eytcheson, S. A.,
Haselman, J. T., Mayasich, S. A., & Hornung, M. W. (2024). In vitro screening of per- and polyfluorinated
substances (PFAS) for interference with seven thyroid hormone system targets across nine assays. Toxicology in
vitro	an international journal published in association with BIBRA, 95, 105762.
https://doi.Org/10.1016/j.tiv.2023.105762

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3091

CCTE_GLTED_xDI03

1.	General Information

1.1	Assay Title: CCTE's Xenopus lodothyronine Deiodinase Type 3 (xDI03) Inhibition Assay, Great Lakes Toxicology
and Ecology Division (GLTED) Lab

1.2	Assay Summary: CCTE_GLTED_xDI03 is a cell-free, single-readout assay designed to test the inhibitory activity
of chemicals toward Xenopus (frog) iodothyronine Deiodinase Type 3 (DI03) enzyme. Enzyme is incubated in
presence of test chemical for 3 hours during which uninhibited enzyme liberates free iodide from substrate.
Free iodide is measured in a second step using the Sandell-Kolthoff method. CCTE_GLTED_xDI03 is the assay
component measured from the CCTE_GLTED_xDIO assay. It is designed to make measurements of enzyme
activity, a form of enzyme reporter, as detected with absorbance signals by spectrophotometry. Data from the
assay component CCTE_GLTED_xDI03 was analyzed at the assay endpoint CCTE_GLTED_xDI03 in the positive
analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of enzyme
reporter, loss-of-signal activity can be used to understand deiodinase Type 3 activity. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the 'deiodinase' intended target family,
where the subfamily is 'deiodinase Type 3'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA CCTE Great Lakes Toxicology and Ecology Division focuses on ecotoxicology and stressors
of water resources, including devleopment and implementation of in vitro and in vivo assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is non-proprietary.

1.9	Assay Throughput: 96-well plate. Assay is conducted in 96-well plate format. Throughput is such that 3 replicate
plates can be run in one day, usually with 2 people, using 96-well pipettors. In this format 76 chemicals can be
tested in single concentration mode in triplicate in one day.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Deiodinase enzymes play an essential role in converting thyroid hormones between active and
inactive forms by deiodinating the pro-hormone thyroxine (T4) to the active hormone triiodothyronine (T3) and
modifying T4 and T3 to inactive forms. DI03 inactivates both T4 and T3 by removing an inner ring iodine,
producing reverse T3 (rT3) and diiodotyrosine (T2), respectively.

This assay is designed to test the inhibitory activity of chemicals toward Xenopus iodotyrosine deiodinase (IYD)
enzyme. Enzyme is incubated in presence of test chemical for 3 hours during which uninhibited enzyme
liberates free iodide from substrate. Free iodide is measured in a second step using the Sandell-Kolthoff (SK)
method.

2.2	Scientific Principles: IYD genetic mutations in humans can result in hypothyroidism and goiter, with altered
neurological development in some documented cases (Moreno and Visser 2010). Chemical inhibition of IYD has
been shown to reduce 4 and increase thyroid gland size in rats (Green 1968, Greenl971, Meinhold and Buchholz

-------
1983), and delay or arrest metamorphosis in amphibians (Olker et al. 2018). Inhibition of IYD enzyme has been
identified as a molecular initiating event in one AOP currently under development for amphibians (AOP 188,
http://aopkb.org). Interference with IYD enzyme activity reduces available free iodide for use in the synthesis
of thyroid hormones. Alteration of thyroid hormone status through this mechanism could have potential
harmful effects on vertebrates similar to interference with other thyroid MIEs like TPO (thyroid peroxidase) or
NIS (sodium iodide symporter).

2.3	Experimental System: NA NA cell-free used. Microsomal fractions were prepared from 20 NF stage 50-60 X.
laevis livers. Microsomal preparations were used as the source of Xenopus IYD for the assay.

2.4	Metabolic Competence: This assay is done using a liver microsomal preparation, not live cells. Metabolic activity
of the microsomes has not been determined.

2.5	Exposure Regime: Chemical inhibition of Xenopus iodotyrosine deiodinase enzyme with assay developed at EPA
Great Lakes Toxicology and Ecology Division. Assay uses amphibian iodotyrosine deiodinase (hlYD) enzyme from
microsomal fractions of livers collected from larval Xenopus laevis. Assay is based on iodide release measured
by the Sandell-Kolthoff method and is run in 96-well plate format with assay plates run in triplicate (n=3).
Platewise-normalization was used based on positive control (3-Nitro-L-tyrosine, MNT) and negative/solvent
controls (DMSO, NaOH).

Baseline median absolute deviation for the assay (bmad): 6.096
Response cutoff threshold used to determine hit calls: 20
Detection technology used: Spectrophotometry (Absorbance )

2.6	Response: The assay measures the deiodination activity of Xenopus IYD on the substrate monoiodotyrosine
(MIT). After 3h incubation of substrate, enzyme, and test chemical, the components are applied to a 96-well
Dowex column. Free iodide is not retained by the column and is separated from interfering assay components.
The free iodide is measured by the Sandell-Kolthoff reaction in which the rate of reduction of cerium (Ce+4 to
Ce+3) by arsenic (As+3 to As+5) is increased by the presence of iodide. This is observed as the loss of yellow
color in the reaction and the quantified by the change in absorbance at 420 nm. The change in reaction rate is
proportional to the amount of iodide present.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.032 nM
Key positive control:
xanthohumol

Target (nominal) number of replicates:

5

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

-------
Thyroid Bioactivity: Assays related to the thyroid adverse outcome pathway network, Non-mammalian
Vertebrate: Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of deiodinase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data were processed by: a) determining the change in absorbance between the 1 and 20 minute
reaching for each well [abs420_lmin - abs420_20min]; b) calculating the net change in absorbance [change in
abs - mean(change in abs for 200 nM MNT wells)]; and c) normalizing to % of control [net change in
abs/mean(net change in abs for DMSO & NaOH control wells) *100%]. Percent inhibition was calculated as 100%
- % of control (uninhibited) reaction.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.

-------
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 20	Number of chemicals tested: 20

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
15

Inactive hit count: 0
-------
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

0.121
5.271
-70.57%

NA
NA

NA

-------
Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

-------
• Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Degitz, S. J., Olker, J. H., Denny, J. S., Degoey, P. P., Hartig, P. C., Cardon, M. C., Eytcheson, S. A.,
Haselman, J. T., Mayasich, S. A., & Hornung, M. W. (2024). In vitro screening of per- and polyfluorinated
substances (PFAS) for interference with seven thyroid hormone system targets across nine assays. Toxicology in
vitro	an international journal published in association with BIBRA, 95, 105762.
https://doi.Org/10.1016/j.tiv.2023.105762, Mayasich, S. A., Korte, J. J., Denny, J. S., Hartig, P. C., Olker, J. H.,
DeGoey, P., O'Flanagan, J., Degitz, S. J., & Hornung, M. W. (2021). Xenopus laevis and human type 3
iodothyronine deiodinase enzyme cross-species sensitivity to inhibition by ToxCast chemicals. Toxicology in vitro

an international journal published in association with BIBRA, 73, 105141.
https://doi.Org/10.1016/j.tiv.2021.105141

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3092

CCTE_G LTED_xlYD

1.	General Information

1.1	Assay Title: CCTE's Xenopus lodotyrosine Deiodinase (xlYD) Inhibition Assay, Great Lakes Toxicology and Ecology
Division (GLTED) Lab

1.2	Assay Summary: CCTE_GLTED_xlYD is a cell-free, single-readout assay that uses Xenopus (frog) iodotyrosine
deiodinase (xlYD) enzyme from liver microsomes. Assay is based on iodide release measured by the Sandell-
Kolthoff method and is run in 96-well plate format with assay plates run in triplicate (n=3) with measurements
taken at 3 hours after chemical dosing. Platewise-normalization was used based on positive control (3-Nitro-L-
tyrosine, MNT) and negative/solvent controls (DMSO, NaOH). CCTE_GLTED_xlYD is the assay component
measured from the CCTE_GLTED_xlYD assay. It measures substrate involved in regulation of catalytic activity
using spectrophotometry. The assay measures the deiodination activity of Xenopus (frog) IYD on the substrate
monoiodotyrosine (MIT). After 3h incubation of substrate, enzyme, and test chemical, the components are
applied to a 96-well Dowex column. Free iodide is not retained by the column and is separated from interfering
assay components. The free iodide is measured by the Sandell-Kolthoff reaction in which the rate of reduction
of cerium (Ce+4 to Ce+3) by arsenic (As+3 to As+5) is increased by the presence of iodide. This is observed as
the loss of yellow color in the reaction and the quantified by the change in absorbance at 420 nm. The change
in reaction rate is proportional to the amount of iodide present. Data from the assay component
CCTE_GLTED_xlYD was analyzed at the assay endpoint CCTE_GLTED_xlYD in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of enzyme reporter, loss-of-signal
activity can be used to understand iodotyrosine deiodinase activity. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the 'dehalogenase' intended target family, where the
subfamily is 'iodotyrosine deiodinase1.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA CCTE Great Lakes Toxicology and Ecology Division focuses on ecotoxicology and stressors
of water resources, including devleopment and implementation of in vitro and in vivo assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is non-proprietary.

1.9	Assay Throughput: 96-well plate. Assay is conducted in 96-well plate format. Throughput is such that 12
replicate plates can be run in one day, usually with 2 people, using 96-well pipettors. In this format 304 chemicals
can be tested in single concentration mode in triplicate in one day.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: The iodide recycling enzyme, iodotyrosine deiodinase (IYD), is one conserved putative molecular
target that plays an essential role in maintaining adequate levels of free iodide in the thyroid gland for hormone
synthesis.

-------
This assay is designed to test the binding activity of chemicals toward human transthyretin (TTR). TTR and the
fluorescent probe 8-anilino-l-naphthalenesulfonic acid ammonium salt (ANSA) are incubated in the presence
of test chemical for 2 hours. Chemicals with binding affinity for TTR displace ANSA resulting in a loss of
fluorescence, which is measured at the end of the incubation period.

2.2	Scientific Principles: Thyroid hormone serum binding proteins promote uniform distribution of circulating
thyroid hormones and facilitate delivery of thyroid hormone to target tissues. Chemical binding to TTR and
subsequent displacement of thyroid hormone may result in altered thyroid hormone homeostasis.

2.3	Experimental System: NA NA tissue-based cell-free used. Human transthyretin (>95% purity) was purchased
from Sigma Aldrich. Human TTR was delivered as lyophilized powder and reconstituted in 0.1 M phosphate
buffer, pH=7.5 to prepare a 5nM stock solution.

2.4	Metabolic Competence: This assay is done with pure protein in buffer, not live cells; thus, this assay is not
metabolically competent.

2.5	Exposure Regime: Human transthyretin (TTR) assay, developed at Great Lakes Toxicology and Ecology Division
(GLTED) within the EPA's Center for Computational Toxicology and Exposure (CCTE), is designed to test the
chemicals' ability to displace T4 from the TTR protein. TTR (0.125uM) is incubated in presence of test chemical
for 2 hours during which disrupting chemicals displace ANSA from the TTR protein. Results read fluorescence at
380 excitation and 475 emission.

Baseline median absolute deviation for the assay (bmad): 6.997
Response cutoff threshold used to determine hit calls: 20
Detection technology used: Spectrophotometry (Absorbance )

2.6	Response: The assay measures chemical binding to human TTR through displacement of ANSA. After 2h
incubation of the protein, ANSA, and test chemical, fluorescence is measured in a 96-well plate reader using
excitation and emission wavelengths of 380 and 475 nm, respectively. When chemicals bind to TTR, ANSA is
displaced and relative fluorescence units (RFUs) are decreased.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Thyroid Bioactivity: Assays related to the thyroid adverse outcome pathway network, Non-mammalian
Vertebrate: Assays associated with non-mammalian vertebrate species

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.032 nM
Key positive control:

3-Nitro-L-tyrosine (MNT)

Target (nominal) number of replicates:

7

Standard maximum concentration tested:

200 nM
Neutral vehicle control:

DMSO

-------
Additionally, this assay was annotated to the intended target family of dehalogenase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw data is measured as relative fluorescence units at 475 nM. Platewise-normalization was
used based on highest (1.8 nM) and lowest (0.0067 nM) concentrations of the positive control (Thyroxine, T4),
and each plate also included wells with a negative control (DMSO). Data were processed by: a) correcting for
background fluorescence [background corrected: value - mean(value for TTR wells)]; b) calculating the net
fluorescence [background corrected value - mean(background corrected value for 1.8 nM T4 wells)]; and c)
normalizing to % of control [net fluorescence/mean(net fluorescence for 0.0067 nM T4 wells) *100%]. Percent
activity was calculated as 100% - % of control (uninhibited) reaction.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active

-------
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 15	Number of chemicals tested: 15

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: Oihitc 0.9	NA hit count: hitc^O

12	3	0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	2

gain-loss (gnls) model:	0

power(pow) model:	5

linear-polynomial (polyl) model:	1

quadratic-polynomial(poly2) model:	5

exponential-2 (exp2) model:	0

exponential-3 (exp3) model:	2

exponential-4 (exp4) model:	0

exponential-5 (exp5) model:	0

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

-------
3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

0.083
7.055
270.74%

NA
NA

NA

Positive control signal-to-background: (pmed/nmed)

NA

-------
NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

-------
6.	Bibliography: Degitz, S. J., Olker, J. H., Denny, J. S., Degoey, P. P., Hartig, P. C., Cardon, M. C., Eytcheson, S. A.,
Haselman, J. T., Mayasich, S. A., & Hornung, M. W. (2024). In vitro screening of per- and polyfluorinated
substances (PFAS) for interference with seven thyroid hormone system targets across nine assays. Toxicology in
vitro	an international journal published in association with BIBRA, 95, 105762.
https://doi.Org/10.1016/j.tiv.2023.105762

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3264

CCTE_G UED_hTTR_0.125uM

1.	General Information

1.1	Assay Title: CCTE's Human Transthyretin (hTTR) Inhibition Assay, with 0.125uM TTR, Great Lakes Toxicology and
Ecology Division (GLTED) Lab

1.2	Assay Summary: CCTE_GLTED_hTTR is a cell-free, single-readout assay designed to test the chemicals' ability to
displace T4 from the human transthyretin (TTR) protein. TTR is incubated in presence of test chemical for 2
hours during which disrupting chemicals displace ANSA from the TTR protein. Results read fluorescence at 380
excitation and 475 emission. CCTE_GLTED_hTTR_0.125uM is the assay component measured from the
CCTE_GLTED_hTTR assay. It is designed to make measurements of fluorescent polarization, using a form of
binding reporter, as detected with Fluorescence signals by Fluorescence technology. Data from the assay
component CCTE_GLTED_hTTR_0.125uM was analyzed at the assay endpoint CCTE_GLTED_hTTR_0.125uM in
the positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a
type of binding reporter, loss-of-signal activity can be used to understand a chemical's ability to displace T4 from
transthyretin. To generalize the intended target to other relatable targets, this assay endpoint is annotated to
the 'transporter' intended target family, where the subfamily is 'hormone carrier protein'.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The EPA CCTE Great Lakes Toxicology and Ecology Division focuses on ecotoxicology and stressors
of water resources, including devleopment and implementation of in vitro and in vivo assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is non-proprietary.

1.9	Assay Throughput: 96-well plate. Assay is conducted in 96-well plate format. Throughput is such that 12
replicate plates can be run in one day, usually with 2 people, using 96-well pipettors. In this format 304 chemicals
can be tested in single concentration mode in triplicate in one day.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Transthyretin (TTR) is one the major transport proteins responsible for binding to and transporting
thyroid hormones to the necessary tissues.

This assay is designed to test the binding activity of chemicals toward human transthyretin (TTR). TTR and the
fluorescent probe 8-anilino-l-naphthalenesulfonic acid ammonium salt (ANSA) are incubated in the presence
of test chemical for 2 hours. Chemicals with binding affinity for TTR displace ANSA resulting in a loss of
fluorescence, which is measured at the end of the incubation period.

2.2	Scientific Principles: Thyroid hormone serum binding proteins promote uniform distribution of circulating
thyroid hormones and facilitate delivery of thyroid hormone to target tissues. Chemical binding to TTR and
subsequent displacement of thyroid hormone may result in altered thyroid hormone homeostasis.

-------
2.3

Experimental System: NA NA cell-free used. Human transthyretin (>95% purity) was purchased from Sigma
Aldrich. Human TTR was delivered as lyophilized powder and reconstituted in 0.1 M phosphate buffer, pH=7.5
to prepare a 5nM stock solution.

2.4	Metabolic Competence: This assay is done with pure protein in buffer, not live cells; thus, this assay is not
metabolically competent.

2.5	Exposure Regime: Human transthyretin (TTR) assay, developed at Great Lakes Toxicology and Ecology Division
(GLTED) within the EPA's Center for Computational Toxicology and Exposure (CCTE), is designed to test the
chemicals' ability to displace T4 from the TTR protein. TTR (0.125uM) is incubated in presence of test chemical
for 2 hours during which disrupting chemicals displace ANSA from the TTR protein. Results read fluorescence at
380 excitation and 475 emission.

Baseline median absolute deviation for the assay (bmad): 7.277
Response cutoff threshold used to determine hit calls: 20
Detection technology used: Spectrophotometry (Fluorescence)

2.6	Response: The assay measures chemical binding to human TTR through displacement of ANSA. After 2h
incubation of the protein, ANSA, and test chemical, fluorescence is measured in a 96-well plate reader using
excitation and emission wavelengths of 380 and 475 nm, respectively. When chemicals bind to TTR, ANSA is
displaced and relative fluorescence units (RFUs) are decreased.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Thyroid Bioactivity: Assays related to the thyroid adverse outcome pathway network

Additionally, this assay was annotated to the intended target family of transporter.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
12

Standard minimum concentration tested:

0.023 nM
Key positive control:

3,5,3,5-tetraiodothyronine (T4)

Target (nominal) number of replicates:

5

Standard maximum concentration tested:

150 nM
Neutral vehicle control:

DMSO

-------
3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw data is measured as relative fluorescence units at 475 nM. Platewise-normalization was
used based on highest (1.8 nM) and lowest (0.0067 nM) concentrations of the positive control (Thyroxine, T4),
and each plate also included wells with a negative control (DMSO). Data were processed by: a) correcting for
background fluorescence [background corrected: value - mean(value for TTR wells)]; b) calculating the net
fluorescence [background corrected value - mean(background corrected value for 1.8 nM T4 wells)]; and c)
normalizing to % of control [net fluorescence/mean(net fluorescence for 0.0067 nM T4 wells) *100%]. Percent
activity was calculated as 100% - % of control (uninhibited) reaction.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates

-------
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 283	Number of chemicals tested: 283

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-9.706

Neutral control median absolute deviation, by plate: nmad	1.473

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-13.94%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

-------
Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 123.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Degitz, S. J., Olker, J. H., Denny, J. S., Degoey, P. P., Hartig, P. C., Cardon, M. C., Eytcheson, S. A.,
Haselman, J. T., Mayasich, S. A., & Hornung, M. W. (2024). In vitro screening of per- and polyfluorinated
substances (PFAS) for interference with seven thyroid hormone system targets across nine assays. Toxicology in
vitro	an international journal published in association with BIBRA, 95, 105762.
https://doi.Org/10.1016/j.tiv.2023.105762

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: Till

CCTE_M u ndy_HCI_Cortica l_NOG_BPCount

1.	General Information

1.1	Assay Title: CCTE's Neurite Outgrowth Assay in Rat Cortical Cells for Branch Point Count, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_Cortical_NOG is a multiplexed, cell-based-readout assay that uses rat
primary cortical cells with measurements taken at 50.33 hours after chemical dosing in a microplate: 96-well
plate. CCTE_Mundy_HCI_Cortical_NOG_BPCount is one of four components of the
CCTE_Mundy_HCI_Cortical_NOG assay. It measures neurite outgrowth related to the number of branch points
using High Content Imaging of fluorescently labelled markers. Data from the assay component
CCTE_Mundy_HCI_Cortical_NOG_BPCount	was	analyzed	at	the	endpoint
CCTE_Mundy_HCI_Cortical_NOG_BPCount in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of morphology reporter, loss -of-signal activity can be
used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The assay as described below is medium-to-high throughput. It uses primary
cortical rat neurons, seeded on a 96-well plate. Each plate may contain 8 test compounds at up to 11
concentrations, in addition to vehicle controls, assay positive controls, and blanks. Cell cultures are exposed to
chemicals for 48 hours prior to fixation and analysis. Each experiment should be replicated on three separate
plates from the same culture preparation.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the number of branch points are indicative of neurodevelopment.

The CCTE_Mundy_HCI_Cortical_NOG assay is designed to investigate changes in neurite outgrowth (NOG) in
response to chemical exposure in developing rat cortical neurons using a high-content imaging (HCI) technology.
Neurite outgrowth is one of several key processes of neurodevelopment. This endpoint measures changes in
neurite branch point count.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of

-------
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together.

2.3	Experimental System: adherent NA primary cell used. Primary rat cortical cultures are prepared on site from
the neocortex dissected from the CNS of newborn (PNDO) Long-Evans rat pups. In a typical culture, cells are
isolated from the combined cortices of 3-5 pups, seeded onto a Poly-L-Lysine coated 96-well plate at a density
of 10,000 cells/well and are allowed 2 hours to attach. The cells are maintained in a humidified incubator at 37C
and 5% C02. Sex of pups is not determined, and cultures are presumed to consist of a mixture of male and
female pups since multiple pups are used for each culture.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Exposure starts 2 hours post-plating by adding 10 ul of the working solution to 90 ul of media
in the wells for a 1/10 dilution. Forty-eight hours after chemical treatment cells are fixed by direct addition of
100 ul of warm (37C) Dulbecco's phosphate buffered saline (DPBS) fixative solution containing 8%
paraformaldehyde, 8% sucrose, and 0.1% of 3 mg/ml Hoechst 33342 into each well. Primary antibodies are
prepared by dilution in Immunocytochemical Staining Buffer (ISB), 10X Dulbecco's PBS, 0.1% Saponin, 5% Bovine
Serum Albumin, 0.5% NaN3 (Sodium Azide) and pill-tubulin (rabbit anti-pill-tubulin, 1:800) followed by Alexa
Fluor-488 secondary (1:500) to label neuronal cell bodies and neurites. A Cellomics ArrayScan VTi HCS Reader is
used for automated image acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 8.753
Response cutoff threshold used to determine hit calls: 30
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe neurite outgrowth in a rat primary cell
culture, via the immunocytochemical labelling of cell bodies and neurites. An automated image analysis protocol
is employed to systematically identify targeted structures based on preassigned criteria. Ultimately, changes in
the number and length of young outgrowths is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

10 nM
Key positive control:

Rac 1 inhibitor (lOuM)

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO or water

-------
Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A Cellomics ArrayScan VTi HCS Reader (Thermo Fisher Scientific) is used for automated image
acquisition and analysis of neurite outgrowth (Operating Procedure for High Content Imaging of Neurite
Outgrowth: OP-NHEERL-H/ISTD/SBB/TMF/2018-008-rl). Images are acquired using a 20x Pan NeoFLUAR (NA =
0.4) objective with a solid state LED light source, and an XF100 two channel dichroic filter set with excitation at
365(50) and 475(40) and emission at 535(45). Images are analyzed using the Cellomics Neuronal Profiling
BioApplication (version 4) to measure neurite morphology. Optimization of nuclear masking and selection, cell
body masking and selection, and neurite tracing parameters is performed on untreated cultures at 48 h after
initial plating. In each well, multiple unique fields-of-view are acquired until at least 300 neurons are counted.
Four morphological features are quantified: 1) number of cells (neurons) per field, 2) total neurite length per
neuron, 3) number of neurites per neuron, and 4) number of neurite branch points per neuron. Neurites are
defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.)

-------
Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 270	Number of chemicals tested: 237

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
97

Inactive hit count: 0
-------
exponentials (exp5) model:

32

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

3.47

Neutral control median absolute deviation, by plate: nmad

0.334

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

9.9%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

0.371

0.44

-------
Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - rimed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-7.13

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	2.38

Negative control well median absolute deviation value, by plate: mmad	0.423

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	0.262

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 32.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.

5.1

Potential Regulatory Applications

Context of Use: Examples of end use scenarios could include, but are not limited to:

-------
•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2778

CCTE_M u ndy_HCI_Cortica l_NOG_Neu riteCou nt

1.	General Information

1.1	Assay Title: CCTE's Neurite Outgrowth Assay in Rat Cortical Cells for Neurite Count, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_Cortical_NOG is a multiplexed, cell-based-readout assay that uses rat
primary cortical cells with measurements taken at 50.33 hours after chemical dosing in a microplate: 96-well
plate. CCTE_Mundy_HCI_Cortical_NOG_NeuriteCount is one of four components of the
CCTE_Mundy_HCI_Cortical_NOG assay. It measures neurite outgrowth related to the number of neurites using
High Content Imaging of fluorescently labelled markers. Data from the assay component
CCTE_Mundy_HCI_Cortical_NOG_NeuriteCount was analyzed at the endpoint
CCTE_Mundy_HCI_Cortical_NOG_NeuriteCount in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of morphology reporter, loss -of-signal activity can be
used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The assay as described below is medium-to-high throughput. It uses primary
cortical rat neurons, seeded on a 96-well plate. Each plate may contain 8 test compounds at up to 11
concentrations, in addition to vehicle controls, assay positive controls, and blanks. Cell cultures are exposed to
chemicals for 48 hours prior to fixation and analysis. Each experiment should be replicated on three separate
plates from the same culture preparation.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the number of neurites are indicative of neurodevelopment.

The CCTE_Mundy_HCI_Cortical_NOG assay is designed to investigate changes in neurite outgrowth (NOG) in
response to chemical exposure in developing rat cortical neurons using a high-content imaging (HCI) technology.
Neurite outgrowth is one of several key processes of neurodevelopment. This endpoint measures the number
of neurites.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of

-------
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together.

2.3	Experimental System: adherent NA primary cell used. Primary rat cortical cultures are prepared on site from
the neocortex dissected from the CNS of newborn (PNDO) Long-Evans rat pups. In a typical culture, cells are
isolated from the combined cortices of 3-5 pups, seeded onto a Poly-L-Lysine coated 96-well plate at a density
of 10,000 cells/well and are allowed 2 hours to attach. The cells are maintained in a humidified incubator at 37C
and 5% C02. Sex of pups is not determined, and cultures are presumed to consist of a mixture of male and
female pups since multiple pups are used for each culture.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Exposure starts 2 hours post-plating by adding 10 ul of the working solution to 90 ul of media
in the wells for a 1/10 dilution. Forty-eight hours after chemical treatment cells are fixed by direct addition of
100 ul of warm (37C) Dulbecco's phosphate buffered saline (DPBS) fixative solution containing 8%
paraformaldehyde, 8% sucrose, and 0.1% of 3 mg/ml Hoechst 33342 into each well. Primary antibodies are
prepared by dilution in Immunocytochemical Staining Buffer (ISB), 10X Dulbecco's PBS, 0.1% Saponin, 5% Bovine
Serum Albumin, 0.5% NaN3 (Sodium Azide) and pill-tubulin (rabbit anti-pill-tubulin, 1:800) followed by Alexa
Fluor-488 secondary (1:500) to label neuronal cell bodies and neurites. A Cellomics ArrayScan VTi HCS Reader is
used for automated image acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 4.475
Response cutoff threshold used to determine hit calls: 30
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe neurite outgrowth in a rat primary cell
culture, via the immunocytochemical labelling of cell bodies and neurites. An automated image analysis protocol
is employed to systematically identify targeted structures based on preassigned criteria. Ultimately, changes in
the number and length of young outgrowths is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

10 nM
Key positive control:

Rac 1 inhibitor (lOuM)

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO or water

-------
Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A Cellomics ArrayScan VTi HCS Reader (Thermo Fisher Scientific) is used for automated image
acquisition and analysis of neurite outgrowth (Operating Procedure for High Content Imaging of Neurite
Outgrowth: OP-NHEERL-H/ISTD/SBB/TMF/2018-008-rl). Images are acquired using a 20x Pan NeoFLUAR (NA =
0.4) objective with a solid state LED light source, and an XF100 two channel dichroic filter set with excitation at
365(50) and 475(40) and emission at 535(45). Images are analyzed using the Cellomics Neuronal Profiling
BioApplication (version 4) to measure neurite morphology. Optimization of nuclear masking and selection, cell
body masking and selection, and neurite tracing parameters is performed on untreated cultures at 48 h after
initial plating. In each well, multiple unique fields-of-view are acquired until at least 300 neurons are counted.
Four morphological features are quantified: 1) number of cells (neurons) per field, 2) total neurite length per
neuron, 3) number of neurites per neuron, and 4) number of neurite branch points per neuron. Neurites are
defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.)

-------
Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 270	Number of chemicals tested: 237

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
88

Inactive hit count: 0
-------
exponentials (exp5) model:

30

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

2.72

Neutral control median absolute deviation, by plate: nmad

0.119

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

4.48%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

0.633

0.345

-------
Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - rimed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-5.105

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	2.465

Negative control well median absolute deviation value, by plate: mmad	0.104

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.07

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 30.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.

5.1

Potential Regulatory Applications

Context of Use: Examples of end use scenarios could include, but are not limited to:

-------
•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2779

CCTE_Mundy_HCI_Cortical_NOG_NeuriteLength

1.	General Information

1.1	Assay Title: CCTE's Neurite Outgrowth Assay in Rat Cortical Cells for Neurite Length, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_Cortical_NOG is a multiplexed, cell-based-readout assay that uses rat
primary cortical cells with measurements taken at 50.33 hours after chemical dosing in a microplate: 96-well
plate. CCTE_Mundy_HCI_Cortical_NOG_NeuriteLength is one of four components of the
CCTE_Mundy_HCI_Cortical_NOG assay. It measures neurite outgrowth related to the neurite length using High
Content Imaging of fluorescently labelled markers. Data from the assay component
CCTE_Mundy_HCI_Cortical_NOG_NeuriteLength was analyzed at the endpoint
CCTE_Mundy_HCI_Cortical_NOG_NeuriteLength in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of morphology reporter, loss -of-signal activity can be
used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The assay as described below is medium-to-high throughput. It uses primary
cortical rat neurons, seeded on a 96-well plate. Each plate may contain 8 test compounds at up to 11
concentrations, in addition to vehicle controls, assay positive controls, and blanks. Cell cultures are exposed to
chemicals for 48 hours prior to fixation and analysis. Each experiment should be replicated on three separate
plates from the same culture preparation.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the neurite length are indicative of neurodevelopment.

The CCTE_Mundy_HCI_Cortical_NOG assay is designed to investigate changes in neurite outgrowth (NOG) in
response to chemical exposure in developing rat cortical neurons using a high-content imaging (HCI) technology.
Neurite outgrowth is one of several key processes of neurodevelopment. This endpoint measures the length of
the neurites.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of

-------
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together.

2.3	Experimental System: adherent NA primary cell used. Primary rat cortical cultures are prepared on site from
the neocortex dissected from the CNS of newborn (PNDO) Long-Evans rat pups. In a typical culture, cells are
isolated from the combined cortices of 3-5 pups, seeded onto a Poly-L-Lysine coated 96-well plate at a density
of 10,000 cells/well and are allowed 2 hours to attach. The cells are maintained in a humidified incubator at 37C
and 5% C02. Sex of pups is not determined, and cultures are presumed to consist of a mixture of male and
female pups since multiple pups are used for each culture.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Exposure starts 2 hours post-plating by adding 10 ul of the working solution to 90 ul of media
in the wells for a 1/10 dilution. Forty-eight hours after chemical treatment cells are fixed by direct addition of
100 ul of warm (37C) Dulbecco's phosphate buffered saline (DPBS) fixative solution containing 8%
paraformaldehyde, 8% sucrose, and 0.1% of 3 mg/ml Hoechst 33342 into each well. Primary antibodies are
prepared by dilution in Immunocytochemical Staining Buffer (ISB), 10X Dulbecco's PBS, 0.1% Saponin, 5% Bovine
Serum Albumin, 0.5% NaN3 (Sodium Azide) and pill-tubulin (rabbit anti-pill-tubulin, 1:800) followed by Alexa
Fluor-488 secondary (1:500) to label neuronal cell bodies and neurites. A Cellomics ArrayScan VTi HCS Reader is
used for automated image acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 7.172
Response cutoff threshold used to determine hit calls: 30
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe neurite outgrowth in a rat primary cell
culture, via the immunocytochemical labelling of cell bodies and neurites. An automated image analysis protocol
is employed to systematically identify targeted structures based on preassigned criteria. Ultimately, changes in
the number and length of young outgrowths is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

10 nM
Key positive control:

Rac 1 inhibitor (lOuM)

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO or water

-------
Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A Cellomics ArrayScan VTi HCS Reader (Thermo Fisher Scientific) is used for automated image
acquisition and analysis of neurite outgrowth (Operating Procedure for High Content Imaging of Neurite
Outgrowth: OP-NHEERL-H/ISTD/SBB/TMF/2018-008-rl). Images are acquired using a 20x Pan NeoFLUAR (NA =
0.4) objective with a solid state LED light source, and an XF100 two channel dichroic filter set with excitation at
365(50) and 475(40) and emission at 535(45). Images are analyzed using the Cellomics Neuronal Profiling
BioApplication (version 4) to measure neurite morphology. Optimization of nuclear masking and selection, cell
body masking and selection, and neurite tracing parameters is performed on untreated cultures at 48 h after
initial plating. In each well, multiple unique fields-of-view are acquired until at least 300 neurons are counted.
Four morphological features are quantified: 1) number of cells (neurons) per field, 2) total neurite length per
neuron, 3) number of neurites per neuron, and 4) number of neurite branch points per neuron. Neurites are
defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.)

-------
Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 300	Number of chemicals tested: 262

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
101

Inactive hit count: 0
-------
exponentials (exp5) model:

34

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

140.882

Neutral control median absolute deviation, by plate: nmad

10.578

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

7.17%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

25.585

8.755

-------
Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - rimed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-6.286

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	116.17

Negative control well median absolute deviation value, by plate: mmad	7.761

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	0.062

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 34.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.

5.1

Potential Regulatory Applications

Context of Use: Examples of end use scenarios could include, but are not limited to:

-------
•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2780

CCTE_Mundy_HCI_Cortical_NOG_NeuronCount

1.	General Information

1.1	Assay Title: CCTE's Neurite Outgrowth Assay in Rat Cortical Cells for Neuron Count, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_Cortical_NOG is a multiplexed, cell-based-readout assay that uses rat
primary cortical cells with measurements taken at 50.33 hours after chemical dosing in a microplate: 96-well
plate. CCTE_Mundy_HCI_Cortical_NOG_NeuronCount is one of four components of the
CCTE_Mundy_HCI_Cortical_NOG assay. It measures cell viability related to the number of neurons using High
Content Imaging of fluorescently labelled markers. Data from the assay component
CCTE_Mundy_HCI_Cortical_NOG_NeuronCount was analyzed at the endpoint
CCTE_Mundy_HCI_Cortical_NOG_NeuronCount in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of morphology reporter, loss -of-signal activity can be
used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The assay as described below is medium-to-high throughput. It uses primary
cortical rat neurons, seeded on a 96-well plate. Each plate may contain 8 test compounds at up to 11
concentrations, in addition to vehicle controls, assay positive controls, and blanks. Cell cultures are exposed to
chemicals for 48 hours prior to fixation and analysis. Each experiment should be replicated on three separate
plates from the same culture preparation.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the number of neurons are indicative of neurodevelopment.

The CCTE_Mundy_HCI_Cortical_NOG assay is designed to investigate changes in neurite outgrowth (NOG) in
response to chemical exposure in developing rat cortical neurons using a high-content imaging (HCI) technology.
Neurite outgrowth is one of several key processes of neurodevelopment. This endpoint measures the number
of neurons as a measure of cytotoxicity.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of

-------
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together.

2.3	Experimental System: adherent NA primary cell used. Primary rat cortical cultures are prepared on site from
the neocortex dissected from the CNS of newborn (PNDO) Long-Evans rat pups. In a typical culture, cells are
isolated from the combined cortices of 3-5 pups, seeded onto a Poly-L-Lysine coated 96-well plate at a density
of 10,000 cells/well and are allowed 2 hours to attach. The cells are maintained in a humidified incubator at 37C
and 5% C02. Sex of pups is not determined, and cultures are presumed to consist of a mixture of male and
female pups since multiple pups are used for each culture.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Exposure starts 2 hours post-plating by adding 10 ul of the working solution to 90 ul of media
in the wells for a 1/10 dilution. Forty-eight hours after chemical treatment cells are fixed by direct addition of
100 ul of warm (37C) Dulbecco's phosphate buffered saline (DPBS) fixative solution containing 8%
paraformaldehyde, 8% sucrose, and 0.1% of 3 mg/ml Hoechst 33342 into each well. Primary antibodies are
prepared by dilution in Immunocytochemical Staining Buffer (ISB), 10X Dulbecco's PBS, 0.1% Saponin, 5% Bovine
Serum Albumin, 0.5% NaN3 (Sodium Azide) and pill-tubulin (rabbit anti-pill-tubulin, 1:800) followed by Alexa
Fluor-488 secondary (1:500) to label neuronal cell bodies and neurites. A Cellomics ArrayScan VTi HCS Reader is
used for automated image acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 9.997
Response cutoff threshold used to determine hit calls: 30
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe neurite outgrowth in a rat primary cell
culture, via the immunocytochemical labelling of cell bodies and neurites. An automated image analysis protocol
is employed to systematically identify targeted structures based on preassigned criteria. Ultimately, changes in
the number and length of young outgrowths is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

10 nM
Key positive control:

Rac 1 inhibitor (lOuM)

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO or water

-------
Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A Cellomics ArrayScan VTi HCS Reader (Thermo Fisher Scientific) is used for automated image
acquisition and analysis of neurite outgrowth (Operating Procedure for High Content Imaging of Neurite
Outgrowth: OP-NHEERL-H/ISTD/SBB/TMF/2018-008-rl). Images are acquired using a 20x Pan NeoFLUAR (NA =
0.4) objective with a solid state LED light source, and an XF100 two channel dichroic filter set with excitation at
365(50) and 475(40) and emission at 535(45). Images are analyzed using the Cellomics Neuronal Profiling
BioApplication (version 4) to measure neurite morphology. Optimization of nuclear masking and selection, cell
body masking and selection, and neurite tracing parameters is performed on untreated cultures at 48 h after
initial plating. In each well, multiple unique fields-of-view are acquired until at least 300 neurons are counted.
Four morphological features are quantified: 1) number of cells (neurons) per field, 2) total neurite length per
neuron, 3) number of neurites per neuron, and 4) number of neurite branch points per neuron. Neurites are
defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 27:

-------
ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis
direction. Typically used for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 300	Number of chemicals tested: 262

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
82

Inactive hit count: Oihitc 0.9
137

WINING MODEL SELECTION

NA hit count: hitc^O
SI

Number of sample-assay endpoints with winning hill model:

20
43

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

13

109

quadratic-polynomialfpoly2) model: 28

exponential-2 (exp2) model:

10

-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

55

19

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

24.352

Neutral control median absolute deviation, by plate: nmad

2.287

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

10.1%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed	16.83

Positive control well median absolute deviation, by plate: pmad	2.728

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.567

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	19.96

Negative control well median absolute deviation value, by plate: mmad	2.721

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.044

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 19.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2781

CCTE_Mundy_HCI_Cortical_Synap_Neur_Matur_BPCount

1.	General Information

1.1	Assay Title: CCTE's Synaptogenesis and Neurite Maturation Assay in Rat Cortical Cells for Branch Point Count,
Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_Cortical_Synap&Neurite_Matur is a multiplexed, cell-based-readout assay
that uses rat primary cortical cells with measurements taken at 288.33 hours after chemical dosing in a
microplate: 96-well plate. CCTE_Mundy_HCI_Cortical_Synap_Neur_Matur_BPCount is one of eight components
of the CCTE_Mundy_HCI_Cortical_Synap&Neurite_Matur assay. It measures synaptogenesis and neurite
maturation related to number of branch points using High Content Imaging of fluorescently labelled markers.
Data from the assay component CCTE_Mundy_HCI_Cortical_Synap_Neur_Matur_BPCount was analyzed at the
endpoint CCTE_Mundy_HCI_Cortical_Synap&Neur_Matur_BPCount in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter, loss -of-
signal activity can be used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The assay is medium-to-high throughput. It uses primary cortical rat neurons,
seeded on a 96-well plate. Each plate may contain 6 test compounds at up to 9 concentrations, in addition to
vehicle controls, assay positive controls, and blanks. Cell cultures are exposed to chemicals for 5 days prior to
fixation and analysis. Each experiment should be replicated on three separate plates from the same culture
preparation.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the number of branch points are indicative of neurodevelopment.

The CCTE_Mundy_HCI_Cortical_Synap&Neur assay is designed to investigate changes in synaptogenesis and
neurite maturation in response to chemical exposure in developing rat cortical neurons using a high-content
imaging (HCI) technology. Synaptogenesis is one of several key processes of neurodevelopment. This endpoint
measures the number of branch points as a measure of neurite maturation.

2.2	Scientific Principles: During the development of the nervous system, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is synaptogenesis, where individual cells form close connections that allow for communication by

-------
chemical neurotransmitter. These interconnections between groups of neurons give rise to networks of cells
that connect the nervous system together. This assay utilizes a high-content imaging solution to describe
synaptogenesis in a rat primary cell culture, via the immunocytochemical labelling of cell bodies, neurites and
presynaptic structures.

2.3	Experimental System: adherent NA primary cell used. Primary cortical cultures consist of a mixture of
glutamatergic and GABAergic neurons, as well as glial cells (oligodendrocytes and a few microglia) as
characterized by immunocytochemistry and functional responses to pharmacological agents (Freudenrich and
Mundy, 2000; McConnell et al., 2011; Frank et al., 2017). Primary rat cortical cultures are prepared from the
neocortex dissected from the central nervous system (CNS) of newborn (PND0) Long-Evans rat pups using a
standard protocol (Section 3.7). In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
seeded onto a Poly-L-Lysine coated 96-well plate at a density of 10,000 cells/well and are allowed 2 hours to
attach. Sex of pups is not determined, and cultures are presumed to consist of a mixture of male and female
pups since multiple pups are used for each culture.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Cells are plated onto 96 well plates and allowed 2 hr for attachment to the substrate. At days
in vitro (DIV) 5, Cytosine Arabinoside (AraC) is added to arrest glial cell growth. Chemicals are dosed and a media
change is performed at DIV 7 of differentiation and is continued through DIV 12 when the experiment is
terminated. Cells are fixed with 20% paraformaldehyde and Hoechst Dye. Cells are stained using Millipore
Mouse anti-MAP2MAB3418 (1:800) and Millipore Rabbit anti-Synaptophysinsc-1750 (1:250) primary antibodies
and Alexa Fluor-488 goat anti-mouse and AlexaFluor-546 goat anti-rabbit secondary antibodies to label
neuronal cell nuclei, synapses and neurites. A Cellomics ArrayScan VTi HCS Reader is used for automated image
acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 6.419
Response cutoff threshold used to determine hit calls: 30
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe synaptogenesis and neurite length in
a rat primary cell culture, via the immunocytochemical labelling of cell bodies and neurites. An automated image
analysis protocol is employed to systematically identify targeted structures based on preassigned criteria.
Ultimately, changes in the neurite length, neurite count, number of branch points, the number of pre-synaptic
puncta, and number of synapses is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
9

Standard minimum concentration tested:

0.3 nM
Key positive control:

sodium orthovanadate (lOuM)

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO or water

-------
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All primary and secondary endpoints are assessed based on an immunocytochemical staining
(ICC) of images for each well. Five days after chemical treatment cells were fixed with warm (37C) 4%
paraformaldehyde containing 1.5 ng/ml Hoechst 33342 for 20 min followed by permeabilization and blocking
steps. Cell bodies and dendrites were labeled using a rabbit primary antibody for microtubule associated protein
2 (MAP2) (Millipore Catalog AB5622,1:800) and mouse antibody for synaptophysin (Santa Cruz catalog number
SC-17750, 1:200) followed by AlexaFluor 488 goat anti-rabbit secondary antibody (Molecular Probes catalog
number A11034, 1:500) and AlexaFluor 546 goat antimouse secondary antibody (Molecular Probes catalog
number A11029, 1:500). Images are analyzed using the Cellomics Neuronal Profiling BioApplication (version 4)
to measure neurite morphology. Optimization of nuclear masking and selection, cell body masking and
selection, and neurite tracing parameters is performed on untreated cultures at DIV12 after initial plating. In
each well, multiple unique fields-of-view are acquired until at least 200 neurons are counted. Eight
morphological features are quantified (see table 8.4.1) Neurites are defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 265	Number of chemicals tested: 233

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
108

Inactive hit count: 0
-------
power(pow) model:
linear-polynomial (polyl) model:

22

88

quadratic-polynomialfpoly2) model: 23

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

6

38

66

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-------
8.328

Neutral control median absolute deviation, by plate: nmad	0.678

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	6.71%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	6.775

Positive control well median absolute deviation, by plate: pmad	0.712

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-2.792

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	4.326

Negative control well median absolute deviation value, by plate: mmad	0.668

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.743

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 38.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2782

CCTE_Mu ndy_HCI_Cortical_Synap_Neu r_Matu r_Cel I BodySpotCount

1.	General Information

1.1	Assay Title: CCTE's Synaptogenesis and Neurite Maturation Assay in Rat Cortical Cells for Cell Body Spot Count,
Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_Cortical_Synap&Neurite_Matur is a multiplexed, cell-based-readout assay
that uses rat primary cortical cells with measurements taken at 288.33 hours after chemical dosing in a
microplate: 96-well plate. CCTE_Mundy_HCI_Cortical_Synap_Neur_Matur_CellBodySpotCount is one of eight
components of the CCTE_Mundy_HCI_Cortical_Synap&Neurite_Matur assay. It measures synaptogenesis and
neurite maturation related to number of cell body-associated synapses (synaptophysin puncta) per neuron using
High Content Imaging of fluorescently labelled markers. Data from the assay component
CCTE_Mundy_HCI_Cortical_Synap_Neur_Matur_CellBodySpotCount was analyzed at the endpoint
CCTE_Mundy_HCI_Cortical_Synap&Neur_Matur_CellBodySpotCount in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter, loss -of-
signal activity can be used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The assay is medium-to-high throughput. It uses primary cortical rat neurons,
seeded on a 96-well plate. Each plate may contain 6 test compounds at up to 9 concentrations, in addition to
vehicle controls, assay positive controls, and blanks. Cell cultures are exposed to chemicals for 5 days prior to
fixation and analysis. Each experiment should be replicated on three separate plates from the same culture
preparation.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the number of cell body-associated synapses (synaptophysin puncta) per neuron are
indicative of neurodevelopment.

The CCTE_Mundy_HCI_Cortical_Synap&Neur assay is designed to investigate changes in synaptogenesis and
neurite maturation in response to chemical exposure in developing rat cortical neurons using a high-content
imaging (HCI) technology. Synaptogenesis is one of several key processes of neurodevelopment. This endpoint
measures the number of pre-synaptic puncta in the cell body compartment as a measure of synaptogenesis.

2.2	Scientific Principles: During the development of the nervous system, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental

-------
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is synaptogenesis, where individual cells form close connections that allow for communication by
chemical neurotransmitter. These interconnections between groups of neurons give rise to networks of cells
that connect the nervous system together. This assay utilizes a high-content imaging solution to describe
synaptogenesis in a rat primary cell culture, via the immunocytochemical labelling of cell bodies, neurites and
presynaptic structures.

2.3	Experimental System: adherent NA primary cell used. Primary cortical cultures consist of a mixture of
glutamatergic and GABAergic neurons, as well as glial cells (oligodendrocytes and a few microglia) as
characterized by immunocytochemistry and functional responses to pharmacological agents (Freudenrich and
Mundy, 2000; McConnell et al., 2011; Frank et al., 2017). Primary rat cortical cultures are prepared from the
neocortex dissected from the central nervous system (CNS) of newborn (PND0) Long-Evans rat pups using a
standard protocol (Section 3.7). In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
seeded onto a Poly-L-Lysine coated 96-well plate at a density of 10,000 cells/well and are allowed 2 hours to
attach. Sex of pups is not determined, and cultures are presumed to consist of a mixture of male and female
pups since multiple pups are used for each culture.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Cells are plated onto 96 well plates and allowed 2 hr for attachment to the substrate. At days
in vitro (DIV) 5, Cytosine Arabinoside (AraC) is added to arrest glial cell growth. Chemicals are dosed and a media
change is performed at DIV 7 of differentiation and is continued through DIV 12 when the experiment is
terminated. Cells are fixed with 20% paraformaldehyde and Hoechst Dye. Cells are stained using Millipore
Mouse anti-MAP2MAB3418 (1:800) and Millipore Rabbit anti-Synaptophysinsc-1750 (1:250) primary antibodies
and Alexa Fluor-488 goat anti-mouse and AlexaFluor-546 goat anti-rabbit secondary antibodies to label
neuronal cell nuclei, synapses and neurites. A Cellomics ArrayScan VTi HCS Reader is used for automated image
acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 11.428
Response cutoff threshold used to determine hit calls: 34.284
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe synaptogenesis and neurite length in
a rat primary cell culture, via the immunocytochemical labelling of cell bodies and neurites. An automated image
analysis protocol is employed to systematically identify targeted structures based on preassigned criteria.
Ultimately, changes in the neurite length, neurite count, number of branch points, the number of pre-synaptic
puncta, and number of synapses is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
9

Standard minimum concentration tested:

3 nM
Key positive control:

sodium orthovanadate (lOuM)

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO or water

-------
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A Cellomics ArrayScan VTi HCS Reader (Thermo Fisher Scientific) is used for automated image
acquisition and analysis of cortical maturation and synaptogenesis (Operating Procedure Neurite Outgrowth:
OP-NHEERL-H/ISTD/SBB/TMF/2018-008-rl). Images are acquired using a 20x Pan NeoFLUAR (NA = 0.4) objective
with a solid state LED light source, and an XF100 three channel dichroic filter set with excitation at 365(50) and
475(40) and emission at 535(45). Images are analyzed using the Cellomics Neuronal Profiling BioApplication
(version 4) to measure neurite morphology. Optimization of nuclear masking and selection, cell body masking
and selection, and neurite tracing parameters is performed on untreated cultures at DIV12 after initial plating.
In each well, multiple unique fields-of-view are acquired until at least 200 neurons are counted. Eight
morphological features are quantified (see table 8.4.1) Neurites are defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one

-------
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 295	Number of chemicals tested: 260

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
67

Inactive hit count: Oihitc 0.9
228

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

5

17

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

14

132

-------
quadratic-polynomialfpoly2) model: 30

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

5

57

34

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

13.797

Neutral control median absolute deviation, by plate: nmad

1.691

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.25%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	12.22

Positive control well median absolute deviation, by plate: pmad	1.364

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.498

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	16.014

Negative control well median absolute deviation value, by plate: mmad	2.972

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	0.365

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 34.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2783

CCTE_Mu ndy_HCI_Cortical_Synap_Neu r_Matu r_Neu riteCou nt

1.	General Information

1.1	Assay Title: CCTE's Synaptogenesis and Neurite Maturation Assay in Rat Cortical Cells for Neurite Count, Mundy
Lab

1.2	Assay Summary: CCTE_Mundy_HCI_Cortical_Synap&Neurite_Matur is a multiplexed, cell-based-readout assay
that uses rat primary cortical cells with measurements taken at 288.33 hours after chemical dosing in a
microplate: 96-well plate. CCTE_Mundy_HCI_Cortical_Synap_Neur_Matur_NeuriteCount is one of eight
components of the CCTE_Mundy_HCI_Cortical_Synap&Neurite_Matur assay. It measures synaptogenesis and
neurite maturation related to number of neurites using High Content Imaging of fluorescently labelled markers.
Data from the assay component CCTE_Mundy_HCI_Cortical_Synap_Neur_Matur_NeuriteCount was analyzed at
the endpoint CCTE_Mundy_HCI_Cortical_Synap&Neur_Matur_NeuriteCount in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter,
loss -of-signal activity can be used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The assay is medium-to-high throughput. It uses primary cortical rat neurons,
seeded on a 96-well plate. Each plate may contain 6 test compounds at up to 9 concentrations, in addition to
vehicle controls, assay positive controls, and blanks. Cell cultures are exposed to chemicals for 5 days prior to
fixation and analysis. Each experiment should be replicated on three separate plates from the same culture
preparation.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the number of neurites are indicative of neurodevelopment.

The CCTE_Mundy_HCI_Cortical_Synap&Neur assay is designed to investigate changes in synaptogenesis and
neurite maturation in response to chemical exposure in developing rat cortical neurons using a high-content
imaging (HCI) technology. Synaptogenesis is one of several key processes of neurodevelopment. This endpoint
measures the number of neurites as a measure of neurite maturation.

2.2	Scientific Principles: During the development of the nervous system, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is synaptogenesis, where individual cells form close connections that allow for communication by

-------
chemical neurotransmitter. These interconnections between groups of neurons give rise to networks of cells
that connect the nervous system together. This assay utilizes a high-content imaging solution to describe
synaptogenesis in a rat primary cell culture, via the immunocytochemical labelling of cell bodies, neurites and
presynaptic structures.

2.3	Experimental System: adherent NA primary cell used. Primary cortical cultures consist of a mixture of
glutamatergic and GABAergic neurons, as well as glial cells (oligodendrocytes and a few microglia) as
characterized by immunocytochemistry and functional responses to pharmacological agents (Freudenrich and
Mundy, 2000; McConnell et al., 2011; Frank et al., 2017). Primary rat cortical cultures are prepared from the
neocortex dissected from the central nervous system (CNS) of newborn (PND0) Long-Evans rat pups using a
standard protocol (Section 3.7). In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
seeded onto a Poly-L-Lysine coated 96-well plate at a density of 10,000 cells/well and are allowed 2 hours to
attach. Sex of pups is not determined, and cultures are presumed to consist of a mixture of male and female
pups since multiple pups are used for each culture.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Cells are plated onto 96 well plates and allowed 2 hr for attachment to the substrate. At days
in vitro (DIV) 5, Cytosine Arabinoside (AraC) is added to arrest glial cell growth. Chemicals are dosed and a media
change is performed at DIV 7 of differentiation and is continued through DIV 12 when the experiment is
terminated. Cells are fixed with 20% paraformaldehyde and Hoechst Dye. Cells are stained using Millipore
Mouse anti-MAP2MAB3418 (1:800) and Millipore Rabbit anti-Synaptophysinsc-1750 (1:250) primary antibodies
and Alexa Fluor-488 goat anti-mouse and AlexaFluor-546 goat anti-rabbit secondary antibodies to label
neuronal cell nuclei, synapses and neurites. A Cellomics ArrayScan VTi HCS Reader is used for automated image
acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 2.724
Response cutoff threshold used to determine hit calls: 30
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe synaptogenesis and neurite length in
a rat primary cell culture, via the immunocytochemical labelling of cell bodies and neurites. An automated image
analysis protocol is employed to systematically identify targeted structures based on preassigned criteria.
Ultimately, changes in the neurite length, neurite count, number of branch points, the number of pre-synaptic
puncta, and number of synapses is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
9

Standard minimum concentration tested:

0.3 nM
Key positive control:

sodium orthovanadate (lOuM)

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO or water

-------
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All primary and secondary endpoints are assessed based on an immunocytochemical staining
(ICC) of images for each well. Five days after chemical treatment cells were fixed with warm (37C) 4%
paraformaldehyde containing 1.5 ng/ml Hoechst 33342 for 20 min followed by permeabilization and blocking
steps. Cell bodies and dendrites were labeled using a rabbit primary antibody for microtubule associated protein
2 (MAP2) (Millipore Catalog AB5622,1:800) and mouse antibody for synaptophysin (Santa Cruz catalog number
SC-17750, 1:200) followed by AlexaFluor 488 goat anti-rabbit secondary antibody (Molecular Probes catalog
number A11034, 1:500) and AlexaFluor 546 goat antimouse secondary antibody (Molecular Probes catalog
number A11029, 1:500). Images are analyzed using the Cellomics Neuronal Profiling BioApplication (version 4)
to measure neurite morphology. Optimization of nuclear masking and selection, cell body masking and
selection, and neurite tracing parameters is performed on untreated cultures at DIV12 after initial plating. In
each well, multiple unique fields-of-view are acquired until at least 200 neurons are counted. Eight
morphological features are quantified (see table 8.4.1) Neurites are defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 265	Number of chemicals tested: 233

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
70

Inactive hit count: 0
-------
power(pow) model:
linear-polynomial (polyl) model:

81

14

quadratic-polynomialfpoly2) model: 26

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

42

2

66

13

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-------
4.585

Neutral control median absolute deviation, by plate: nmad	0.124

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	2.96%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	4.96

Positive control well median absolute deviation, by plate: pmad	0.193

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.493

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	3.597

Negative control well median absolute deviation value, by plate: mmad	0.292

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.667

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 42.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2784

CCTE_M u ndy_HCI_Cortica l_Syna p_Neu r_Matu r_Neu riteLength

1.	General Information

1.1	Assay Title: CCTE's Synaptogenesis and Neurite Maturation Assay in Rat Cortical Cells for Neurite Length, Mundy
Lab

1.2	Assay Summary: CCTE_Mundy_HCI_Cortical_Synap&Neurite_Matur is a multiplexed, cell-based-readout assay
that uses rat primary cortical cells with measurements taken at 288.33 hours after chemical dosing in a
microplate: 96-well plate. CCTE_Mundy_HCI_Cortical_Synap_Neur_Matur_NeuriteLength is one of eight
components of the CCTE_Mundy_HCI_Cortical_Synap&Neurite_Matur assay. It measures synaptogenesis and
neurite maturation related to neurite length using High Content Imaging of fluorescently labelled markers. Data
from the assay component CCTE_Mundy_HCI_Cortical_Synap_Neur_Matur_NeuriteLength was analyzed at the
endpoint CCTE_Mundy_HCI_Cortical_Synap&Neur_Matur_NeuriteLength in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter,
loss -of-signal activity can be used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The assay is medium-to-high throughput. It uses primary cortical rat neurons,
seeded on a 96-well plate. Each plate may contain 6 test compounds at up to 9 concentrations, in addition to
vehicle controls, assay positive controls, and blanks. Cell cultures are exposed to chemicals for 5 days prior to
fixation and analysis. Each experiment should be replicated on three separate plates from the same culture
preparation.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the neurite length are indicative of neurodevelopment.

The CCTE_Mundy_HCI_Cortical_Synap&Neur assay is designed to investigate changes in synaptogenesis and
neurite maturation in response to chemical exposure in developing rat cortical neurons using a high-content
imaging (HCI) technology. Synaptogenesis is one of several key processes of neurodevelopment. This endpoint
measures the neurite length as a measure of neurite maturation.

2.2	Scientific Principles: During the development of the nervous system, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is synaptogenesis, where individual cells form close connections that allow for communication by

-------
chemical neurotransmitter. These interconnections between groups of neurons give rise to networks of cells
that connect the nervous system together. This assay utilizes a high-content imaging solution to describe
synaptogenesis in a rat primary cell culture, via the immunocytochemical labelling of cell bodies, neurites and
presynaptic structures.

2.3	Experimental System: adherent NA primary cell used. Primary cortical cultures consist of a mixture of
glutamatergic and GABAergic neurons, as well as glial cells (oligodendrocytes and a few microglia) as
characterized by immunocytochemistry and functional responses to pharmacological agents (Freudenrich and
Mundy, 2000; McConnell et al., 2011; Frank et al., 2017). Primary rat cortical cultures are prepared from the
neocortex dissected from the central nervous system (CNS) of newborn (PND0) Long-Evans rat pups using a
standard protocol (Section 3.7). In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
seeded onto a Poly-L-Lysine coated 96-well plate at a density of 10,000 cells/well and are allowed 2 hours to
attach. Sex of pups is not determined, and cultures are presumed to consist of a mixture of male and female
pups since multiple pups are used for each culture.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Cells are plated onto 96 well plates and allowed 2 hr for attachment to the substrate. At days
in vitro (DIV) 5, Cytosine Arabinoside (AraC) is added to arrest glial cell growth. Chemicals are dosed and a media
change is performed at DIV 7 of differentiation and is continued through DIV 12 when the experiment is
terminated. Cells are fixed with 20% paraformaldehyde and Hoechst Dye. Cells are stained using Millipore
Mouse anti-MAP2MAB3418 (1:800) and Millipore Rabbit anti-Synaptophysinsc-1750 (1:250) primary antibodies
and Alexa Fluor-488 goat anti-mouse and AlexaFluor-546 goat anti-rabbit secondary antibodies to label
neuronal cell nuclei, synapses and neurites. A Cellomics ArrayScan VTi HCS Reader is used for automated image
acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 6.284
Response cutoff threshold used to determine hit calls: 30
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe synaptogenesis and neurite length in
a rat primary cell culture, via the immunocytochemical labelling of cell bodies and neurites. An automated image
analysis protocol is employed to systematically identify targeted structures based on preassigned criteria.
Ultimately, changes in the neurite length, neurite count, number of branch points, the number of pre-synaptic
puncta, and number of synapses is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
9

Standard minimum concentration tested:

3 nM
Key positive control:

sodium orthovanadate (lOuM)

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO or water

-------
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All primary and secondary endpoints are assessed based on an immunocytochemical staining
(ICC) of images for each well. Five days after chemical treatment cells were fixed with warm (37C) 4%
paraformaldehyde containing 1.5 ng/ml Hoechst 33342 for 20 min followed by permeabilization and blocking
steps. Cell bodies and dendrites were labeled using a rabbit primary antibody for microtubule associated protein
2 (MAP2) (Millipore Catalog AB5622,1:800) and mouse antibody for synaptophysin (Santa Cruz catalog number
SC-17750, 1:200) followed by AlexaFluor 488 goat anti-rabbit secondary antibody (Molecular Probes catalog
number A11034, 1:500) and AlexaFluor 546 goat antimouse secondary antibody (Molecular Probes catalog
number A11029, 1:500). Images are analyzed using the Cellomics Neuronal Profiling BioApplication (version 4)
to measure neurite morphology. Optimization of nuclear masking and selection, cell body masking and
selection, and neurite tracing parameters is performed on untreated cultures at DIV12 after initial plating. In
each well, multiple unique fields-of-view are acquired until at least 200 neurons are counted. Eight
morphological features are quantified (see table 8.4.1) Neurites are defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 295	Number of chemicals tested: 260

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
114

Inactive hit count: 0
-------
power(pow) model:
linear-polynomial (polyl) model:

31

79

quadratic-polynomialfpoly2) model: 30

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

48

7

5

73

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-------
346.092

Neutral control median absolute deviation, by plate: nmad	21.741

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	6.75%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	258.3

Positive control well median absolute deviation, by plate: pmad	20.475

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-3.263

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	271.03

Negative control well median absolute deviation value, by plate: mmad	31.229

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-1.151

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 48.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2785

CCTE_Mu ndy_HCI_Cortical_Synap_Neu r_Matu r_Neu riteSpotCou ntPerN

euriteLength

1.	General Information

1.1	Assay Title: CCTE's Synaptogenesis and Neurite Maturation Assay in Rat Cortical Cells for Neurite Spot Count per
Neurite Length, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_Cortical_Synap&Neurite_Matur is a multiplexed, cell-based-readout assay
that uses rat primary cortical cells with measurements taken at 288.33 hours after chemical dosing in a
microplate: 96-well plate. CCTE_Mundy_HCI_Cortical_Synap_Neur_Matur_NeuriteSpotCountPerNeuriteLength
is one of eight components of the CCTE_Mundy_HCI_Cortical_Synap&Neurite_Matur assay. It measures
synaptogenesis and neurite maturation related to number of neurite-associated synapses (synaptophysin
puncta) per unit length of neurite using High Content Imaging of fluorescently labelled markers. Data from the
assay component CCTE_Mundy_HCI_Cortical_Synap_Neur_Matur_NeuriteSpotCountPerNeuriteLength was
analyzed at the endpoint CCTE_Mundy_HCI_Cortical_Synap&Neur_Matur_NeuriteSpotCountPerNeuriteLength
in the positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using
a type of morphology reporter, loss -of-signal activity can be used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The assay is medium-to-high throughput. It uses primary cortical rat neurons,
seeded on a 96-well plate. Each plate may contain 6 test compounds at up to 9 concentrations, in addition to
vehicle controls, assay positive controls, and blanks. Cell cultures are exposed to chemicals for 5 days prior to
fixation and analysis. Each experiment should be replicated on three separate plates from the same culture
preparation.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in the number of neurite-associated synapses (synaptophysin puncta) per unit length of
neurite are indicative of neurodevelopment.

The CCTE_Mundy_HCI_Cortical_Synap&Neur assay is designed to investigate changes in synaptogenesis and
neurite maturation in response to chemical exposure in developing rat cortical neurons using a high-content
imaging (HCI) technology. Synaptogenesis is one of several key processes of neurodevelopment. This endpoint
measures the number of neurite-associated puncta per unit length of neurite measure of synaptogenesis.

-------
2.2 Scientific Principles: During the development of the nervous system, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is synaptogenesis, where individual cells form close connections that allow for communication by
chemical neurotransmitter. These interconnections between groups of neurons give rise to networks of cells
that connect the nervous system together. This assay utilizes a high-content imaging solution to describe
synaptogenesis in a rat primary cell culture, via the immunocytochemical labelling of cell bodies, neurites and
presynaptic structures.

2.3	Experimental System: adherent NA primary cell used. Primary cortical cultures consist of a mixture of
glutamatergic and GABAergic neurons, as well as glial cells (oligodendrocytes and a few microglia) as
characterized by immunocytochemistry and functional responses to pharmacological agents (Freudenrich and
Mundy, 2000; McConnell et al., 2011; Frank et al., 2017). Primary rat cortical cultures are prepared from the
neocortex dissected from the central nervous system (CNS) of newborn (PND0) Long-Evans rat pups using a
standard protocol (Section 3.7). In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
seeded onto a Poly-L-Lysine coated 96-well plate at a density of 10,000 cells/well and are allowed 2 hours to
attach. Sex of pups is not determined, and cultures are presumed to consist of a mixture of male and female
pups since multiple pups are used for each culture.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Cells are plated onto 96 well plates and allowed 2 hr for attachment to the substrate. At days
in vitro (DIV) 5, Cytosine Arabinoside (AraC) is added to arrest glial cell growth. Chemicals are dosed and a media
change is performed at DIV 7 of differentiation and is continued through DIV 12 when the experiment is
terminated. Cells are fixed with 20% paraformaldehyde and Hoechst Dye. Cells are stained using Millipore
Mouse anti-MAP2MAB3418 (1:800) and Millipore Rabbit anti-Synaptophysinsc-1750 (1:250) primary antibodies
and Alexa Fluor-488 goat anti-mouse and AlexaFluor-546 goat anti-rabbit secondary antibodies to label
neuronal cell nuclei, synapses and neurites. A Cellomics ArrayScan VTi HCS Reader is used for automated image
acquisition and analysis of neurite outgrowth.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

9	4

Standard minimum concentration tested:	Standard maximum concentration tested:

3 nM	3000 nM

Key positive control:	Neutral vehicle control:

sodium orthovanadate (lOuM)	DMSO or water

Baseline median absolute deviation for the assay (bmad): 8.04
Response cutoff threshold used to determine hit calls: 30
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: This assay utilizes a high-content imaging solution to describe synaptogenesis and neurite length in
a rat primary cell culture, via the immunocytochemical labelling of cell bodies and neurites. An automated image
analysis protocol is employed to systematically identify targeted structures based on preassigned criteria.
Ultimately, changes in the neurite length, neurite count, number of branch points, the number of pre-synaptic
puncta, and number of synapses is quantified.

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

-------
2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The number of cells per field was used as an indicator of cell viability at the time of fixation.
Cellomics Neuronal Profiling BioApplication utilizes the BR3 and BR4 polarity algorithms for this assay. The
purpose of these algorithms is to selectively quantify dendrite lengths in primary cortical cultures during
development. Quantitation of synaptic puncta and dendrite lengths is based upon the differential labeling
patterns observed using antibodies targeted against synaptophysin and MAP2 respectively. The BR3 and BR4
algorithms are "paired protocols", meaning that images are captured and analyzed with one protocol (BR3)
followed by an off-line or "disk-scan" with the second protocol (BR4). These algorithms are appropriate for use
in primary cortical cultures grown for up to 12 days in vitro at densities ranging from 2000 to 10,000 cells / well.
Images are analyzed using the Cellomics Neuronal Profiling BioApplication (version 4) to measure neurite
morphology. Optimization of nuclear masking and selection, cell body masking and selection, and neurite tracing
parameters is performed on untreated cultures at DIV12 after initial plating. In each well, multiple unique fields-
of-view are acquired until at least 200 neurons are counted. Eight morphological features are quantified (see
table 8.4.1) Neurites are defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:

-------
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 295	Number of chemicals tested: 260

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
71

224

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

4
15

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

16

121

quadratic-polynomialfpoly2) model: 28

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

2

39

68

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.147

Neutral control median absolute deviation, by plate: nmad	0.015

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	8.48%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	0.12

Positive control well median absolute deviation, by plate: pmad	0.015

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.849

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	0.128

Negative control well median absolute deviation value, by plate: mmad	0.014

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.149

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 39.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2786

CCTE_Mu ndy_HCI_Cortical_Synap_Neu r_Matu r_Neu riteSpotCou ntPerN

euron

1.	General Information

1.1	Assay Title: CCTE's Synaptogenesis and Neurite Maturation Assay in Rat Cortical Cells for Neurite Spot Count per
Neuron Count, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_Cortical_Synap&Neurite_Matur is a multiplexed, cell-based-readout assay
that uses rat primary cortical cells with measurements taken at 288.33 hours after chemical dosing in a
microplate: 96-well plate. CCTE_Mundy_HCI_Cortical_Synap_Neur_Matur_NeuriteSpotCountPerNeuron is one
of eight components of the CCTE_Mundy_HCI_Cortical_Synap&Neurite_Matur assay. It measures
synaptogenesis and neurite maturation related to number of neurite-associated synapses (synaptophysin
puncta) per neuron using High Content Imaging of fluorescently labelled markers. Data from the assay
component CCTE_Mundy_HCI_Cortical_Synap_Neur_Matur_NeuriteSpotCountPerNeuron was analyzed at the
endpoint CCTE_Mundy_HCI_Cortical_Synap&Neur_Matur_NeuriteSpotCountPerNeuron in the positive analysis
fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of morphology
reporter, loss -of-signal activity can be used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The assay is medium-to-high throughput. It uses primary cortical rat neurons,
seeded on a 96-well plate. Each plate may contain 6 test compounds at up to 9 concentrations, in addition to
vehicle controls, assay positive controls, and blanks. Cell cultures are exposed to chemicals for 5 days prior to
fixation and analysis. Each experiment should be replicated on three separate plates from the same culture
preparation.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in the number of neurite-associated synapses (synaptophysin puncta) per neuron are
indicative of neurodevelopment.

The CCTE_Mundy_HCI_Cortical_Synap&Neur assay is designed to investigate changes in synaptogenesis and
neurite maturation in response to chemical exposure in developing rat cortical neurons using a high-content
imaging (HCI) technology. Synaptogenesis is one of several key processes of neurodevelopment. This endpoint
measures the number of pre-synaptic puncta in the neurite compartment as a measure of synaptogenesis.

-------
2.2 Scientific Principles: During the development of the nervous system, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is synaptogenesis, where individual cells form close connections that allow for communication by
chemical neurotransmitter. These interconnections between groups of neurons give rise to networks of cells
that connect the nervous system together. This assay utilizes a high-content imaging solution to describe
synaptogenesis in a rat primary cell culture, via the immunocytochemical labelling of cell bodies, neurites and
presynaptic structures.

2.3	Experimental System: adherent NA primary cell used. Primary cortical cultures consist of a mixture of
glutamatergic and GABAergic neurons, as well as glial cells (oligodendrocytes and a few microglia) as
characterized by immunocytochemistry and functional responses to pharmacological agents (Freudenrich and
Mundy, 2000; McConnell et al., 2011; Frank et al., 2017). Primary rat cortical cultures are prepared from the
neocortex dissected from the central nervous system (CNS) of newborn (PND0) Long-Evans rat pups using a
standard protocol (Section 3.7). In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
seeded onto a Poly-L-Lysine coated 96-well plate at a density of 10,000 cells/well and are allowed 2 hours to
attach. Sex of pups is not determined, and cultures are presumed to consist of a mixture of male and female
pups since multiple pups are used for each culture.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Cells are plated onto 96 well plates and allowed 2 hr for attachment to the substrate. At days
in vitro (DIV) 5, Cytosine Arabinoside (AraC) is added to arrest glial cell growth. Chemicals are dosed and a media
change is performed at DIV 7 of differentiation and is continued through DIV 12 when the experiment is
terminated. Cells are fixed with 20% paraformaldehyde and Hoechst Dye. Cells are stained using Millipore
Mouse anti-MAP2MAB3418 (1:800) and Millipore Rabbit anti-Synaptophysinsc-1750 (1:250) primary antibodies
and Alexa Fluor-488 goat anti-mouse and AlexaFluor-546 goat anti-rabbit secondary antibodies to label
neuronal cell nuclei, synapses and neurites. A Cellomics ArrayScan VTi HCS Reader is used for automated image
acquisition and analysis of neurite outgrowth.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

9	4

Standard minimum concentration tested:	Standard maximum concentration tested:

3 nM	3000 nM

Key positive control:	Neutral vehicle control:

sodium orthovanadate (lOuM)	DMSO or water

Baseline median absolute deviation for the assay (bmad): 10.817
Response cutoff threshold used to determine hit calls: 32.45
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: This assay utilizes a high-content imaging solution to describe synaptogenesis and neurite length in
a rat primary cell culture, via the immunocytochemical labelling of cell bodies and neurites. An automated image
analysis protocol is employed to systematically identify targeted structures based on preassigned criteria.
Ultimately, changes in the neurite length, neurite count, number of branch points, the number of pre-synaptic
puncta, and number of synapses is quantified.

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

-------
2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The number of cells per field was used as an indicator of cell viability at the time of fixation.
Cellomics Neuronal Profiling BioApplication utilizes the BR3 and BR4 polarity algorithms for this assay. The
purpose of these algorithms is to selectively quantify dendrite lengths in primary cortical cultures during
development. Quantitation of synaptic puncta and dendrite lengths is based upon the differential labeling
patterns observed using antibodies targeted against synaptophysin and MAP2 respectively. The BR3 and BR4
algorithms are "paired protocols", meaning that images are captured and analyzed with one protocol (BR3)
followed by an off-line or "disk-scan" with the second protocol (BR4). These algorithms are appropriate for use
in primary cortical cultures grown for up to 12 days in vitro at densities ranging from 2000 to 10,000 cells / well.
Images are analyzed using the Cellomics Neuronal Profiling BioApplication (version 4) to measure neurite
morphology. Optimization of nuclear masking and selection, cell body masking and selection, and neurite tracing
parameters is performed on untreated cultures at DIV12 after initial plating. In each well, multiple unique fields-
of-view are acquired until at least 200 neurons are counted. Eight morphological features are quantified (see
table 8.4.1) Neurites are defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:

-------
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 295	Number of chemicals tested: 260

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
113

182

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

10
15

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

21

104

quadratic-polynomialfpoly2) model: 28

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

2

79

35

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	48.46

Neutral control median absolute deviation, by plate: nmad	5.886

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.13%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	31.7

Positive control well median absolute deviation, by plate: pmad	3.954

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-2.533

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	37.699

Negative control well median absolute deviation value, by plate: mmad	6.43

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.418

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 35.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2787

CCTE_Mundy_HCI_Cortical_Synap_Neur_Matur_NeuronCount

1.	General Information

1.1	Assay Title: CCTE's Synaptogenesis and Neurite Maturation Assay in Rat Cortical Cells for Neuron Count, Mundy
Lab

1.2	Assay Summary: CCTE_Mundy_HCI_Cortical_Synap&Neurite_Matur is a multiplexed, cell-based-readout assay
that uses rat primary cortical cells with measurements taken at 288.33 hours after chemical dosing in a
microplate: 96-well plate. CCTE_Mundy_HCI_Cortical_Synap_Neur_Matur_NeuronCount is one of eight
components of the CCTE_Mundy_HCI_Cortical_Synap&Neurite_Matur assay. It measures cell viability related
to number of neurons using High Content Imaging of fluorescently labelled markers. Data from the assay
component CCTE_Mundy_HCI_Cortical_Synap_Neur_Matur_NeuronCount was analyzed at the endpoint
CCTE_Mundy_HCI_Cortical_Synap&Neur_Matur_NeuronCount in the positive analysis fitting direction relative
to DMSO as the negative control and baseline of activity. Using a type of morphology reporter, loss -of-signal
activity can be used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The assay is medium-to-high throughput. It uses primary cortical rat neurons,
seeded on a 96-well plate. Each plate may contain 6 test compounds at up to 9 concentrations, in addition to
vehicle controls, assay positive controls, and blanks. Cell cultures are exposed to chemicals for 5 days prior to
fixation and analysis. Each experiment should be replicated on three separate plates from the same culture
preparation.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the number of neurons are indicative of neurodevelopment.

The CCTE_Mundy_HCI_Cortical_Synap&Neur assay is designed to investigate changes in synaptogenesis and
neurite maturation in response to chemical exposure in developing rat cortical neurons using a high-content
imaging (HCI) technology. Synaptogenesis is one of several key processes of neurodevelopment. This endpoint
measures the number of neurons as a measure of cytotoxicity.

2.2	Scientific Principles: During the development of the nervous system, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is synaptogenesis, where individual cells form close connections that allow for communication by

-------
chemical neurotransmitter. These interconnections between groups of neurons give rise to networks of cells
that connect the nervous system together. This assay utilizes a high-content imaging solution to describe
synaptogenesis in a rat primary cell culture, via the immunocytochemical labelling of cell bodies, neurites and
presynaptic structures.

2.3	Experimental System: adherent NA primary cell used. Primary cortical cultures consist of a mixture of
glutamatergic and GABAergic neurons, as well as glial cells (oligodendrocytes and a few microglia) as
characterized by immunocytochemistry and functional responses to pharmacological agents (Freudenrich and
Mundy, 2000; McConnell et al., 2011; Frank et al., 2017). Primary rat cortical cultures are prepared from the
neocortex dissected from the central nervous system (CNS) of newborn (PND0) Long-Evans rat pups using a
standard protocol (Section 3.7). In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
seeded onto a Poly-L-Lysine coated 96-well plate at a density of 10,000 cells/well and are allowed 2 hours to
attach. Sex of pups is not determined, and cultures are presumed to consist of a mixture of male and female
pups since multiple pups are used for each culture.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Cells are plated onto 96 well plates and allowed 2 hr for attachment to the substrate. At days
in vitro (DIV) 5, Cytosine Arabinoside (AraC) is added to arrest glial cell growth. Chemicals are dosed and a media
change is performed at DIV 7 of differentiation and is continued through DIV 12 when the experiment is
terminated. Cells are fixed with 20% paraformaldehyde and Hoechst Dye. Cells are stained using Millipore
Mouse anti-MAP2MAB3418 (1:800) and Millipore Rabbit anti-Synaptophysinsc-1750 (1:250) primary antibodies
and Alexa Fluor-488 goat anti-mouse and AlexaFluor-546 goat anti-rabbit secondary antibodies to label
neuronal cell nuclei, synapses and neurites. A Cellomics ArrayScan VTi HCS Reader is used for automated image
acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 6.928
Response cutoff threshold used to determine hit calls: 30
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe synaptogenesis and neurite length in
a rat primary cell culture, via the immunocytochemical labelling of cell bodies and neurites. An automated image
analysis protocol is employed to systematically identify targeted structures based on preassigned criteria.
Ultimately, changes in the neurite length, neurite count, number of branch points, the number of pre-synaptic
puncta, and number of synapses is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
9

Standard minimum concentration tested:

3 nM
Key positive control:

sodium orthovanadate (lOuM)

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO or water

-------
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The number of cells per field was used as an indicator of cell viability at the time of fixation.
Cellomics Neuronal Profiling BioApplication utilizes the BR3 and BR4 polarity algorithms for this assay. The
purpose of these algorithms is to selectively quantify dendrite lengths in primary cortical cultures during
development. Quantitation of synaptic puncta and dendrite lengths is based upon the differential labeling
patterns observed using antibodies targeted against synaptophysin and MAP2 respectively. The BR3 and BR4
algorithms are "paired protocols", meaning that images are captured and analyzed with one protocol (BR3)
followed by an off-line or "disk-scan" with the second protocol (BR4). These algorithms are appropriate for use
in primary cortical cultures grown for up to 12 days in vitro at densities ranging from 2000 to 10,000 cells / well.
Images are analyzed using the Cellomics Neuronal Profiling BioApplication (version 4) to measure neurite
morphology. Optimization of nuclear masking and selection, cell body masking and selection, and neurite tracing
parameters is performed on untreated cultures at DIV12 after initial plating. In each well, multiple unique fields-
of-view are acquired until at least 200 neurons are counted. Eight morphological features are quantified (see
table 8.4.1) Neurites are defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

-------
Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 27:
ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis
direction. Typically used for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 295	Number of chemicals tested: 260

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
109

Inactive hit count: 0
-------
Number of sample-assay endpoints with winning hill model:

26
24

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

72

11

quadratic-polynomialfpoly2) model: 35

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

2

88

36

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

-------
NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	16.75

Neutral control median absolute deviation, by plate: nmad	1.359

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	7.35%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	14.583

Positive control well median absolute deviation, by plate: pmad	1.364

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.238

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	11.625

Negative control well median absolute deviation value, by plate: mmad	2.162

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	0.373

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 36.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)

-------
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2788

CCTE_M u ndy_HCI_Cortica l_Syna p_Neu r_Matu r_SynapseCount

1.	General Information

1.1	Assay Title: CCTE's Synaptogenesis and Neurite Maturation Assay in Rat Cortical Cells for Synapse Count, Mundy
Lab

1.2	Assay Summary: CCTE_Mundy_HCI_Cortical_Synap&Neurite_Matur is a multiplexed, cell-based-readout assay
that uses rat primary cortical cells with measurements taken at 288.33 hours after chemical dosing in a
microplate: 96-well plate. CCTE_Mundy_HCI_Cortical_Synap_Neur_Matur_SynapseCount is one of eight
components of the CCTE_Mundy_HCI_Cortical_Synap&Neurite_Matur assay. It measures synaptogenesis and
neurite maturation related to number of synpases per neuron using High Content Imaging of fluorescently
labelled	markers.	Data	from	the	assay	component
CCTE_Mundy_HCI_Cortical_Synap_Neur_Matur_SynapseCount was analyzed at the endpoint
CCTE_Mundy_HCI_Cortical_Synap&Neur_Matur_SynapseCount in the positive analysis fitting direction relative
to DMSO as the negative control and baseline of activity. Using a type of morphology reporter, loss -of-signal
activity can be used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The assay is medium-to-high throughput. It uses primary cortical rat neurons,
seeded on a 96-well plate. Each plate may contain 6 test compounds at up to 9 concentrations, in addition to
vehicle controls, assay positive controls, and blanks. Cell cultures are exposed to chemicals for 5 days prior to
fixation and analysis. Each experiment should be replicated on three separate plates from the same culture
preparation.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the number of synpases per neuron are indicative of neurodevelopment.

The CCTE_Mundy_HCI_Cortical_Synap&Neur assay is designed to investigate changes in synaptogenesis and
neurite maturation in response to chemical exposure in developing rat cortical neurons using a high-content
imaging (HCI) technology. Synaptogenesis is one of several key processes of neurodevelopment. This endpoint
measures the number of synapses as a measure of synaptogenesis.

2.2	Scientific Principles: During the development of the nervous system, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these

-------
processes is synaptogenesis, where individual cells form close connections that allow for communication by
chemical neurotransmitter. These interconnections between groups of neurons give rise to networks of cells
that connect the nervous system together. This assay utilizes a high-content imaging solution to describe
synaptogenesis in a rat primary cell culture, via the immunocytochemical labelling of cell bodies, neurites and
presynaptic structures.

2.3	Experimental System: adherent NA primary cell used. Primary cortical cultures consist of a mixture of
glutamatergic and GABAergic neurons, as well as glial cells (oligodendrocytes and a few microglia) as
characterized by immunocytochemistry and functional responses to pharmacological agents (Freudenrich and
Mundy, 2000; McConnell et al., 2011; Frank et al., 2017). Primary rat cortical cultures are prepared from the
neocortex dissected from the central nervous system (CNS) of newborn (PND0) Long-Evans rat pups using a
standard protocol (Section 3.7). In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
seeded onto a Poly-L-Lysine coated 96-well plate at a density of 10,000 cells/well and are allowed 2 hours to
attach. Sex of pups is not determined, and cultures are presumed to consist of a mixture of male and female
pups since multiple pups are used for each culture.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Cells are plated onto 96 well plates and allowed 2 hr for attachment to the substrate. At days
in vitro (DIV) 5, Cytosine Arabinoside (AraC) is added to arrest glial cell growth. Chemicals are dosed and a media
change is performed at DIV 7 of differentiation and is continued through DIV 12 when the experiment is
terminated. Cells are fixed with 20% paraformaldehyde and Hoechst Dye. Cells are stained using Millipore
Mouse anti-MAP2MAB3418 (1:800) and Millipore Rabbit anti-Synaptophysinsc-1750 (1:250) primary antibodies
and Alexa Fluor-488 goat anti-mouse and AlexaFluor-546 goat anti-rabbit secondary antibodies to label
neuronal cell nuclei, synapses and neurites. A Cellomics ArrayScan VTi HCS Reader is used for automated image
acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 10
Response cutoff threshold used to determine hit calls: 30
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe synaptogenesis and neurite length in
a rat primary cell culture, via the immunocytochemical labelling of cell bodies and neurites. An automated image
analysis protocol is employed to systematically identify targeted structures based on preassigned criteria.
Ultimately, changes in the neurite length, neurite count, number of branch points, the number of pre-synaptic
puncta, and number of synapses is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
9

Standard minimum concentration tested:

3 nM
Key positive control:

sodium orthovanadate (lOuM)

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO or water

-------
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The number of cells per field was used as an indicator of cell viability at the time of fixation.
Cellomics Neuronal Profiling BioApplication utilizes the BR3 and BR4 polarity algorithms for this assay. The
purpose of these algorithms is to selectively quantify dendrite lengths in primary cortical cultures during
development. Quantitation of synaptic puncta and dendrite lengths is based upon the differential labeling
patterns observed using antibodies targeted against synaptophysin and MAP2 respectively. The BR3 and BR4
algorithms are "paired protocols", meaning that images are captured and analyzed with one protocol (BR3)
followed by an off-line or "disk-scan" with the second protocol (BR4). These algorithms are appropriate for use
in primary cortical cultures grown for up to 12 days in vitro at densities ranging from 2000 to 10,000 cells / well.
Images are analyzed using the Cellomics Neuronal Profiling BioApplication (version 4) to measure neurite
morphology. Optimization of nuclear masking and selection, cell body masking and selection, and neurite tracing
parameters is performed on untreated cultures at DIV12 after initial plating. In each well, multiple unique fields-
of-view are acquired until at least 200 neurons are counted. Eight morphological features are quantified (see
table 8.4.1) Neurites are defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

-------
Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 295	Number of chemicals tested: 260

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
109

Inactive hit count: Oihitc 0.9
186

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:
gain-loss (gnls) model:

14

15

-------
power(pow) model:
linear-polynomial (polyl) model:

19

107

quadratic-polynomialfpoly2) model: 27

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

4

0

75

33

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

-------
Neutral control well median response value, by plate: nmed

64.37

Neutral control median absolute deviation, by plate: nmad	7.66

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.2%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	42.93

Positive control well median absolute deviation, by plate: pmad	4.715

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-2.15

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	53.589

Negative control well median absolute deviation value, by plate: mmad	5.354

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.452

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 33.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2789

CCTE_Mundy_HCI_hN2_NOG_BPCount

1.	General Information

1.1	Assay Title: CCTE's Neurite Outgrowth Assay in human hN2 cells for Branch Point Count, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_hN2_NOG is a multiplexed, cell-based-readout assay that uses immature
neurons derived from hNPl neuroprogenitor cells with measurements taken at 50.33 hours after chemical
dosing in a microplate: 96-well plate. CCTE_Mundy_HCI_hN2_NOG_BPCount is one of four components of the
CCTE_Mundy_HCI_hN2_NOG assay. It measures neurite outgrowth related to number of branch points using
High Content Imaging of fluorescently labelled markers. This assay component is considered less reliable by
experts on the assay due to low occurrence of branch points in the hN2 cells. Use data with caution. Data from
the assay component CCTE_Mundy_HCI_hN2_NOG_BPCount was analyzed at the endpoint
CCTE_Mundy_HCI_hN2_NOG_BPCount in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of morphology reporter, loss -of-signal activity can be used to
understand developmental effects. This assay component endpoint is considered less reliable by experts on the
assay due to low occurrence of branch points in the hN2 cells. Use data with caution.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The assay as described below is medium-to-high throughput. It uses primary
cortical rat neurons, seeded on a 96-well plate. Each plate may contain 8 test compounds at up to 11
concentrations, in addition to vehicle controls, assay positive controls, and blanks. Cell cultures are exposed to
chemicals for 48 hours prior to fixation and analysis. Each experiment should be replicated on three separate
plates from the same culture preparation.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the number of branch points are indicative of neurodevelopment.

The CCTE_Mundy_HCI_hN2_NOG assay is designed to investigate changes in neurite outgrowth (NOG) in
response to chemical exposure in developing rat cortical neurons using a high-content imaging (HCI) technology.
Neurite outgrowth is one of several key processes of neurodevelopment. This endpoint measures changes in
neurite branch point count.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these

-------
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together.

2.3	Experimental System: adherent hN2 secondary cell used. The hN2 human embryonic stem cell (hESC)-derived
neural cell line was obtained from Aruna and was derived from neuroepithelial cells of WA09 hESC (Thomson et
al., 1998) origin according to a previously described protocol (Shin et al., 2005, Shin et al., 2006). These cells are
no longer commercially available.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Exposure starts 2 hours post-plating by adding 10 ul of the working solution to 90 ul of media
in the wells for a 1/10 dilution. Forty-eight hours after chemical treatment cells are fixed by direct addition of
100 ul of warm (37C) Dulbecco's phosphate buffered saline (DPBS) fixative solution containing 8%
paraformaldehyde, 8% sucrose, and 0.1% of 3 mg/ml Hoechst 33342 into each well. Primary antibodies are
prepared by dilution in Immunocytochemical Staining Buffer (ISB), 10X Dulbecco's PBS, 0.1% Saponin, 5% Bovine
Serum Albumin, 0.5% NaN3 (Sodium Azide) and pill-tubulin (rabbit anti-pill-tubulin, 1:800) followed by Alexa
Fluor-488 secondary (1:500) to label neuronal cell bodies and neurites. A Cellomics ArrayScan VTi HCS Reader is
used for automated image acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 20.818
Response cutoff threshold used to determine hit calls: 62.454
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe neurite outgrowth in hN2 human
embryonic stem cell (hESC)-derived neural cells, via the immunocytochemical labelling of cell bodies and
neurites. An automated image analysis protocol is employed to systematically identify targeted structures based
on preassigned criteria. Ultimately, changes in the number and length of young outgrowths is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

3 nM
Key positive control:

Rac 1 inhibitor (lOuM)

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO or water

Additionally, this assay was annotated to the intended target family of neurodevelopment.

-------
3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A Cellomics ArrayScan VTi HCS Reader (Thermo Fisher Scientific) is used for automated image
acquisition and analysis of neurite outgrowth (Operating Procedure for High Content Imaging of Neurite
Outgrowth: OP-NHEERL-H/ISTD/SBB/TMF/2018-008-rl. Images are acquired using a 20x Pan NeoFLUAR (NA =
0.4) objective with a solid state LED light source, and an XF100 two channel dichroic filter set with excitation at
365(50) and 475(40) and emission at 535(45). Images are analyzed using the Cellomics Neuronal Profiling
BioApplication (version 4) to measure neurite morphology. Optimization of nuclear masking and selection, cell
body masking and selection, and neurite tracing parameters is performed on untreated cultures at 48 h after
initial plating. In each well, multiple unique fields-of-view are acquired until at least 300 neurons are counted.
Four morphological features are quantified: 1) number of cells (neurons) per field, 2) total neurite length per
neuron, 3) number of neurites per neuron, and 4) number of neurite branch points per neuron. Neurites are
defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0; pval
= 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 27:

-------
ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis
direction. Typically used for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 82	Number of chemicals tested: 82

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
26

Inactive hit count: Oihitc 0.9
43

WINING MODEL SELECTION

NA hit count: hitc^O
13

Number of sample-assay endpoints with winning hill model:

6

14

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

4

17

quadratic-polynomialfpoly2) model:	10

exponential-2 (exp2) model:

0

-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

14

16

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0.56

Neutral control median absolute deviation, by plate: nmad

0.074

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

13.52%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

0.155
0.044

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-3.609

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 16.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2790

CCTE_Mundy_HCI_hN2_NOG_NeuriteCount

1.	General Information

1.1	Assay Title: CCTE's Neurite Outgrowth Assay in human hN2 cells for Neurite Count, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_hN2_NOG is a multiplexed, cell-based-readout assay that uses immature
neurons derived from hNPl neuroprogenitor cells with measurements taken at 50.33 hours after chemical
dosing in a microplate: 96-well plate. CCTE_Mundy_HCI_hN2_NOG_NeuriteCount is one of four components of
the CCTE_Mundy_HCI_hN2_NOG assay. It measures neurite outgrowth related to number of neurites using
High Content Imaging of fluorescently labelled markers. Data from the assay component
CCTE_Mundy_HCI_hN2_NOG_NeuriteCount	was	analyzed	at	the	endpoint
CCTE_Mundy_HCI_hN2_NOG_NeuriteCount in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of morphology reporter, loss -of-signal activity can be
used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The assay as described below is medium-to-high throughput. It uses primary
cortical rat neurons, seeded on a 96-well plate. Each plate may contain 8 test compounds at up to 11
concentrations, in addition to vehicle controls, assay positive controls, and blanks. Cell cultures are exposed to
chemicals for 48 hours prior to fixation and analysis. Each experiment should be replicated on three separate
plates from the same culture preparation.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the number of neurites are indicative of neurodevelopment.

The CCTE_Mundy_HCI_hN2_NOG assay is designed to investigate changes in neurite outgrowth (NOG) in
response to chemical exposure in developing rat cortical neurons using a high-content imaging (HCI) technology.
Neurite outgrowth is one of several key processes of neurodevelopment. This endpoint measures the number
of neurites.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of

-------
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together.

2.3	Experimental System: adherent hN2 secondary cell used. The hN2 human embryonic stem cell (hESC)-derived
neural cell line was obtained from Aruna and was derived from neuroepithelial cells of WA09 hESC (Thomson et
al., 1998) origin according to a previously described protocol (Shin et al., 2005, Shin et al., 2006). These cells are
no longer commercially available.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Exposure starts 2 hours post-plating by adding 10 ul of the working solution to 90 ul of media
in the wells for a 1/10 dilution. Forty-eight hours after chemical treatment cells are fixed by direct addition of
100 ul of warm (37C) Dulbecco's phosphate buffered saline (DPBS) fixative solution containing 8%
paraformaldehyde, 8% sucrose, and 0.1% of 3 mg/ml Hoechst 33342 into each well. Primary antibodies are
prepared by dilution in Immunocytochemical Staining Buffer (ISB), 10X Dulbecco's PBS, 0.1% Saponin, 5% Bovine
Serum Albumin, 0.5% NaN3 (Sodium Azide) and pill-tubulin (rabbit anti-pill-tubulin, 1:800) followed by Alexa
Fluor-488 secondary (1:500) to label neuronal cell bodies and neurites. A Cellomics ArrayScan VTi HCS Reader is
used for automated image acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 11.206
Response cutoff threshold used to determine hit calls: 33.618
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe neurite outgrowth in hN2 human
embryonic stem cell (hESC)-derived neural cells, via the immunocytochemical labelling of cell bodies and
neurites. An automated image analysis protocol is employed to systematically identify targeted structures based
on preassigned criteria. Ultimately, changes in the number and length of young outgrowths is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

3 nM
Key positive control:

Rac 1 inhibitor (lOuM)

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO or water

Additionally, this assay was annotated to the intended target family of neurodevelopment.

-------
3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A Cellomics ArrayScan VTi HCS Reader (Thermo Fisher Scientific) is used for automated image
acquisition and analysis of neurite outgrowth (Operating Procedure for High Content Imaging of Neurite
Outgrowth: OP-NHEERL-H/ISTD/SBB/TMF/2018-008-rl. Images are acquired using a 20x Pan NeoFLUAR (NA =
0.4) objective with a solid state LED light source, and an XF100 two channel dichroic filter set with excitation at
365(50) and 475(40) and emission at 535(45). Images are analyzed using the Cellomics Neuronal Profiling
BioApplication (version 4) to measure neurite morphology. Optimization of nuclear masking and selection, cell
body masking and selection, and neurite tracing parameters is performed on untreated cultures at 48 h after
initial plating. In each well, multiple unique fields-of-view are acquired until at least 300 neurons are counted.
Four morphological features are quantified: 1) number of cells (neurons) per field, 2) total neurite length per
neuron, 3) number of neurites per neuron, and 4) number of neurite branch points per neuron. Neurites are
defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0; pval
= 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 27:

-------
ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis
direction. Typically used for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 82	Number of chemicals tested: 82

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
28

Inactive hit count: Oihitc 0.9
47

WINING MODEL SELECTION

NA hit count: hitc^O
7

Number of sample-assay endpoints with winning hill model:

8
7

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

2

19

quadratic-polynomialfpoly2) model:	10

exponential-2 (exp2) model:

2

-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

1

14

18

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

1.655

Neutral control median absolute deviation, by plate: nmad

0.059

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

3.63%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

1

0.089

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-3.676

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 18.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2791

CCTE_M u ndy_HCI_h N 2_N0G_Neu rite Length

1.	General Information

1.1	Assay Title: CCTE's Neurite Outgrowth Assay in human hN2 cells for Neurite Length, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_hN2_NOG is a multiplexed, cell-based-readout assay that uses immature
neurons derived from hNPl neuroprogenitor cells with measurements taken at 50.33 hours after chemical
dosing in a microplate: 96-well plate. CCTE_Mundy_HCI_hN2_NOG_NeuriteLength is one of four components
of the CCTE_Mundy_HCI_hN2_NOG assay. It measures neurite outgrowth related to neurite length using High
Content Imaging of fluorescently labelled markers. Data from the assay component
CCTE_Mundy_HCI_hN2_NOG_NeuriteLength	was	analyzed	at	the	endpoint
CCTE_Mundy_HCI_hN2_NOG_NeuriteLength in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of morphology reporter, loss -of-signal activity can be
used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The assay as described below is medium-to-high throughput. It uses primary
cortical rat neurons, seeded on a 96-well plate. Each plate may contain 8 test compounds at up to 11
concentrations, in addition to vehicle controls, assay positive controls, and blanks. Cell cultures are exposed to
chemicals for 48 hours prior to fixation and analysis. Each experiment should be replicated on three separate
plates from the same culture preparation.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the neurite length are indicative of neurodevelopment.

The CCTE_Mundy_HCI_hN2_NOG assay is designed to investigate changes in neurite outgrowth (NOG) in
response to chemical exposure in developing rat cortical neurons using a high-content imaging (HCI) technology.
Neurite outgrowth is one of several key processes of neurodevelopment. This endpoint measures the length of
the neurites.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of

-------
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together.

2.3	Experimental System: adherent hN2 secondary cell used. The hN2 human embryonic stem cell (hESC)-derived
neural cell line was obtained from Aruna and was derived from neuroepithelial cells of WA09 hESC (Thomson et
al., 1998) origin according to a previously described protocol (Shin et al., 2005, Shin et al., 2006). These cells are
no longer commercially available.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Exposure starts 2 hours post-plating by adding 10 ul of the working solution to 90 ul of media
in the wells for a 1/10 dilution. Forty-eight hours after chemical treatment cells are fixed by direct addition of
100 ul of warm (37C) Dulbecco's phosphate buffered saline (DPBS) fixative solution containing 8%
paraformaldehyde, 8% sucrose, and 0.1% of 3 mg/ml Hoechst 33342 into each well. Primary antibodies are
prepared by dilution in Immunocytochemical Staining Buffer (ISB), 10X Dulbecco's PBS, 0.1% Saponin, 5% Bovine
Serum Albumin, 0.5% NaN3 (Sodium Azide) and pill-tubulin (rabbit anti-pill-tubulin, 1:800) followed by Alexa
Fluor-488 secondary (1:500) to label neuronal cell bodies and neurites. A Cellomics ArrayScan VTi HCS Reader is
used for automated image acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 15.005
Response cutoff threshold used to determine hit calls: 45.015
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe neurite outgrowth in hN2 human
embryonic stem cell (hESC)-derived neural cells, via the immunocytochemical labelling of cell bodies and
neurites. An automated image analysis protocol is employed to systematically identify targeted structures based
on preassigned criteria. Ultimately, changes in the number and length of young outgrowths is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

3 nM
Key positive control:

Rac 1 inhibitor (lOuM)

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO or water

Additionally, this assay was annotated to the intended target family of neurodevelopment.

-------
3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A Cellomics ArrayScan VTi HCS Reader (Thermo Fisher Scientific) is used for automated image
acquisition and analysis of neurite outgrowth (Operating Procedure for High Content Imaging of Neurite
Outgrowth: OP-NHEERL-H/ISTD/SBB/TMF/2018-008-rl. Images are acquired using a 20x Pan NeoFLUAR (NA =
0.4) objective with a solid state LED light source, and an XF100 two channel dichroic filter set with excitation at
365(50) and 475(40) and emission at 535(45). Images are analyzed using the Cellomics Neuronal Profiling
BioApplication (version 4) to measure neurite morphology. Optimization of nuclear masking and selection, cell
body masking and selection, and neurite tracing parameters is performed on untreated cultures at 48 h after
initial plating. In each well, multiple unique fields-of-view are acquired until at least 300 neurons are counted.
Four morphological features are quantified: 1) number of cells (neurons) per field, 2) total neurite length per
neuron, 3) number of neurites per neuron, and 4) number of neurite branch points per neuron. Neurites are
defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0; pval
= 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 27:

-------
ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis
direction. Typically used for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 84	Number of chemicals tested: 84

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
28

Inactive hit count: Oihitc 0.9
48

WINING MODEL SELECTION

NA hit count: hitc^O
S

Number of sample-assay endpoints with winning hill model:

7

12

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

4

17

quadratic-polynomialfpoly2) model: 8

exponential-2 (exp2) model:

1

-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

1

19

15

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

75.15

Neutral control median absolute deviation, by plate: nmad

4.759

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

7.15%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

32.15
5.219

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-4.35

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 15.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2792

CCTE_Mundy_HCI_hN2_NOG_NeuronCount

1.	General Information

1.1	Assay Title: CCTE's Neurite Outgrowth Assay in human hN2 cells for Neuron Count, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_hN2_NOG is a multiplexed, cell-based-readout assay that uses immature
neurons derived from hNPl neuroprogenitor cells with measurements taken at 50.33 hours after chemical
dosing in a microplate: 96-well plate. CCTE_Mundy_HCI_hN2_NOG_NeuronCount is one of four components of
the CCTE_Mundy_HCI_hN2_NOG assay. It measures cell viability related to number of neurons using High
Content Imaging of fluorescently labelled markers. Data from the assay component
CCTE_Mundy_HCI_hN2_NOG_NeuronCount	was	analyzed	at	the	endpoint
CCTE_Mundy_HCI_hN2_NOG_NeuronCount in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of morphology reporter, loss -of-signal activity can be
used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The assay as described below is medium-to-high throughput. It uses primary
cortical rat neurons, seeded on a 96-well plate. Each plate may contain 8 test compounds at up to 11
concentrations, in addition to vehicle controls, assay positive controls, and blanks. Cell cultures are exposed to
chemicals for 48 hours prior to fixation and analysis. Each experiment should be replicated on three separate
plates from the same culture preparation.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the number of neurons are indicative of neurodevelopment.

The CCTE_Mundy_HCI_hN2_NOG assay is designed to investigate changes in neurite outgrowth (NOG) in
response to chemical exposure in developing rat cortical neurons using a high-content imaging (HCI) technology.
Neurite outgrowth is one of several key processes of neurodevelopment. This endpoint measures the number
of neurons as a measure of cytotoxicity.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of

-------
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together.

2.3	Experimental System: adherent hN2 secondary cell used. The hN2 human embryonic stem cell (hESC)-derived
neural cell line was obtained from Aruna and was derived from neuroepithelial cells of WA09 hESC (Thomson et
al., 1998) origin according to a previously described protocol (Shin et al., 2005, Shin et al., 2006). These cells are
no longer commercially available.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Exposure starts 2 hours post-plating by adding 10 ul of the working solution to 90 ul of media
in the wells for a 1/10 dilution. Forty-eight hours after chemical treatment cells are fixed by direct addition of
100 ul of warm (37C) Dulbecco's phosphate buffered saline (DPBS) fixative solution containing 8%
paraformaldehyde, 8% sucrose, and 0.1% of 3 mg/ml Hoechst 33342 into each well. Primary antibodies are
prepared by dilution in Immunocytochemical Staining Buffer (ISB), 10X Dulbecco's PBS, 0.1% Saponin, 5% Bovine
Serum Albumin, 0.5% NaN3 (Sodium Azide) and pill-tubulin (rabbit anti-pill-tubulin, 1:800) followed by Alexa
Fluor-488 secondary (1:500) to label neuronal cell bodies and neurites. A Cellomics ArrayScan VTi HCS Reader is
used for automated image acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 34.31
Response cutoff threshold used to determine hit calls: 30
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe neurite outgrowth in hN2 human
embryonic stem cell (hESC)-derived neural cells, via the immunocytochemical labelling of cell bodies and
neurites. An automated image analysis protocol is employed to systematically identify targeted structures based
on preassigned criteria. Ultimately, changes in the number and length of young outgrowths is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

3 nM
Key positive control:

Rac 1 inhibitor (lOuM)

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO or water

Additionally, this assay was annotated to the intended target family of neurodevelopment.

-------
3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A Cellomics ArrayScan VTi HCS Reader (Thermo Fisher Scientific) is used for automated image
acquisition and analysis of neurite outgrowth (Operating Procedure for High Content Imaging of Neurite
Outgrowth: OP-NHEERL-H/ISTD/SBB/TMF/2018-008-rl. Images are acquired using a 20x Pan NeoFLUAR (NA =
0.4) objective with a solid state LED light source, and an XF100 two channel dichroic filter set with excitation at
365(50) and 475(40) and emission at 535(45). Images are analyzed using the Cellomics Neuronal Profiling
BioApplication (version 4) to measure neurite morphology. Optimization of nuclear masking and selection, cell
body masking and selection, and neurite tracing parameters is performed on untreated cultures at 48 h after
initial plating. In each well, multiple unique fields-of-view are acquired until at least 300 neurons are counted.
Four morphological features are quantified: 1) number of cells (neurons) per field, 2) total neurite length per
neuron, 3) number of neurites per neuron, and 4) number of neurite branch points per neuron. Neurites are
defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0; pval
= 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 27: ow_bidirectional_loss
(Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis direction. Typically used
for endpoints where only negative responses are biologically relevant.)

-------
Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 84	Number of chemicals tested: 84

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
35

Inactive hit count: 0
-------
exponentials (exp5) model:

10

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

23.44

Neutral control median absolute deviation, by plate: nmad

3.558

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

13.47%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

2.328

20

-------
Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - rimed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.099

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 10.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.

5.1

Potential Regulatory Applications

Context of Use: Examples of end use scenarios could include, but are not limited to:

-------
•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2793

CCTE_Mundy_HCI_hNPl_Casp3_7

1.	General Information

1.1	Assay Title: CCTE's Caspase-Glo 3/7 Assay in human neuroprogenitor cells for Apoptosis, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_hNPl_Casp3_7 is a multiplexed, cell-based-readout assay that uses neural
stem cells derived from a neuroepithelial cell lineage of WA09 embryonic cells with measurements taken at 26.5
hours after chemical dosing in a microplate: 96-well plate. CCTE_Mundy_HCI_hNPl_Casp3_7 is one component
of the CCTE_Mundy_HCI_hNPl_Casp3_7 assay. It measures apoptosis related to caspase activation using
Luminescent Reporter. Data from the assay component CCTE_Mundy_HCI_hNPl_Casp3_7 was analyzed at the
endpoint CCTE_Mundy_HCI_hNPl_Casp3_7_gain in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of viability reporter, gain -of-signal activity can be
used to understand viability effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: CellTiter-Glo 2.0 Assay (Promega G9242), and apoptosis, Caspase-Glo 3/7 Buffer
(Promega G810A), and Caspase-Glo 3/7 Substrate (Promega G811A), are patented. This assay is considered part
of the developmental neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of
Data from the Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The methods described here are described for a 96 well plate format. Typically,
18 plates can be made in one culture (Six for Proliferation, six for apoptosis, six for cytotoxicity), which allows
testing 16 compounds in triplicate technical replicates. With thawing and expansion plating can occur every 14
days, allowing 32 compounds in triplicate at multiple concentrations to be screened per month

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the caspase activation are indicative of cell death.

The CCTE_Mundy_HCI_hNPl_Casp3_7 assay is designed to investigate changes in apoptosis in response to
chemical exposure in human neuroprogenitor cells (hNPl) using a high-content imaging (HCI) technology.
Apoptosis is one of several key processes of neurodevelopment. This endpoint measures intensity of
luminescent signal produced by caspase 3/7 cleavage of a detection reagent. The signal produced is proportional
to the number of apoptotic cells. An increase as compared to control is indicative of increased apoptosis.

2.2	Scientific Principles: Apoptosis is a form of programmed cell death that also plays a critical role in development
of the nervous system. In particular, cells that fail to become integrated into neural networks during neural
development often undergo apoptosis. Chemical exposure can result in changes in apoptosis and such changes
may, like changes in proliferation, alter the number of cells in the nervous system, resulting in developmental

-------
neurotoxicity. Cells are plated onto 96 well clear and opaque plates and allowed 40-44 hr for attachment to the
substrate and recovery. Between 40 and 44 hours all plates are exposed to the chemical ("Dose"). Twenty-four
hours after exposure, cells are either fixed or assayed.

2.3	Experimental System: adherent hNPl primary cell used. Human neural progenitor cells (hNPl) were obtained
as cryopreserved cells from ArunA Biomedical, Athens Georgia. Cells are plated on pre-coated poly-L-
ornithine/laminin clear bottom and opaque 96 well plates (day 0).

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Human neural progenitor cells (hNPl) were obtained as cryopreserved cells from ArunA
Biomedical, Athens Georgia. Cells are plated on pre-coated poly-L-ornithine/laminin clear bottom and opaque
96 well plates (day 0). Plates are placed in a 37oC, 5% C02 incubator for 40 to 44 hours at which time plates are
removed from the incubator and 10 uLof a freshly made chemical solution is added to each well. Apoptosis was
measured based on detection of activated caspase-3/7 using the Caspase-Glo 3/7 assay kit (Promega). Twenty-
four hours after chemical treatment, 50 ul of Caspase-Glo 3/7 assay buffer was added to each well and the plate
incubated for 30 min at room temperature. Luminescence was then recorded in each well using a FLUOstar
Optima plate reader.

Baseline median absolute deviation for the assay (bmad): 4.659
Response cutoff threshold used to determine hit calls: 30
Detection technology used: Luminescent Reporter (Fluorescence)

2.6	Response: The Caspase-Glo 3/7 Assay is a homogeneous, luminescent assay that measures caspase-3 and -7
activities. These members of the cysteine aspartic acid-specific protease (caspase) family play key effector roles
in apoptosis in mammalian cells. The assay provides a luminogenic caspase-3/7 substrate, which contains the
tetrapeptide sequence DEVD, in a reagent optimized for caspase activity, luciferase activity and cell lysis.
Addition of Caspase-Glo 3/7 Reagent in an "add-mix-measure" format results in cell lysis, followed by caspase
cleavage of the substrate and generation of a luminescent signal, produced by luciferase. Luminescence is
proportional to the amount of caspase activity present.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.00313 nM
Key positive control:

staurosporine (0.1 uM)

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 uM
Neutral vehicle control:

DMSO or water

-------
Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: With the lid removed from the plate, read in the BMG FLUOstar OPTIMA
Fluorescence/Luminescence Microplate Reader following the instructions in OP-NHEERL/ISTD/SBB/TMF/2013-
007-r0 using the "Caspase Glo" protocol (set "Gain" at 3500). All data and calculations are recorded in an Excel
spreadsheet and stored in a laboratory drive.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with

-------
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 346	Number of chemicals tested: 307

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
85

Inactive hit count: 0
-------
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	12.845

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

74034.5
3418.876
4.58%

199269.5
8058.672

NA

-------
Positive control signal-to-background: (pmed/nmed)

NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	71359.75

Negative control well median absolute deviation value, by plate: mmad	3663.875

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	0.055

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 24.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

-------
• Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2794

CCTE_Mundy_HCI_h N Pl_Cel ITiter

1.	General Information

1.1	Assay Title: CCTE's Cell Titer-Glo 2 Assay in human neuroprogenitor cells for Viability, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_hNPl_CellTiter is a multiplexed, cell-based-readout assay that uses neural
stem cells derived from a neuroepithelial cell lineage of WA09 embryonic cells with measurements taken at 26.5
hours after chemical dosing in a microplate: 96-well plate. CCTE_Mundy_HCI_hNPl_CellTiter is one component
of the CCTE_Mundy_HCI_hNPl_CellTiter assay. It measures cell viability related to ATP content using
Luminescent Reporter. Data from the assay component CCTE_Mundy_HCI_hNPl_CellTiter was analyzed at the
endpoint CCTE_Mundy_HCI_hNPl_CellTiter in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of viability reporter, loss -of-signal activity can be used to
understand viability effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: CellTiter-Glo 2.0 Assay (Promega G9242), and apoptosis, Caspase-Glo 3/7 Buffer
(Promega G810A), and Caspase-Glo 3/7 Substrate (Promega G811A), are patented. This assay is considered part
of the developmental neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of
Data from the Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The methods described here are described for a 96 well plate format. Typically,
18 plates can be made in one culture (Six for Proliferation, six for apoptosis, six for cytotoxicity), which allows
testing 16 compounds in triplicate technical replicates. With thawing and expansion plating can occur every 14
days, allowing 32 compounds in triplicate at multiple concentrations to be screened per month

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the ATP content are indicative of cell viability.

The CCTE_Mundy_CellTiter assay is designed to investigate the number of viable cells in human neuroprogenitor
cells (hNPl) using a high-content imaging (HCI) technology. This endpoint measures the intensity of luminescent
signal produced by detection of cellular ATP. The signal produced is proportional to the number of viable cells.
A decrease as compared to control is indicative of cytotoxicity. This assay is run in parallel with the
CCTE_Mundy_HCI_hNPl_Casp3_7 assay endpoint and CCTE_Mundy_HCI_hNPl_Pro_ResponderAvglnten
endpoint.

2.2	Scientific Principles: Cell viability and apoptosis measurements are performed simultaneously in 96 well plates
using commercial assays to determine toxicity of chemicals and death by apoptosis. Apoptosis is a form of
programmed cell death that also plays a critical role in development of the nervous system. In particular, cells

-------
that fail to become integrated into neural networks during neural development often undergo apoptosis.
Chemical exposure can result in changes in apoptosis and such changes may, like changes in proliferation, alter
the number of cells in the nervous system, resulting in developmental neurotoxicity. Cells are plated onto 96
well clear and opaque plates and allowed 40-44 hr for attachment to the substrate and recovery. Between 40
and 44 hours all plates are exposed to the chemical ("Dose"). Twenty-four hours after exposure, cells are either
fixed or assayed.

2.3	Experimental System: adherent hNPl primary cell used. Human neural progenitor cells (hNPl) were obtained
as cryopreserved cells from ArunA Biomedical, Athens Georgia. Cells are plated on pre-coated poly-L-
ornithine/laminin clear bottom and opaque 96 well plates (day 0).

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Human neural progenitor cells (hNPl) were obtained as cryopreserved cells from ArunA
Biomedical, Athens Georgia. Cells are plated on pre-coated poly-L-ornithine/laminin clear bottom and opaque
96 well plates (day 0). Plates are placed in a 37oC, 5% C02 incubator for 40 to 44 hours at which time plates are
removed from the incubator and 10 uL of a freshly made chemical solution is added to each well. Cell viability
was based on number of cells per field or determined in sister plates using the CellTiter-Glo luminescent assay
(Promega) which measures the amount of ATP present in each well. Twenty-four hours after chemical
treatment, 50 ul of CellTiter-Glo assay buffer was added to each well and the plate incubated for 30 min at room
temperature. Luminescence was then recorded in each well using a FLUOstar Optima plate reader (BMG
LABTECH, Cary, NC).

Baseline median absolute deviation for the assay (bmad): 4.966
Response cutoff threshold used to determine hit calls: 30
Detection technology used: Luminescent Reporter (Fluorescence)

2.6	Response: The Cell Titer-Glo 2 Cell Viability assay is a method for determining the number of viable cells in
culture based on the quantification levels of ATP present (indicative of metabolically active cells). It is an assay
which results in cell lysis and generation of a luminescent signal proportional to the amount of ATP present. The
ATP reacts with beetle luciferin in the presence of recombinant firefly luciferase and results in a stable
luminescent signal. The measure of cell viability assessed by the CellTiter Glo 2 assay depends on the metabolic
activity of cells present in the well. Reduced cell viability indicates either fewer cells present and/or a reduced
metabolic capacity of the cells in the well.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.03 nM
Key positive control:

staurosporine (0.1 uM)

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 uM
Neutral vehicle control:

DMSO or water

-------
2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: With the lid removed from the plate, read in the BMG FLUOstar OPTIMA
Fluorescence/Luminescence Microplate Reader following the instructions in OP-NHEERL/ISTD/SBB/TMF/2013-
007-rl using the "CellTiter-Glo protocol" (in the program set "Gain" at 3500). All data and calculations are
recorded in an Excel spreadsheet and stored in a laboratory drive.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 27:
ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis
direction. Typically used for endpoints where only negative responses are biologically relevant.)

-------
Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 321	Number of chemicals tested: 284

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
66

Inactive hit count: 0
-------
exponentials (exp5) model:

24

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

715455

Neutral control median absolute deviation, by plate: nmad

35007.151

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

4.79%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

14700.35

75778.5

-------
Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - rimed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-12.353

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	585869.5

Negative control well median absolute deviation value, by plate: mmad	17979.861

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.791

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 24.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.

5.1

Potential Regulatory Applications

Context of Use: Examples of end use scenarios could include, but are not limited to:

-------
•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2797

CCTE_M u ndy_HCI_h N Pl_Pro_ResponderAvgl nten

1.	General Information

1.1	Assay Title: CCTE's Proliferation Assay in human neuroprogenitor cells, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_hNPl_Pro is a multiplexed, cell-based-readout assay that uses neural stem
cells derived from a neuroepithelial cell lineage of WA09 embryonic cells with measurements taken at 26.25
hours after chemical dosing in a microplate: 96-well plate. CCTE_Mundy_HCI_hNPl_Pro_ResponderAvglnten is
one of three components of the CCTE_Mundy_HCI_hNPl_Pro assay. It measures proliferation related to
percentage of Brdll positive cells using High Content Imaging of fluorescently labelled markers. Data from the
assay component CCTE_Mundy_HCI_hNPl_Pro_ResponderAvglnten was analyzed at the endpoint
CCTE_Mundy_HCI_hNPl_Pro_ResponderAvglnten in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of growth reporter, loss -of-signal activity can be used
to understand viability effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: CellTiter-Glo 2.0 Assay (Promega G9242), and apoptosis, Caspase-Glo 3/7 Buffer
(Promega G810A), and Caspase-Glo 3/7 Substrate (Promega G811A), are patented. This assay is considered part
of the developmental neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of
Data from the Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The methods described here are described for a 96 well plate format. Typically,
18 plates can be made in one culture (Six for Proliferation, six for apoptosis, six for cytotoxicity), which allows
testing 16 compounds in triplicate technical replicates. With thawing and expansion plating can occur every 14
days, allowing 32 compounds in triplicate at multiple concentrations to be screened per month

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the percentage of BrdU positive cells are indicative of cell proliferation.

The CCTE_Mundy_HCI_hNPl_Pro assay is designed to investigate changes proliferation in human
neuroprogenitor cells (hNPl) using a high-content imaging (HCI) technology. Proliferation one of several key
processes of neurodevelopment. This endpoint measures intensity of BrdU labeling in the nucleus of each cell,
averaged across all cells in a well. A decrease as compared to control is indicative of decreased cell proliferation.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is proliferation of neurogenitor cells, which determines the number of cells in the nervous system.

-------
Brdll, or bromodeoxyuridine, is a synthetic nucleic acid that may be incorporated into DNA during replication in
lieu of thymine. Cells undergoing DNA replication - S-phase of the cell cycle - incorporate this Brdll into their
DNA, but cells in other phases of the cell cycle may not. Since only four hours are allotted for cells to incorporate
Brdll into their DNA, not enough time is given for S-phase cells to begin mitosis and pass the Brdll-label to their
progeny. Antibodies are selected to screen for this nucleoside to demonstrate which cells were actively dividing
at the time of Brdll exposure (after 20hr of chemical exposure). The purpose of this test is to identify compounds
that may interfere with the normal neurodevelopmental process of neuroprogenitor cell proliferation.
Chemicals that interfere with proliferation may result in too few, or too many cells in the nervous system. Both
of these conditions have been associated with developmental neurotoxicity following chemical exposures.

2.3	Experimental System: adherent hNPl primary cell used. Human neural progenitor cells (hNPl) were obtained
as cryopreserved cells from ArunA Biomedical, Athens Georgia. Cells are plated on pre-coated poly-L-
ornithine/laminin clear bottom and opaque 96 well plates (day 0).

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Human neural progenitor cells (hNPl) were obtained as cryopreserved cells from ArunA
Biomedical, Athens Georgia. Cells are plated on pre-coated poly-L-ornithine/laminin clear bottom and opaque
96 well plates (day 0). Plates are placed in a 37oC, 5% C02 incubator for 40 to 44 hours at which time plates are
removed from the incubator and 10 uL of a freshly made chemical solution is added to each well. Twenty hours
after the addition of the chemicals, the clear plates are removed from the incubator and 11 uLof a Brdll solution
is added to each well. The plates are returned to the incubator for four hours. After four hours, the plates are
fixed with a 4% paraformaldehyde solution.

Baseline median absolute deviation for the assay (bmad): 11.83
Response cutoff threshold used to determine hit calls: 35.491
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Brdll, or bromodeoxyuridine, is a synthetic nucleic acid that may be incorporated into DNA during
replication in lieu of thymine. Cells undergoing DNA replication - S-phase of the cell cycle - incorporate this
Brdll into their DNA, but cells in other phases of the cell cycle may not. Percentage of cells with intensity of Brdll
labeling > 3X background. A decrease as compared to control is indicative of decreased cell proliferation.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:
1HM

Key positive control:

aphidicolin (lOuM)

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO or water

-------
DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The primary endpoint of Brdll positive cells is assessed based on an immunocytochemical staining
(ICC) image of cells in each well. Cells are fixed 24 hours after chemical exposure then an ICC staining with
Hoechst for nuclei and anti-Brdll for Brdll positive cells is performed. The plates are scanned using an
automated high content imaging device and all nuclei and Brdll positive cells are identified automatically based
on their intensity and size. Channel 1 (Nuclei) and Channel 2 (Brdll positive). The average of several wells is
taken to determine exposure times. The time for each channel is fixed. Using the fixed exposure times, an image
set from a negative control well is obtained and the algorithm is run. The mean average intensity for channel 2
(Mean_Avg Inten Ch2) is collected and three more data points are obtained. These are averaged and the number
obtained is placed in the Assay Parameter table in the AvglntCh2_Level High line. Any cell with an intensity
above this number is counted as a Brdll positive cell. (Operating Procedure for High Content Imaging of
Proliferation: OP -NHEERL/ISTD/SBB/TJS/2017-001-r2)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

-------
Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 27:
ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis
direction. Typically used for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 347	Number of chemicals tested: 308

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
58

Inactive hit count: 0
-------
quadratic-polynomialfpoly2) model: 38

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

7

86

34

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

28.445

Neutral control median absolute deviation, by plate: nmad

3.543

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	12.4%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	1.12

Positive control well median absolute deviation, by plate: pmad	0.693

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-5.895

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	27.602

Negative control well median absolute deviation value, by plate: mmad	3.195

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	0.24

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 34.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.,
Carstens, K. E., Carpenter, A. F., Martin, M. M., Harrill, J. A., Shafer, T. J., & Paul Friedman, K. (2022). Integrating
Data From In Vitro New Approach Methodologies for Developmental Neurotoxicity. Toxicological sciences : an
official journal of the Society of Toxicology, 187(1), 62-79. https://doi.org/10.1093/toxsci/kfac018

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3067

CCTE_Mundy_HCIJCellGluta_NOG_BPCount

1.	General Information

1.1	Assay Title: CCTE's Neurite Outgrowth Assay in iCellGluta cells for Branch Point Count, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_iCellGluta_NOG is a multiplexed, cell-based-readout assay that uses iCellA®
GlutaNeurons from FUJIFILM Cellular Dynamics, Inc. (FCDI), human glutamatergic enriched neurons derived
from induced pluripotent stem (iPS) cells, with measurements taken at 50.33 hours after chemical dosing in a
microplate: 96-well plate. CCTE_Mundy_HCI_iCellGluta_NOG_BPCount is one of four components of the
CCTE_Mundy_HCI_iCellGluta_NOG assay. It measures neurite outgrowth related to number of branch points
using High Content Imaging of fluorescently labelled markers. This assay component is considered less reliable
by experts on the assay due to low occurrence of branch points in the iCell GlutaNeurons. Use data with caution.
Data from the assay component CCTE_Mundy_HCI_iCellGluta_NOG_BPCount was analyzed at the endpoint
CCTE_Mundy_HCI_iCellGluta_NOG_BPCount in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of morphology reporter, loss-of-signal activity can be used
to understand developmental effects. This assay endpoint is considered less reliable by experts on the assay due
to low occurrence of branch points in the iCell GlutaNeurons. Use data with caution.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The methods described here are set up in a 96 well plate format with
automated image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates
and compound dilutions can be automated using a liquid handling system. The methods described here are
described for a 96 well plate format. Typically, 9 plates can be made from one vial of cells, which allows testing
24 compounds in triplicate (technical replicates). If cultures are made every 14 days, 48 compounds per month
can be screened in triplicate at multiple concentrations. The throughput is therefore estimated as medium.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in the number of branch points are indicative of neurodevelopment.

The CCTE_Mundy_HCI_iCellGluta_NOG assay is designed to investigate changes in neurite outgrowth (NOG) in
response to chemical exposure in developing rat cortical neurons using a high-content imaging (HCI) technology.
Neurite outgrowth is one of several key processes of neurodevelopment. This endpoint measures changes in
neurite branch point count.

-------
2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together.

2.3	Experimental System: adherent iCell GlutaNeurons secondary cell used. Human glutamatergic-enriched cortical
neurons derived from induced pluripotent stem (iPS) cells are provided as cryopreserved cells, from Fujifilm
Cellular Dynamics, Inc, Madison, Wisconsin. Material originates from human blood of a male aged 50 - 59 at
sampling.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Exposure starts 2 hours post-plating by adding 10 ul of the working solution to 90 ul of media
in the wells for a 1/10 dilution. Forty-eight hours after chemical treatment cells are fixed by direct addition of
100 ul of warm (37C) Dulbecco's phosphate buffered saline (DPBS) fixative solution containing 8%
paraformaldehyde, 8% sucrose, and 0.1% of 3 mg/ml Hoechst 33342 into each well. Primary antibodies are
prepared by dilution in Immunocytochemical Staining Buffer (ISB), 10X Dulbecco's PBS, 0.1% Saponin, 5% Bovine
Serum Albumin, 0.5% NaN3 (Sodium Azide) and pill-tubulin (rabbit anti-pill-tubulin, 1:800) followed by Alexa
Fluor-488 secondary (1:500) to label neuronal cell bodies and neurites. A Cellomics ArrayScan VTi HCS Reader is
used for automated image acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 15.999
Response cutoff threshold used to determine hit calls: 47.998
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe neurite outgrowth in iCellGluta
neurons (Fujifilm Cellular Dynamics, Inc), via the immunocytochemical labelling of cell bodies and neurites. An
automated image analysis protocol is employed to systematically identify targeted structures based on
preassigned criteria. Ultimately, changes in the number and length of young outgrowths is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.00313 nM
Key positive control:

Rac 1 inhibitor (lOuM)

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

104 nM
Neutral vehicle control:

DMSO or water

-------
DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A Cellomics ArrayScan VTi HCS Reader (Thermo Fisher Scientific) is used for automated image
acquisition and analysis of neurite outgrowth (Operating Procedure for High Content Imaging of Neurite
Outgrowth: OP-NHEERL-H/ISTD/SBB/TMF/2018-008-rl. Images are acquired using a 20x Pan NeoFLUAR (NA =
0.4) objective with a solid state LED light source, and an XF100 two channel dichroic filter set with excitation at
365(50) and 475(40) and emission at 535(45). Images are analyzed using the Cellomics Neuronal Profiling
BioApplication (version 4) to measure neurite morphology. Optimization of nuclear masking and selection, cell
body masking and selection, and neurite tracing parameters is performed on untreated cultures at 48 h after
initial plating. In each well, multiple unique fields-of-view are acquired until at least 300 neurons are counted.
Four morphological features are quantified: 1) number of cells (neurons) per field, 2) total neurite length per
neuron, 3) number of neurites per neuron, and 4) number of neurite branch points per neuron. Neurites are
defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

-------
Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 27:
ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis
direction. Typically used for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 255	Number of chemicals tested: 235

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
67

Inactive hit count: 0
-------
quadratic-polynomialfpoly2) model: 26

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

3

51

30

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0.96

Neutral control median absolute deviation, by plate: nmad

0.126

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	16.92%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	0.275

Positive control well median absolute deviation, by plate: pmad	0.215

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-2.329

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	0.64

Negative control well median absolute deviation value, by plate: mmad	0.078

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	0.396

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 30.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3068

CCTE_M u ndy_HCI_iCel IGI uta_NOG_Neu riteCou nt

1.	General Information

1.1	Assay Title: CCTE's Neurite Outgrowth Assay in iCellGluta cells for Neurite Count, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_iCellGluta_NOG is a multiplexed, cell-based-readout assay that uses iCellA®
GlutaNeurons from FUJIFILM Cellular Dynamics, Inc. (FCDI), human glutamatergic enriched neurons derived
from induced pluripotent stem (iPS) cells, with measurements taken at 50.33 hours after chemical dosing in a
microplate: 96-well plate. CCTE_Mundy_HCI_iCellGluta_NOG_NeuriteCount is one of four components of the
CCTE_Mundy_HCI_iCellGluta_NOG assay. It measures neurite outgrowth related to number of neurites using
High Content Imaging of fluorescently labelled markers. Data from the assay component
CCTE_Mundy_HCI_iCellGluta_NOG_NeuriteCount was analyzed at the endpoint
CCTE_Mundy_HCI_iCellGluta_NOG_NeuriteCount in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of morphology reporter, loss-of-signal activity can be
used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The methods described here are set up in a 96 well plate format with
automated image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates
and compound dilutions can be automated using a liquid handling system. The methods described here are
described for a 96 well plate format. Typically, 9 plates can be made from one vial of cells, which allows testing
24 compounds in triplicate (technical replicates). If cultures are made every 14 days, 48 compounds per month
can be screened in triplicate at multiple concentrations. The throughput is therefore estimated as medium.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the number of neurites are indicative of neurodevelopment.

The CCTE_Mundy_HCI_iCellGluta_NOG assay is designed to investigate changes in neurite outgrowth (NOG) in
response to chemical exposure in developing rat cortical neurons using a high-content imaging (HCI) technology.
Neurite outgrowth is one of several key processes of neurodevelopment. This endpoint measures the number
of neurites.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these

-------
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together.

2.3	Experimental System: adherent iCell GlutaNeurons secondary cell used. Human glutamatergic-enriched cortical
neurons derived from induced pluripotent stem (iPS) cells are provided as cryopreserved cells, from Fujifilm
Cellular Dynamics, Inc, Madison, Wisconsin. Material originates from human blood of a male aged 50 - 59 at
sampling.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Exposure starts 2 hours post-plating by adding 10 ul of the working solution to 90 ul of media
in the wells for a 1/10 dilution. Forty-eight hours after chemical treatment cells are fixed by direct addition of
100 ul of warm (37C) Dulbecco's phosphate buffered saline (DPBS) fixative solution containing 8%
paraformaldehyde, 8% sucrose, and 0.1% of 3 mg/ml Hoechst 33342 into each well. Primary antibodies are
prepared by dilution in Immunocytochemical Staining Buffer (ISB), 10X Dulbecco's PBS, 0.1% Saponin, 5% Bovine
Serum Albumin, 0.5% NaN3 (Sodium Azide) and pill-tubulin (rabbit anti-pill-tubulin, 1:800) followed by Alexa
Fluor-488 secondary (1:500) to label neuronal cell bodies and neurites. A Cellomics ArrayScan VTi HCS Reader is
used for automated image acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 4.043
Response cutoff threshold used to determine hit calls: 30
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe neurite outgrowth in iCellGluta
neurons (Fujifilm Cellular Dynamics, Inc), via the immunocytochemical labelling of cell bodies and neurites. An
automated image analysis protocol is employed to systematically identify targeted structures based on
preassigned criteria. Ultimately, changes in the number and length of young outgrowths is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.00313 nM
Key positive control:

Rac 1 inhibitor (lOuM)

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

104 nM
Neutral vehicle control:

DMSO or water

Additionally, this assay was annotated to the intended target family of neurodevelopment.

-------
3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A Cellomics ArrayScan VTi HCS Reader (Thermo Fisher Scientific) is used for automated image
acquisition and analysis of neurite outgrowth (Operating Procedure for High Content Imaging of Neurite
Outgrowth: OP-NHEERL-H/ISTD/SBB/TMF/2018-008-rl. Images are acquired using a 20x Pan NeoFLUAR (NA =
0.4) objective with a solid state LED light source, and an XF100 two channel dichroic filter set with excitation at
365(50) and 475(40) and emission at 535(45). Images are analyzed using the Cellomics Neuronal Profiling
BioApplication (version 4) to measure neurite morphology. Optimization of nuclear masking and selection, cell
body masking and selection, and neurite tracing parameters is performed on untreated cultures at 48 h after
initial plating. In each well, multiple unique fields-of-view are acquired until at least 300 neurons are counted.
Four morphological features are quantified: 1) number of cells (neurons) per field, 2) total neurite length per
neuron, 3) number of neurites per neuron, and 4) number of neurite branch points per neuron. Neurites are
defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 27:

-------
ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis
direction. Typically used for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 255	Number of chemicals tested: 235

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
63

Inactive hit count: Oihitc 0.9
174

WINING MODEL SELECTION

NA hit count: hitc^O
IS

Number of sample-assay endpoints with winning hill model:

13
16

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

28

82

quadratic-polynomialfpoly2) model: 22

exponential-2 (exp2) model:

17

-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

25

51

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

2.17

Neutral control median absolute deviation, by plate: nmad

0.089

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

4.02%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

1.34
0.334

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	2.013

Negative control well median absolute deviation value, by plate: mmad	0.078

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.2

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 25.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3069

CCTE_M u ndy_HCI_iCel IGI uta_NOG_Neu riteLength

1.	General Information

1.1	Assay Title: CCTE's Neurite Outgrowth Assay in iCellGluta cells for Neurite Length, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_iCellGluta_NOG is a multiplexed, cell-based-readout assay that uses iCellA®
GlutaNeurons from FUJIFILM Cellular Dynamics, Inc. (FCDI), human glutamatergic enriched neurons derived
from induced pluripotent stem (iPS) cells, with measurements taken at 50.33 hours after chemical dosing in a
microplate: 96-well plate. CCTE_Mundy_HCI_iCellGluta_NOG_NeuriteLength is one of four components of the
CCTE_Mundy_HCI_iCellGluta_NOG assay. It measures neurite outgrowth related to neurite length using High
Content Imaging of fluorescently labelled markers. Data from the assay component
CCTE_Mundy_HCI_iCellGluta_NOG_NeuriteLength was analyzed at the endpoint
CCTE_Mundy_HCI_iCellGluta_NOG_NeuriteLength in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of morphology reporter, loss-of-signal activity can be
used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The methods described here are set up in a 96 well plate format with
automated image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates
and compound dilutions can be automated using a liquid handling system. The methods described here are
described for a 96 well plate format. Typically, 9 plates can be made from one vial of cells, which allows testing
24 compounds in triplicate (technical replicates). If cultures are made every 14 days, 48 compounds per month
can be screened in triplicate at multiple concentrations. The throughput is therefore estimated as medium.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the neurite length are indicative of neurodevelopment.

The CCTE_Mundy_HCI_iCellGluta_NOG assay is designed to investigate changes in neurite outgrowth (NOG) in
response to chemical exposure in developing rat cortical neurons using a high-content imaging (HCI) technology.
Neurite outgrowth is one of several key processes of neurodevelopment. This endpoint measures the length of
the neurites.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these

-------
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together.

2.3	Experimental System: adherent iCell GlutaNeurons secondary cell used. Human glutamatergic-enriched cortical
neurons derived from induced pluripotent stem (iPS) cells are provided as cryopreserved cells, from Fujifilm
Cellular Dynamics, Inc, Madison, Wisconsin. Material originates from human blood of a male aged 50 - 59 at
sampling.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Exposure starts 2 hours post-plating by adding 10 ul of the working solution to 90 ul of media
in the wells for a 1/10 dilution. Forty-eight hours after chemical treatment cells are fixed by direct addition of
100 ul of warm (37C) Dulbecco's phosphate buffered saline (DPBS) fixative solution containing 8%
paraformaldehyde, 8% sucrose, and 0.1% of 3 mg/ml Hoechst 33342 into each well. Primary antibodies are
prepared by dilution in Immunocytochemical Staining Buffer (ISB), 10X Dulbecco's PBS, 0.1% Saponin, 5% Bovine
Serum Albumin, 0.5% NaN3 (Sodium Azide) and pill-tubulin (rabbit anti-pill-tubulin, 1:800) followed by Alexa
Fluor-488 secondary (1:500) to label neuronal cell bodies and neurites. A Cellomics ArrayScan VTi HCS Reader is
used for automated image acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 9.124
Response cutoff threshold used to determine hit calls: 30
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe neurite outgrowth in iCellGluta
neurons (Fujifilm Cellular Dynamics, Inc), via the immunocytochemical labelling of cell bodies and neurites. An
automated image analysis protocol is employed to systematically identify targeted structures based on
preassigned criteria. Ultimately, changes in the number and length of young outgrowths is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.00313 nM
Key positive control:

Rac 1 inhibitor (lOuM)

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

104 nM
Neutral vehicle control:

DMSO or water

Additionally, this assay was annotated to the intended target family of neurodevelopment.

-------
3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A Cellomics ArrayScan VTi HCS Reader (Thermo Fisher Scientific) is used for automated image
acquisition and analysis of neurite outgrowth (Operating Procedure for High Content Imaging of Neurite
Outgrowth: OP-NHEERL-H/ISTD/SBB/TMF/2018-008-rl. Images are acquired using a 20x Pan NeoFLUAR (NA =
0.4) objective with a solid state LED light source, and an XF100 two channel dichroic filter set with excitation at
365(50) and 475(40) and emission at 535(45). Images are analyzed using the Cellomics Neuronal Profiling
BioApplication (version 4) to measure neurite morphology. Optimization of nuclear masking and selection, cell
body masking and selection, and neurite tracing parameters is performed on untreated cultures at 48 h after
initial plating. In each well, multiple unique fields-of-view are acquired until at least 300 neurons are counted.
Four morphological features are quantified: 1) number of cells (neurons) per field, 2) total neurite length per
neuron, 3) number of neurites per neuron, and 4) number of neurite branch points per neuron. Neurites are
defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 27:

-------
ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis
direction. Typically used for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 255	Number of chemicals tested: 235

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
78

Inactive hit count: Oihitc 0.9
148

WINING MODEL SELECTION

NA hit count: hitc^O
29

Number of sample-assay endpoints with winning hill model:

17
19

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

33

75

quadratic-polynomialfpoly2) model: 24

exponential-2 (exp2) model:

12

-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

4

46

25

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

157.63

Neutral control median absolute deviation, by plate: nmad

13.336

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

9.56%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed	62.605

Positive control well median absolute deviation, by plate: pmad	22.202

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.644

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	148.113

Negative control well median absolute deviation value, by plate: mmad	8.718

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.476

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 25.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3070

CCTE_Mundy_HCIJCellGluta_NOG_NeuronCount

1.	General Information

1.1	Assay Title: CCTE's Neurite Outgrowth Assay in iCellGluta cells for Neuron Count, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_iCellGluta_NOG is a multiplexed, cell-based-readout assay that uses iCellA®
GlutaNeurons from FUJIFILM Cellular Dynamics, Inc. (FCDI), human glutamatergic enriched neurons derived
from induced pluripotent stem (iPS) cells, with measurements taken at 50.33 hours after chemical dosing in a
microplate: 96-well plate. CCTE_Mundy_HCI_iCellGluta_NOG_NeuronCount is one of four components of the
CCTE_Mundy_HCI_iCellGluta_NOG assay. It measures cell viability related to number of neurons using High
Content Imaging of fluorescently labelled markers. Data from the assay component
CCTE_Mundy_HCI_iCellGluta_NOG_NeuronCount was analyzed at the endpoint
CCTE_Mundy_HCI_iCellGluta_NOG_NeuronCount in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of viability reporter, loss-of-signal activity can be used
to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The methods described here are set up in a 96 well plate format with
automated image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates
and compound dilutions can be automated using a liquid handling system. The methods described here are
described for a 96 well plate format. Typically, 9 plates can be made from one vial of cells, which allows testing
24 compounds in triplicate (technical replicates). If cultures are made every 14 days, 48 compounds per month
can be screened in triplicate at multiple concentrations. The throughput is therefore estimated as medium.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the number of neurons are indicative of neurodevelopment.

The CCTE_Mundy_HCI_iCellGluta_NOG assay is designed to investigate changes in neurite outgrowth (NOG) in
response to chemical exposure in developing rat cortical neurons using a high-content imaging (HCI) technology.
Neurite outgrowth is one of several key processes of neurodevelopment. This endpoint measures the number
of neurons as a measure of cytotoxicity.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these

-------
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together.

2.3	Experimental System: adherent iCell GlutaNeurons secondary cell used. Human glutamatergic-enriched cortical
neurons derived from induced pluripotent stem (iPS) cells are provided as cryopreserved cells, from Fujifilm
Cellular Dynamics, Inc, Madison, Wisconsin. Material originates from human blood of a male aged 50 - 59 at
sampling.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Exposure starts 2 hours post-plating by adding 10 ul of the working solution to 90 ul of media
in the wells for a 1/10 dilution. Forty-eight hours after chemical treatment cells are fixed by direct addition of
100 ul of warm (37C) Dulbecco's phosphate buffered saline (DPBS) fixative solution containing 8%
paraformaldehyde, 8% sucrose, and 0.1% of 3 mg/ml Hoechst 33342 into each well. Primary antibodies are
prepared by dilution in Immunocytochemical Staining Buffer (ISB), 10X Dulbecco's PBS, 0.1% Saponin, 5% Bovine
Serum Albumin, 0.5% NaN3 (Sodium Azide) and pill-tubulin (rabbit anti-pill-tubulin, 1:800) followed by Alexa
Fluor-488 secondary (1:500) to label neuronal cell bodies and neurites. A Cellomics ArrayScan VTi HCS Reader is
used for automated image acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 10.081
Response cutoff threshold used to determine hit calls: 30.244
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe neurite outgrowth in iCellGluta
neurons (Fujifilm Cellular Dynamics, Inc), via the immunocytochemical labelling of cell bodies and neurites. An
automated image analysis protocol is employed to systematically identify targeted structures based on
preassigned criteria. Ultimately, changes in the number and length of young outgrowths is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.00313 nM
Key positive control:

Rac 1 inhibitor (lOuM)

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

104 nM
Neutral vehicle control:

DMSO or water

Additionally, this assay was annotated to the intended target family of neurodevelopment.

-------
3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A Cellomics ArrayScan VTi HCS Reader (Thermo Fisher Scientific) is used for automated image
acquisition and analysis of neurite outgrowth (Operating Procedure for High Content Imaging of Neurite
Outgrowth: OP-NHEERL-H/ISTD/SBB/TMF/2018-008-rl. Images are acquired using a 20x Pan NeoFLUAR (NA =
0.4) objective with a solid state LED light source, and an XF100 two channel dichroic filter set with excitation at
365(50) and 475(40) and emission at 535(45). Images are analyzed using the Cellomics Neuronal Profiling
BioApplication (version 4) to measure neurite morphology. Optimization of nuclear masking and selection, cell
body masking and selection, and neurite tracing parameters is performed on untreated cultures at 48 h after
initial plating. In each well, multiple unique fields-of-view are acquired until at least 300 neurons are counted.
Four morphological features are quantified: 1) number of cells (neurons) per field, 2) total neurite length per
neuron, 3) number of neurites per neuron, and 4) number of neurite branch points per neuron. Neurites are
defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 27:

-------
ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis
direction. Typically used for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 255	Number of chemicals tested: 235

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
61

Inactive hit count: Oihitc 0.9
170

WINING MODEL SELECTION

NA hit count: hitc^O
24

Number of sample-assay endpoints with winning hill model:

6

25

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

24

114

quadratic-polynomialfpoly2) model: 30

exponential-2 (exp2) model:

7

-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

3

33

13

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

22.46

Neutral control median absolute deviation, by plate: nmad

2.165

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

10.43%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed	16.915

Positive control well median absolute deviation, by plate: pmad	6.664

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.152

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	20.085

Negative control well median absolute deviation value, by plate: mmad	2.194

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	0.135

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 13.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3164

CCTE_Mundy_HCI_iCellGABA_NOG_BPCount

1.	General Information

1.1	Assay Title: CCTE's Neurite Outgrowth Assay in iCellGABA cells for Branch Point Count, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_iCellGABA_NOG is a multiplexed, cell-based-readout assay that uses iCell
GABANeurons from FUJIFILM Cellular Dynamics, Inc. (FCDI), human GABAergic enriched neurons derived from
induced pluripotent stem (iPS) cells, with measurements taken at 50.33 hours after chemical dosing in a
microplate: 96-well plate. CCTE_Mundy_HCI_iCellGABA_NOG_BPCount is one of four components of the
CCTE_Mundy_HCI_iCellGABA_NOG assay. It measures neurite outgrowth related to number of branch points
using High Content Imaging of fluorescently labelled markers. This assay component is considered less reliable
by experts on the assay due to low occurrence of branch points in the iCell GABANeurons. Use data with caution.
Data from the assay component CCTE_Mundy_HCI_iCellGABA_NOG_BPCount was analyzed at the endpoint
CCTE_Mundy_HCI_iCellGABA_NOG_BPCount in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of morphology reporter, loss-of-signal activity can be used
to understand developmental effects. This assay component is considered less reliable by experts on the assay
due to low occurrence of branch points in the iCell GABANeurons. Use data with caution.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The methods described here are set up in a 96 well plate format with
automated image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates
and compound dilutions can be automated using a liquid handling system. The methods described here are
described for a 96 well plate format. Typically, 9 plates can be made from one vial of cells, which allows testing
24 compounds in triplicate (technical replicates). If cultures are made every 14 days, 48 compounds per month
can be screened in triplicate at multiple concentrations. The throughput is therefore estimated as medium.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in the number of branch points are indicative of neurodevelopment.

The CCTE_Mundy_HCI_iCellGABA_NOG assay is designed to investigate changes in neurite outgrowth (NOG) in
response to chemical exposure in developing rat cortical neurons using a high-content imaging (HCI) technology.
Neurite outgrowth is one of several key processes of neurodevelopment. This endpoint measures changes in
neurite branch point count.

-------
2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together.

2.3	Experimental System: adherent iCell GABANeurons secondary cell used. Human GABAergic-enriched cortical
neurons derived from induced pluripotent stem (iPS) cells are provided as cryopreserved cells, from Fujifilm
Cellular Dynamics, Inc, Madison, Wisconsin. Material originates from human blood of a male aged 50 - 59 at
sampling.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Exposure starts 2 hours post-plating by adding 10 ul of the working solution to 90 ul of media
in the wells for a 1/10 dilution. Forty-eight hours after chemical treatment cells are fixed by direct addition of
100 ul of warm (37C) Dulbecco's phosphate buffered saline (DPBS) fixative solution containing 8%
paraformaldehyde, 8% sucrose, and 0.1% of 3 mg/ml Hoechst 33342 into each well. Primary antibodies are
prepared by dilution in Immunocytochemical Staining Buffer (ISB), 10X Dulbecco's PBS, 0.1% Saponin, 5% Bovine
Serum Albumin, 0.5% NaN3 (Sodium Azide) and pill-tubulin (rabbit anti-pill-tubulin, 1:800) followed by Alexa
Fluor-488 secondary (1:500) to label neuronal cell bodies and neurites. A Cellomics ArrayScan VTi HCS Reader is
used for automated image acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 14.76
Response cutoff threshold used to determine hit calls: 30
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe neurite outgrowth in iCellGABA
neurons (Fujifilm Cellular Dynamics, Inc), via the immunocytochemical labelling of cell bodies and neurites. An
automated image analysis protocol is employed to systematically identify targeted structures based on
preassigned criteria. Ultimately, changes in the number and length of young outgrowths is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

0.001 nM
Key positive control:

Rac 1 inhibitor (30uM)

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

-------
DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A Cellomics ArrayScan VTi HCS Reader (Thermo Fisher Scientific) is used for automated image
acquisition and analysis of neurite outgrowth (Operating Procedure for High Content Imaging of Neurite
Outgrowth: OP-NHEERL-H/ISTD/SBB/TMF/2018-008-rl. Images are acquired using a 20x Pan NeoFLUAR (NA =
0.4) objective with a solid state LED light source, and an XF100 two channel dichroic filter set with excitation at
365(50) and 475(40) and emission at 535(45). Images are analyzed using the Cellomics Neuronal Profiling
BioApplication (version 4) to measure neurite morphology. Optimization of nuclear masking and selection, cell
body masking and selection, and neurite tracing parameters is performed on untreated cultures at 48 h after
initial plating. In each well, multiple unique fields-of-view are acquired until at least 300 neurons are counted.
Four morphological features are quantified: 1) number of cells (neurons) per field, 2) total neurite length per
neuron, 3) number of neurites per neuron, and 4) number of neurite branch points per neuron. Neurites are
defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of
the difference between the corrected (cval) and baseline (bval) values divided the difference between
the positive control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-
bval)*100.), 6: resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 32:
pval.zero (Set the positive control value (pval) to 0; pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

-------
Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 27: ow_bidirectional_loss
(Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis direction. Typically used
for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 27	Number of chemicals tested: 27

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
11

Inactive hit count: 0
-------
exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

3

2

5

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0.765

Neutral control median absolute deviation, by plate: nmad

0.052

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

7.26%

-------
POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	0

Positive control well median absolute deviation, by plate: pmad	0

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-14.164

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 5.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487., Druwe,
I., T. Freudenrich, K. Wallace, Tim Shafer, AND W. Mundy. Comparison of Human Induced Pluripotent Stem Cell-
Derived Neurons and Rat Primary CorticalNeurons as In Vitro Models of Neurite Outgrowth. Applied In Vitro
Toxicology. Mary Ann Liebert, Inc., Larchmont, NY, 2(l):26-36, (2016).

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3165

CCTE_Mundy_HCIJCellGABA_NOG_NeuriteCount

1.	General Information

1.1	Assay Title: CCTE's Neurite Outgrowth Assay in iCellGABA cells for Neurite Count, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_iCellGABA_NOG is a multiplexed, cell-based-readout assay that uses iCell
GABANeurons from FUJIFILM Cellular Dynamics, Inc. (FCDI), human GABAergic enriched neurons derived from
induced pluripotent stem (iPS) cells, with measurements taken at 50.33 hours after chemical dosing in a
microplate: 96-well plate. CCTE_Mundy_HCI_iCellGABA_NOG_NeuriteCount is one of four components of the
CCTE_Mundy_HCI_iCellGABA_NOG assay. It measures neurite outgrowth related to number of neurites using
High Content Imaging of fluorescently labelled markers. Data from the assay component
CCTE_Mundy_HCI_iCellGABA_NOG_NeuriteCount was analyzed at the endpoint
CCTE_Mundy_HCI_iCellGABA_NOG_NeuriteCount in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of morphology reporter, loss-of-signal activity can be
used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The methods described here are set up in a 96 well plate format with
automated image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates
and compound dilutions can be automated using a liquid handling system. The methods described here are
described for a 96 well plate format. Typically, 9 plates can be made from one vial of cells, which allows testing
24 compounds in triplicate (technical replicates). If cultures are made every 14 days, 48 compounds per month
can be screened in triplicate at multiple concentrations. The throughput is therefore estimated as medium.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the number of neurites are indicative of neurodevelopment.

The CCTE_Mundy_HCI_iCellGABA_NOG assay is designed to investigate changes in neurite outgrowth (NOG) in
response to chemical exposure in developing rat cortical neurons using a high-content imaging (HCI) technology.
Neurite outgrowth is one of several key processes of neurodevelopment. This endpoint measures the number
of neurites.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these

-------
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together.

2.3	Experimental System: adherent iCell GABANeurons secondary cell used. Human GABAergic-enriched cortical
neurons derived from induced pluripotent stem (iPS) cells are provided as cryopreserved cells, from Fujifilm
Cellular Dynamics, Inc, Madison, Wisconsin. Material originates from human blood of a male aged 50 - 59 at
sampling.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Exposure starts 2 hours post-plating by adding 10 ul of the working solution to 90 ul of media
in the wells for a 1/10 dilution. Forty-eight hours after chemical treatment cells are fixed by direct addition of
100 ul of warm (37C) Dulbecco's phosphate buffered saline (DPBS) fixative solution containing 8%
paraformaldehyde, 8% sucrose, and 0.1% of 3 mg/ml Hoechst 33342 into each well. Primary antibodies are
prepared by dilution in Immunocytochemical Staining Buffer (ISB), 10X Dulbecco's PBS, 0.1% Saponin, 5% Bovine
Serum Albumin, 0.5% NaN3 (Sodium Azide) and pill-tubulin (rabbit anti-pill-tubulin, 1:800) followed by Alexa
Fluor-488 secondary (1:500) to label neuronal cell bodies and neurites. A Cellomics ArrayScan VTi HCS Reader is
used for automated image acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 5.155
Response cutoff threshold used to determine hit calls: 30
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe neurite outgrowth in iCellGABA
neurons (Fujifilm Cellular Dynamics, Inc), via the immunocytochemical labelling of cell bodies and neurites. An
automated image analysis protocol is employed to systematically identify targeted structures based on
preassigned criteria. Ultimately, changes in the number and length of young outgrowths is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

0.001 nM
Key positive control:

Rac 1 inhibitor (30uM)

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

Additionally, this assay was annotated to the intended target family of neurodevelopment.

-------
3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A Cellomics ArrayScan VTi HCS Reader (Thermo Fisher Scientific) is used for automated image
acquisition and analysis of neurite outgrowth (Operating Procedure for High Content Imaging of Neurite
Outgrowth: OP-NHEERL-H/ISTD/SBB/TMF/2018-008-rl. Images are acquired using a 20x Pan NeoFLUAR (NA =
0.4) objective with a solid state LED light source, and an XF100 two channel dichroic filter set with excitation at
365(50) and 475(40) and emission at 535(45). Images are analyzed using the Cellomics Neuronal Profiling
BioApplication (version 4) to measure neurite morphology. Optimization of nuclear masking and selection, cell
body masking and selection, and neurite tracing parameters is performed on untreated cultures at 48 h after
initial plating. In each well, multiple unique fields-of-view are acquired until at least 300 neurons are counted.
Four morphological features are quantified: 1) number of cells (neurons) per field, 2) total neurite length per
neuron, 3) number of neurites per neuron, and 4) number of neurite branch points per neuron. Neurites are
defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of
the difference between the corrected (cval) and baseline (bval) values divided the difference between
the positive control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-
bval)*100.), 6: resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 32:
pval.zero (Set the positive control value (pval) to 0; pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 27: ow_bidirectional_loss
(Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis direction. Typically used
for endpoints where only negative responses are biologically relevant.)

-------
Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 27	Number of chemicals tested: 27

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
12

Inactive hit count: 0
-------
exponentials (exp5) model:

3

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

2.705

Neutral control median absolute deviation, by plate: nmad

0.067

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

2.48%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

0.015

0.16

-------
Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - rimed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-37.829

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 3.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.

5.1

Potential Regulatory Applications

Context of Use: Examples of end use scenarios could include, but are not limited to:

-------
•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487., Druwe,
I., T. Freudenrich, K. Wallace, Tim Shafer, AND W. Mundy. Comparison of Human Induced Pluripotent Stem Cell-
Derived Neurons and Rat Primary CorticalNeurons as In Vitro Models of Neurite Outgrowth. Applied In Vitro
Toxicology. Mary Ann Liebert, Inc., Larchmont, NY, 2(l):26-36, (2016).

7.	Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3166

CCTE_Mundy_HCIJCellGABA_NOG_NeuriteLength

1.	General Information

1.1	Assay Title: CCTE's Neurite Outgrowth Assay in iCellGABA cells for Neurite Length, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_iCellGABA_NOG is a multiplexed, cell-based-readout assay that uses iCell
GABANeurons from FUJIFILM Cellular Dynamics, Inc. (FCDI), human GABAergic enriched neurons derived from
induced pluripotent stem (iPS) cells, with measurements taken at 50.33 hours after chemical dosing in a
microplate: 96-well plate. CCTE_Mundy_HCI_iCellGABA_NOG_NeuriteLength is one of four components of the
CCTE_Mundy_HCI_iCellGABA_NOG assay. It measures neurite outgrowth related to neurite length using High
Content Imaging of fluorescently labelled markers. Data from the assay component
CCTE_Mundy_HCI_iCellGABA_NOG_NeuriteLength was analyzed at the endpoint
CCTE_Mundy_HCI_iCellGABA_NOG_NeuriteLength in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of morphology reporter, loss-of-signal activity can be
used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The methods described here are set up in a 96 well plate format with
automated image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates
and compound dilutions can be automated using a liquid handling system. The methods described here are
described for a 96 well plate format. Typically, 9 plates can be made from one vial of cells, which allows testing
24 compounds in triplicate (technical replicates). If cultures are made every 14 days, 48 compounds per month
can be screened in triplicate at multiple concentrations. The throughput is therefore estimated as medium.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the neurite length are indicative of neurodevelopment.

The CCTE_Mundy_HCI_iCellGABA_NOG assay is designed to investigate changes in neurite outgrowth (NOG) in
response to chemical exposure in developing rat cortical neurons using a high-content imaging (HCI) technology.
Neurite outgrowth is one of several key processes of neurodevelopment. This endpoint measures the length of
the neurites.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these

-------
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together.

2.3	Experimental System: adherent iCell GABANeurons secondary cell used. Human GABAergic-enriched cortical
neurons derived from induced pluripotent stem (iPS) cells are provided as cryopreserved cells, from Fujifilm
Cellular Dynamics, Inc, Madison, Wisconsin. Material originates from human blood of a male aged 50 - 59 at
sampling.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Exposure starts 2 hours post-plating by adding 10 ul of the working solution to 90 ul of media
in the wells for a 1/10 dilution. Forty-eight hours after chemical treatment cells are fixed by direct addition of
100 ul of warm (37C) Dulbecco's phosphate buffered saline (DPBS) fixative solution containing 8%
paraformaldehyde, 8% sucrose, and 0.1% of 3 mg/ml Hoechst 33342 into each well. Primary antibodies are
prepared by dilution in Immunocytochemical Staining Buffer (ISB), 10X Dulbecco's PBS, 0.1% Saponin, 5% Bovine
Serum Albumin, 0.5% NaN3 (Sodium Azide) and pill-tubulin (rabbit anti-pill-tubulin, 1:800) followed by Alexa
Fluor-488 secondary (1:500) to label neuronal cell bodies and neurites. A Cellomics ArrayScan VTi HCS Reader is
used for automated image acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 14.187
Response cutoff threshold used to determine hit calls: 30
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe neurite outgrowth in iCellGABA
neurons (Fujifilm Cellular Dynamics, Inc), via the immunocytochemical labelling of cell bodies and neurites. An
automated image analysis protocol is employed to systematically identify targeted structures based on
preassigned criteria. Ultimately, changes in the number and length of young outgrowths is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

0.001 nM
Key positive control:

Rac 1 inhibitor (30uM)

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

Additionally, this assay was annotated to the intended target family of neurodevelopment.

-------
3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A Cellomics ArrayScan VTi HCS Reader (Thermo Fisher Scientific) is used for automated image
acquisition and analysis of neurite outgrowth (Operating Procedure for High Content Imaging of Neurite
Outgrowth: OP-NHEERL-H/ISTD/SBB/TMF/2018-008-rl. Images are acquired using a 20x Pan NeoFLUAR (NA =
0.4) objective with a solid state LED light source, and an XF100 two channel dichroic filter set with excitation at
365(50) and 475(40) and emission at 535(45). Images are analyzed using the Cellomics Neuronal Profiling
BioApplication (version 4) to measure neurite morphology. Optimization of nuclear masking and selection, cell
body masking and selection, and neurite tracing parameters is performed on untreated cultures at 48 h after
initial plating. In each well, multiple unique fields-of-view are acquired until at least 300 neurons are counted.
Four morphological features are quantified: 1) number of cells (neurons) per field, 2) total neurite length per
neuron, 3) number of neurites per neuron, and 4) number of neurite branch points per neuron. Neurites are
defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of
the difference between the corrected (cval) and baseline (bval) values divided the difference between
the positive control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-
bval)*100.), 6: resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 32:
pval.zero (Set the positive control value (pval) to 0; pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 27: ow_bidirectional_loss
(Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis direction. Typically used
for endpoints where only negative responses are biologically relevant.)

-------
Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 27	Number of chemicals tested: 27

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
11

Inactive hit count: 0
-------
exponentials (exp5) model:

2

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

158.02

Neutral control median absolute deviation, by plate: nmad

11.142

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

6.5%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

4.045

0.719

-------
Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - rimed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-14.694

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.

5.1

Potential Regulatory Applications

Context of Use: Examples of end use scenarios could include, but are not limited to:

-------
•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487., Druwe,
I., T. Freudenrich, K. Wallace, Tim Shafer, AND W. Mundy. Comparison of Human Induced Pluripotent Stem Cell-
Derived Neurons and Rat Primary CorticalNeurons as In Vitro Models of Neurite Outgrowth. Applied In Vitro
Toxicology. Mary Ann Liebert, Inc., Larchmont, NY, 2(l):26-36, (2016).

7.	Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3167

CCTE_M u ndy_HCI_iCel IG ABA_NOG_Neu ronCou nt

1.	General Information

1.1	Assay Title: CCTE's Neurite Outgrowth Assay in iCellGABA cells for Neuron Count, Mundy Lab

1.2	Assay Summary: CCTE_Mundy_HCI_iCellGABA_NOG is a multiplexed, cell-based-readout assay that uses iCell
GABANeurons from FUJIFILM Cellular Dynamics, Inc. (FCDI), human GABAergic enriched neurons derived from
induced pluripotent stem (iPS) cells, with measurements taken at 50.33 hours after chemical dosing in a
microplate: 96-well plate. CCTE_Mundy_HCI_iCellGABA_NOG_NeuronCount is one of four components of the
CCTE_Mundy_HCI_iCellGABA_NOG assay. It measures cell viability related to number of neurons using High
Content Imaging of fluorescently labelled markers. Data from the assay component
CCTE_Mundy_HCI_iCellGABA_NOG_NeuronCount was analyzed at the endpoint
CCTE_Mundy_HCI_iCellGABA_NOG_NeuronCount in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of viability reporter, loss-of-signal activity can be used
to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Mundy Lab at the EPA Center for Computational Toxicology and Exposure utilizes high content
imaging to characterize chemical effects in neurodevelopment.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is considered part of the developmental neurotoxicity in vitro battery. See the
OECD Initial Recommendations on Evaluation of Data from the Developmental Neurotoxicity (DNT) In Vitro
Testing Battery document: https://one.oecd.org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 96-well plate. The methods described here are set up in a 96 well plate format with
automated image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates
and compound dilutions can be automated using a liquid handling system. The methods described here are
described for a 96 well plate format. Typically, 9 plates can be made from one vial of cells, which allows testing
24 compounds in triplicate (technical replicates). If cultures are made every 14 days, 48 compounds per month
can be screened in triplicate at multiple concentrations. The throughput is therefore estimated as medium.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the number of neurons are indicative of neurodevelopment.

The CCTE_Mundy_HCI_iCellGABA_NOG assay is designed to investigate changes in neurite outgrowth (NOG) in
response to chemical exposure in developing rat cortical neurons using a high-content imaging (HCI) technology.
Neurite outgrowth is one of several key processes of neurodevelopment. This endpoint measures the number
of neurons as a measure of cytotoxicity.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these

-------
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together.

2.3	Experimental System: adherent iCell GABANeurons secondary cell used. Human GABAergic-enriched cortical
neurons derived from induced pluripotent stem (iPS) cells are provided as cryopreserved cells, from Fujifilm
Cellular Dynamics, Inc, Madison, Wisconsin. Material originates from human blood of a male aged 50 - 59 at
sampling.

2.4	Metabolic Competence: Metabolic competence has not been characterized extensively to date in this assay.

2.5	Exposure Regime: Exposure starts 2 hours post-plating by adding 10 ul of the working solution to 90 ul of media
in the wells for a 1/10 dilution. Forty-eight hours after chemical treatment cells are fixed by direct addition of
100 ul of warm (37C) Dulbecco's phosphate buffered saline (DPBS) fixative solution containing 8%
paraformaldehyde, 8% sucrose, and 0.1% of 3 mg/ml Hoechst 33342 into each well. Primary antibodies are
prepared by dilution in Immunocytochemical Staining Buffer (ISB), 10X Dulbecco's PBS, 0.1% Saponin, 5% Bovine
Serum Albumin, 0.5% NaN3 (Sodium Azide) and pill-tubulin (rabbit anti-pill-tubulin, 1:800) followed by Alexa
Fluor-488 secondary (1:500) to label neuronal cell bodies and neurites. A Cellomics ArrayScan VTi HCS Reader is
used for automated image acquisition and analysis of neurite outgrowth.

Baseline median absolute deviation for the assay (bmad): 18.125
Response cutoff threshold used to determine hit calls: 30
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content imaging solution to describe neurite outgrowth in iCellGABA
neurons (Fujifilm Cellular Dynamics, Inc), via the immunocytochemical labelling of cell bodies and neurites. An
automated image analysis protocol is employed to systematically identify targeted structures based on
preassigned criteria. Ultimately, changes in the number and length of young outgrowths is quantified.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

0.001 nM
Key positive control:

Rac 1 inhibitor (30uM)

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

Additionally, this assay was annotated to the intended target family of neurodevelopment.

-------
3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A Cellomics ArrayScan VTi HCS Reader (Thermo Fisher Scientific) is used for automated image
acquisition and analysis of neurite outgrowth (Operating Procedure for High Content Imaging of Neurite
Outgrowth: OP-NHEERL-H/ISTD/SBB/TMF/2018-008-rl. Images are acquired using a 20x Pan NeoFLUAR (NA =
0.4) objective with a solid state LED light source, and an XF100 two channel dichroic filter set with excitation at
365(50) and 475(40) and emission at 535(45). Images are analyzed using the Cellomics Neuronal Profiling
BioApplication (version 4) to measure neurite morphology. Optimization of nuclear masking and selection, cell
body masking and selection, and neurite tracing parameters is performed on untreated cultures at 48 h after
initial plating. In each well, multiple unique fields-of-view are acquired until at least 300 neurons are counted.
Four morphological features are quantified: 1) number of cells (neurons) per field, 2) total neurite length per
neuron, 3) number of neurites per neuron, and 4) number of neurite branch points per neuron. Neurites are
defined as processes > 10 nm in length.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of
the difference between the corrected (cval) and baseline (bval) values divided the difference between
the positive control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-
bval)*100.), 6: resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 32:
pval.zero (Set the positive control value (pval) to 0; pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 27: ow_bidirectional_loss
(Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis direction. Typically used
for endpoints where only negative responses are biologically relevant.)

-------
Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 27	Number of chemicals tested: 27

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
13

Inactive hit count: 0
-------
exponentials (exp5) model:

4

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

17.6

Neutral control median absolute deviation, by plate: nmad

1.112

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

5.51%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

15.55

1.016

-------
Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - rimed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-2.706

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 4.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.

5.1

Potential Regulatory Applications

Context of Use: Examples of end use scenarios could include, but are not limited to:

-------
•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Harrill JA, Freudenrich T, Wallace K, Ball K, Shafer TJ, Mundy WR. Testing for developmental
neurotoxicity using a battery of in vitro assays for key cellular events in neurodevelopment. Toxicol Appl
Pharmacol. 2018 Sep l;354:24-39. doi: 10.1016/j.taap.2018.04.001. Epub 2018 Apr 5. PMID: 29626487., Druwe,
I., T. Freudenrich, K. Wallace, Tim Shafer, AND W. Mundy. Comparison of Human Induced Pluripotent Stem Cell-
Derived Neurons and Rat Primary CorticalNeurons as In Vitro Models of Neurite Outgrowth. Applied In Vitro
Toxicology. Mary Ann Liebert, Inc., Larchmont, NY, 2(l):26-36, (2016).

7.	Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1507

CCTE_Padilla_ZF_144hpf_TERATOSCORE

1.	General Information

1.1	Assay Title: (Legacy) CCTE's 144 Hour Post-fertilization Zebrafish Assay Teratoscore, Padilla Lab

1.2	Assay Summary: CCTE_Padilla_ZF_144hpf is a whole embryo, multiplexed assay. Zebrafish embryos were
exposed to chemicals starting 6 hr post-fertilization for 5 days in 96-well plates. Developing larvae were
examined microscopically daily for malformation and mortality. CCTE_Padilla_ZF_144hpf_TERATOSCORE is one
of one assay component(s) measured or calculated from the CCTE_Padilla_ZF_144hpf assay. It is designed to
make measurements of developmental malformations and mortality as detected with brightfield microscopy.
Data from the assay component CCTE_Padilla_ZF_144hpf_TERATOSCORE was analyzed into 1 assay endpoint.
This assay endpoint, CCTE_Padilla_ZF_144hpf_TERATOSCORE, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of morphological
reporter, gain-of-signal activity can be used to understand changes in developmental defect as they relate to
the whole embryo. To generalize the intended target to other relatable targets, this assay endpoint is annotated
to the malformation intended target family, where the subfamily is total.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Padilla laboratory at the EPA Center for Computational Toxicology focuses on the
development and screening of zebrafish assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; larva were assessed under a dissection microscope (Olympus
SZH10 Research Stereo).

1.9	Assay Throughput: 96-well plate. Zebrafish embryos plated on 96-well plates and exposed from 6 hours post-
fertilization to 144 hours post-fertilization to ToxCast chemicals.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Individual embryos were examined daily for malformations, failure to hatch and mortality. Results
were scored by customized rubric and a composite score derived.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

-------
2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Wild-type zebrafish (Danio rerio) embryos placed 1
embryo per well in a 96-well plate. All research and breeding procedures in this study were carried out in
accordance with the National Research Council's Guide for the Care and Use of Laboratory Animals, and were
approved by the Institutional Care and Use Committee at the U.S. EPA National Health and Environmental
Effects Research Laboratory. The animal facility is an internationally accredited Association for Assessment and
Accreditation of Laboratory Animal Care (AAALAC) facility (Unit 000509). The parental fish were wild-type adult
zebrafish (Danio rerio) descended from an undefined outbred stock originally supplied by both Aquatic Research
Organisms, Hampton, NH, and EkkWill Waterlife Resources, Ruskin, FL. The adult zebrafish were maintained at
a density of 7 fish/L in 3.5-liter tanks and housed in recirculating zebrafish housing racks (Tecniplast USA, West
Chester, PA) with reverse osmosis purified tap water (Durham, NC) which was buffered with Instant Ocean Sea
Salt (Spectrum Brands, Blacksburg, VA) and sodium bicarbonate (Church & Dwight, Co., Ewing, NJ). The water
was maintained at 28C, pH 7.4, and conductivity (1000 uS/cm), with ammonia and nitrite (maintained at 0 ppm)
and nitrate (allowed in insignificant amounts). The fish were fed twice a day with decapsulated artemia (E-Z
Egg; Brine Shrimp Direct, Ogden, UT) and Gemma Micro 300 formulated diet (Skretting, Westbrook, ME).
Housing rooms were illuminated on a 14:10 h light: dark cycle (lights on at 07:00 h). For embryo production,
groups of approximately 150 same age mixed sex zebrafish (ages ranging from 3 to 15 months old) were moved
into 16-L on rack recirculating spawning tanks (Z-Park tanks, Tecniplast USA, West Chester, PA) about one week
before embryos were needed. Then on the afternoon before embryos were needed, mesh spawning platform
inserts were added. Embryos were collected the following morning approximately 45 minutes after the room
lights came on (07:45 h) and were maintained at 28C for 1 to 2 hours until washing.

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from wild type adult zebrafish (undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842). Zebrafish are a good model in which to study
metabolism because they possess all the key organs required for metabolic control in humans, from the appetite
circuits that are present in the hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle
and white adipose tissue (WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Wild type adult zebrafish (Danio rerio; undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842) were housed in an AAALAC-approved animal
facility at 28C with a 14:10 h light: dark cycle (lights on at 08:30 h). Adult fish (2-3 females per male; density =
15-20 adults per tank) were kept in one of several 9-liter (L) flow-through colony tanks (Aquaneering Inc., San
Diego, CA). All adults in a colony tank were placed in a 2 L (static) breeding tank (Aquatic Habitats, Apopka, FL)
one hour prior to light onset. Typically, adults from two to three colony tanks were mated on the same day. Two
hours after light onset the adults were returned to the colony tank. All embryos were gathered from each
breeder tank, pooled, and placed in a 28C water bath for 2 h, followed by two washes with 0.06% bleach (v/v)
in 10% Hanks' Balanced Salt Solution (13.7 mM NaCI 0.54 mM KCI, 25 uM Na2HP04, 130 uM CaCI2 100 um
MgS04and 420 um NaHC03,), hereafter referred to as Hanks' solution, for 5 min in order to remove any residual
bacteria or fungi. Immediately following washing, the embryos were examined, and healthy embryos were
separated from dead or unfertilized eggs and moved into fresh Hanks'. Single Concentration Study: The
chemicals were arrayed on a 96-well stock plate, 80 chemicals per plate, 20 mM concentration of each, with 16
DMSO (vehicle) controls. The embryos were exposed to the chemicals (80 Mfinal concentration, renewed daily)
by immersion from 0 dpf until 5 dpf. Eighty micromolar was chosen as the highest concentration for two reasons
(1) because the stock solutions were prepared at 20 mM in 100% DMSO. On 6 dpf, each embryo/larva was

-------
assessed for viability, hatching status and malformations. There were 4 embryos per concentration per chemical
(each embryo on a separate microtiter plate). If more than two controls on a plate (i.e., >2/16 = 12.5%) showed
lethality or significant malformations, the data from that entire plate were rejected, and the experiment was
repeated. Chemical exposures: Zebrafish embryos were exposed in 96-well plates. Briefly, on day 0,
approximately 6-8 h after fertilization, zebrafish embryos (with chorions) were placed 1 embryo per well in
Millipore Multiscreen Nylon mesh plates (catalog number MANMN4050, Millipore Corp, Bedford, MA) and
exposed to nominal concentrations of the chemicals. In each well, 1 ul of the chemical in DMSO from the stock
plate was diluted with 250 ul of 10% Hanks' solution; the final DMSO concentration was 0.4% (v/v) in all wells;
vehicle controls receives DMSO only. Each plate was sealed with a non-adhesive material (Type A, BioRad,
Hercules, CA), covered with a lid, and wrapped in Para film to minimize evaporation. All embryos and larvae were
kept in a 26C incubator with a 14:10 h light-dark cycle (with lights on at 08:30 h and off at 22:30 h). Embryos
were exposed to the chemicals for 5 days post fertilization (dpf) (i.e., 120 hours post-fertilization) with daily
dosing (i.e., complete solution change with chemical renewal every 24 h), followed by a wash-out in Hanks'
buffer for 1 day prior to the lethality, hatching, and malformation assessments performed on 144 hours post-
fertilization (hpf) or 6 dpf. Embryo assessments: At 144 hpf, each larva was assessed by visual inspection under
a dissection microscope (Olympus SZH10 Research Stereo). If a larva was dead, no more assessments were
made. If a larva was viable, it was then determined if it had hatched or not. If the larva had not hatched, then
that information was recorded as an endpoint. If a larva was alive and hatched, an assessment of the degree of
malformation was made. Embryos were considered dead if there were signs of coagulation, decay, or no visible
heartbeat. Embryos were considered not hatched if they remained encased in the chorion. If a larva was alive
and hatched it was assessed by an observer, blinded to the treatment.

Baseline median absolute deviation for the assay (bmad): 1.535
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: Embryos/larvae were considered dead at 6 dpf if there were signs of coagulation, decay, or no visible
heartbeat. Embryos/larvae were considered not hatched if they remained encased in the chorion. If a larva was
alive and hatched it was assessed by an observer, blinded to the treatment. Larva was assessed for
malformations of general categories. In brief this involved the following assessments: (1) spine (e.g., stunted
skeletal growth, curved spine, kink in tail), (2) fins (e.g., malformed or stunted fins), (3) cranial/facial (e.g.,
abnormal head, eyes, or otoliths),(4)thorax (e.g., distension, heart malformations),(5) abdomen (e.g., edema,
emaciation), and (6) position in the water column (e.g., floating, lying on side). These features were scored for
each of the categories, which thus may contain a number of possible malformations that could occur. Some
malformations were scored in binary fashion (1,0 for present or not) while others were scored by relative
degree, from not present (0) through severe (4). The aggregated scores across all categories of malformations
were then summed for each condition and defined as the "Malformation Index". Higher Malformation Indices
denote more severely malformed fish, and the indices for the present study went as high as 34, with the
historical control values normally between 0 and 3.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
9

Standard minimum concentration tested:

0.168625718 nM
Key positive control:

Chlorpyrifos

I arget (nominal) number of replicates:

3

Standard maximum concentration tested:

500 nM
Neutral vehicle control:

DMSO

-------
2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The descriptive data was assigned a numerical score: 40 for lethality and 20 for non-hatching. If
the larva was alive and hatched, then the teratoscore was equal to the malformation index based on a count of
malformations. A chemical was considered active in single concentration if the mean teratoscore of the four
technical repeats for each chemical was greater than the overall mean teratoscore of the control fish in the
study. For example, in the Single Concentration Study there were 228 controls; the overall mean Toxicity Score
for the controls was 2.24 ± 9.53 (SD; standard deviation). Therefore any chemical with a mean Toxicity Score
above 2.24 was considered active in the Single Concentration Study. Chemical potencies were estimated for
each compound tested in multi-concentration as half-maximal activity concentrations (AC50). The "response"
was the combined teratoscore, which ranged from 0 to a maximum imputed value of 40.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -

-------
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 754	Number of chemicals tested: 722

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
507

Inactive hit count: 0
-------
linear-polynomial (polyl) model:

0

quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

412

238

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

2.03

-------
Neutral control median absolute deviation, by plate: nmad

1.153

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	58.01%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 412.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate

-------
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Padilla S, Corum D, Padnos B, Hunter DL, Beam A, Houck KA, Sipes N, Kleinstreuer N, Knudsen T,
Dix DJ, Reif DM. Zebrafish developmental screening of the ToxCast™ Phase I chemical library. Reprod Toxicol.
2012 Apr;33(2):174-87. doi: 10.1016/j.reprotox.2011.10.018. Epub 2011 Dec 9. PubMed PMID: 22182468.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3211

CCTE_Pad i I la_ZF_Score. Livi ng

1.	General Information

1.1	Assay Title: CCTE's 144 Hour Post-fertilization Zebrafish Assay Score for Viability, Padilla Lab

1.2	Assay Summary: CCTE_Padilla_ZF_Phenotype_Score is a whole embryo, multiplexed assay using zebrafish ova
exposed for 144 hours post fertilization on a 96-well plate. CCTE_Padilla_ZF_Score.Living is one of one assay
component(s) measured or calculated from the CCTE_Padilla_ZF_Phenotype_Score assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component CCTE_Padilla_ZF_Score.Living was analyzed into 1 assay endpoints.
This assay endpoint, CCTE_Padilla_ZF_Score.Living, was analyzed in the positive analysis fitting direction relative
to DMSO as the negative control and baseline of activity. Using a type of viability reporter, gain-of-signal activity
can be used to understand changes in developmental as they relate to the whole embryo. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the zebrafish development
intended target family, where the subfamily is embryonic mortality.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Padilla laboratory at the EPA Center for Computational Toxicology focuses on the
development and screening of zebrafish assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; larva were assessed under a dissection microscope (Olympus
SZH10 Research Stereo).

1.9	Assay Throughput: 96-well plate. Zebrafish embryos plated on 96-well plates and exposed from 6 hours post-
fertilization to 144 hours post-fertilization to ToxCast chemicals.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Mortality is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in

-------
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Wild-type zebrafish (Danio rerio) embryos placed 1
embryo per well in a 96-well plate. All research and breeding procedures in this study were reviewed and
approved by the Office of Research and Development's Health Institutional Animal Care and Use Committee
(IACUC) at the U.S. EPA in Research Triangle Park, NC (Protocol 21-08-003; approved Aug. 8, 2018 and Protocol
24-09-002; approved Sept. 2, 2021). The animal facility is an internationally accredited Association for
Assessment and Accreditation of Laboratory Animal Care (AAALAC) facility (Unit 000509). The parental fish were
wild-type adult zebrafish (Danio rerio) descended from an undefined outbred stock originally supplied by both
Aquatic Research Organisms, Hampton, NH, and EkkWill Waterlife Resources, Ruskin, FL The adult zebrafish
were maintained at a density of 7 fish/L in 3.5-liter tanks and housed in recirculating zebrafish housing racks
(Tecniplast USA, West Chester, PA) with reverse osmosis purified tap water (Durham, NC) which was buffered
with Instant Ocean Sea Salt (Spectrum Brands, Blacksburg, VA) and sodium bicarbonate (Church & Dwight, Co.,
Ewing, NJ). The water was maintained at 28C, pH 7.4, and conductivity (1000 uS/cm), with ammonia and nitrite
(maintained at 0 ppm) and nitrate (allowed in insignificant amounts). The fish were fed twice a day with
decapsulated artemia (E-Z Egg; Brine Shrimp Direct, Ogden, UT) and Gemma Micro 300 formulated diet
(Skretting, Westbrook, ME). Housing rooms were illuminated on a 14:10 h light: dark cycle (lights on at 07:00 h).
For embryo production, groups of approximately 150 same age mixed sex zebrafish (ages ranging from 3 to 15
months old) were moved into 16-L on rack recirculating spawning tanks (Z-Park tanks, Tecniplast USA, West
Chester, PA) about one week before embryos were needed. Then on the afternoon before embryos were
needed, mesh spawning platform inserts were added. Embryos were collected the following morning
approximately 45 minutes after the room lights came on (07:45 h) and were maintained at 28C for 1 to 2 hours
until washing.

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from wild type adult zebrafish (undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842). Zebrafish are a good model in which to study
metabolism because they possess all the key organs required for metabolic control in humans, from the appetite
circuits that are present in the hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle
and white adipose tissue (WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Wild type adult zebrafish (Danio rerio; undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842) were housed in an AAALAC-approved animal
facility at 28C with a 14:10 h light: dark cycle (lights on at 08:30 h). Adult fish (2-3 females per male; density =
15-20 adults per tank) were kept in one of several 9-liter (L) flow-through colony tanks (Aquaneering Inc., San
Diego, CA). All adults in a colony tank were placed in a 2 L (static) breeding tank (Aquatic Habitats, Apopka, FL)
one hour prior to light onset. Typically, adults from two to three colony tanks were mated on the same day. Two
hours after light onset the adults were returned to the colony tank. All embryos were gathered from each
breeder tank, pooled, and placed in a 28C water bath for 2 h, followed by two washes with 0.06% bleach (v/v)
in 10% Hanks' Balanced Salt Solution (13.7 mM NaCI 0.54 mM KCI, 25 uM Na2HP04, 130 uM CaCI2 100 um
MgS04and 420 um NaHC03,), hereafter referred to as Hanks' solution, for 5 min in order to remove any residual
bacteria or fungi. Immediately following washing, the embryos were examined, and healthy embryos were
separated from dead or unfertilized eggs and moved into fresh Hanks'. Chemical exposures: Zebrafish embryos
were exposed in 96-well plates. Between 6- and 8-hours post fertilization (hpf), healthy embryos (with chorions)
were transferred, one embryo per well, into 96 well (0.5 mL) microtiter plates (Cell Culture-Treated, Flat-Bottom
Microplate 96 well [Corning Costar, Kennebunk, ME; Cat 09-761-145]) filled with 200 uL of Hanks' solution. A
random number generator was used to assign the order in which the rows of embryos were plated in 96 well
plates each week. After plating, the embryos were dosed with 0.8 uL of the chemical dosing solution then
immediately sealed with AlumaSeal II (Excel Scientific Inc, Victorville, CA) to prevent volatilization of the

-------
chemicals, and then placed in a leakproof secondary container. These containers were placed in an incubator
maintained on a 14:10 h light: dark cycle at 26C for rearing. On 5 dpf, live larvae were gently moved out of the
plate with test chemicals and into a new 96 well mesh plate (Millipore Corp., Bedford, MA) with just the Hanks'
solution. Once within the mesh well plate, the animals were rinsed with 400 uL of fresh Hanks' three times, and
then the plates were covered with a non-adhesive material (Microseal A, BioRad, Hercules, CA), plate lid added,
and then plate sides wrapped in Parafilm (PM992, Bermis Company, Neenah, Wl), and returned to the incubator.
This rinse on 5 dpf was conducted to lessen the possible exposure of the human assessors to the chemicals
during detailed examination of each larva at 6 dpf. Larval assessments: On 6 dpf between 8 and 10 AM, two
individual assessors, blinded to the chemical treatments, independently examined each larva for mortality,
hatching status, and malformations using an Olympus SZH10 stereo microscope. Mortality was defined as a lack
of heartbeat or presence of coagulation. Malformations were defined as uninflated swim bladder, craniofacial
defects, edema, spinal defects, decreased pigmentation, abnormal position in water column, tail defects, or
blood pooling. If more than 15% of the negative control larvae were abnormal (i.e., dead, not hatched, or
malformed) or if the positive control larvae were not at least 50% abnormal, that plate was not used for any
analysis. Over the course of the experiment there were no plates that needed to be removed from analysis
based on the criteria above. Additionally, the overall rate of normal animals in the controls for the entire study
was 97%. After all plates were assessed, the larvae were anaesthetized using cold shock, and then euthanized
with cold 20% bleach solution

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 16
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of counts of larvae observed at each of the
twelve endpoints: living (dead or alive), hatching (hatched or unhatched), swim bladder (non-inflation),
craniofacial defects (dysmorphology of the head or eyes), edema, spinal defects (curved spine), pigmentation,
position (in water column, either persistent lying on side or upside down), tail defects (e.g., kinks), or blood
pooling. Each assessor also assigned every larva a general ranking for overall condition: normal (no defects
present), abnormal (defects present), or severely abnormal (life-threatening defects present). Observations
were recorded for each chemical and concentration. If any of these defects were present in a larva, an additional
endpoint "any" was set to 1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

100 nM
Key positive control:

Chlorpyrifos

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For endpoint-chemical per concentration index, counts were aggregated to a percentage, called
endpoint scores, with dead larvae excluded. For example, if there were 5 (out of 6) living larvae and 2 had edema
at a tested concentration, the edema score would equal 2/5 * 100 = 40%. No additional normalization was
performed, and outliers were not excluded. For each endpoint-chemical pair, the concentration-response series
was fit to 5 bounded models (constant, hill, gain-loss, exponential 4-5) with the winning model selected by the
lowest Akaike Information Criteria (AIC) score, a statistical calculation to compare model quality. Unbounded
models available in tcpl were excluded given the dichotomous nature of observations. To estimate activity, a
cutoff threshold was set at 16% and a continuous hit call (hitc) value was determined as the product of the
following components: 1) at least one median response greater than the assay cutoff threshold, 2) the maximal
efficacy in the fitted response is larger than the assay cutoff, 3) the AIC score of the winning model is less than
the constant model. Classification criteria for continuous hit calls were set in line with other in vitro screening
efforts as: hitc = 0 as negative, 0 < hitc < 0.9 as equivocal, hitc > 0.9 as positive. In addition to estimated activity
concentrations inducing a specified level of responses (e.g. 10%, 50%, etc.), a benchmark concentration (BMC)
was also derived in tcpl using a specified benchmark response level (BMR) of 1.349 times the standard derivation
of baseline (10%). Given lack of baseline variability given dichotomous observations, the baseline median
absolute deviation was set as 5%.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero

-------
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative
analysis direction. Typically used for endpoints where only positive responses are biologically relevant.),
29: pcl6 (Add a cutoff value of 16. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 185	Number of chemicals tested: 182

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
39

Inactive hit count: Oihitc 0.9
146

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

3
2

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

-------
quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

76

19

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

0

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 76.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3212

CCTE_Padilla_ZF_Score.Hatched

1.	General Information

1.1	Assay Title: CCTE's 144 Hour Post-fertilization Zebrafish Assay Score for Hatched, Padilla Lab

1.2	Assay Summary: CCTE_Padilla_ZF_Phenotype_Score is a whole embryo, multiplexed assay using zebrafish ova
exposed for 144 hours post fertilization on a 96-well plate. CCTE_Padilla_ZF_Score. Hatched is one of one assay
component(s) measured or calculated from the CCTE_Padilla_ZF_Phenotype_Score assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component CCTE_Padilla_ZF_Score.Hatched was analyzed into 1 assay endpoints.
This assay endpoint, CCTE_Padilla_ZF_Score.Hatched, was analyzed in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of viability reporter, gain-of-
signal activity can be used to understand changes in developmental as they relate to the whole embryo. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is hatching.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Padilla laboratory at the EPA Center for Computational Toxicology focuses on the
development and screening of zebrafish assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; larva were assessed under a dissection microscope (Olympus
SZH10 Research Stereo).

1.9	Assay Throughput: 96-well plate. Zebrafish embryos plated on 96-well plates and exposed from 6 hours post-
fertilization to 144 hours post-fertilization to ToxCast chemicals.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in

-------
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Wild-type zebrafish (Danio rerio) embryos placed 1
embryo per well in a 96-well plate. All research and breeding procedures in this study were reviewed and
approved by the Office of Research and Development's Health Institutional Animal Care and Use Committee
(IACUC) at the U.S. EPA in Research Triangle Park, NC (Protocol 21-08-003; approved Aug. 8, 2018 and Protocol
24-09-002; approved Sept. 2, 2021). The animal facility is an internationally accredited Association for
Assessment and Accreditation of Laboratory Animal Care (AAALAC) facility (Unit 000509). The parental fish were
wild-type adult zebrafish (Danio rerio) descended from an undefined outbred stock originally supplied by both
Aquatic Research Organisms, Hampton, NH, and EkkWill Waterlife Resources, Ruskin, FL The adult zebrafish
were maintained at a density of 7 fish/L in 3.5-liter tanks and housed in recirculating zebrafish housing racks
(Tecniplast USA, West Chester, PA) with reverse osmosis purified tap water (Durham, NC) which was buffered
with Instant Ocean Sea Salt (Spectrum Brands, Blacksburg, VA) and sodium bicarbonate (Church & Dwight, Co.,
Ewing, NJ). The water was maintained at 28C, pH 7.4, and conductivity (1000 uS/cm), with ammonia and nitrite
(maintained at 0 ppm) and nitrate (allowed in insignificant amounts). The fish were fed twice a day with
decapsulated artemia (E-Z Egg; Brine Shrimp Direct, Ogden, UT) and Gemma Micro 300 formulated diet
(Skretting, Westbrook, ME). Housing rooms were illuminated on a 14:10 h light: dark cycle (lights on at 07:00 h).
For embryo production, groups of approximately 150 same age mixed sex zebrafish (ages ranging from 3 to 15
months old) were moved into 16-L on rack recirculating spawning tanks (Z-Park tanks, Tecniplast USA, West
Chester, PA) about one week before embryos were needed. Then on the afternoon before embryos were
needed, mesh spawning platform inserts were added. Embryos were collected the following morning
approximately 45 minutes after the room lights came on (07:45 h) and were maintained at 28C for 1 to 2 hours
until washing.

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from wild type adult zebrafish (undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842). Zebrafish are a good model in which to study
metabolism because they possess all the key organs required for metabolic control in humans, from the appetite
circuits that are present in the hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle
and white adipose tissue (WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Wild type adult zebrafish (Danio rerio; undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842) were housed in an AAALAC-approved animal
facility at 28C with a 14:10 h light: dark cycle (lights on at 08:30 h). Adult fish (2-3 females per male; density =
15-20 adults per tank) were kept in one of several 9-liter (L) flow-through colony tanks (Aquaneering Inc., San
Diego, CA). All adults in a colony tank were placed in a 2 L (static) breeding tank (Aquatic Habitats, Apopka, FL)
one hour prior to light onset. Typically, adults from two to three colony tanks were mated on the same day. Two
hours after light onset the adults were returned to the colony tank. All embryos were gathered from each
breeder tank, pooled, and placed in a 28C water bath for 2 h, followed by two washes with 0.06% bleach (v/v)
in 10% Hanks' Balanced Salt Solution (13.7 mM NaCI 0.54 mM KCI, 25 uM Na2HP04, 130 uM CaCI2 100 um
MgS04and 420 um NaHC03,), hereafter referred to as Hanks' solution, for 5 min in order to remove any residual
bacteria or fungi. Immediately following washing, the embryos were examined, and healthy embryos were
separated from dead or unfertilized eggs and moved into fresh Hanks'. Chemical exposures: Zebrafish embryos
were exposed in 96-well plates. Between 6- and 8-hours post fertilization (hpf), healthy embryos (with chorions)
were transferred, one embryo per well, into 96 well (0.5 mL) microtiter plates (Cell Culture-Treated, Flat-Bottom
Microplate 96 well [Corning Costar, Kennebunk, ME; Cat 09-761-145]) filled with 200 uL of Hanks' solution. A
random number generator was used to assign the order in which the rows of embryos were plated in 96 well
plates each week. After plating, the embryos were dosed with 0.8 uL of the chemical dosing solution then
immediately sealed with AlumaSeal II (Excel Scientific Inc, Victorville, CA) to prevent volatilization of the

-------
chemicals, and then placed in a leakproof secondary container. These containers were placed in an incubator
maintained on a 14:10 h light: dark cycle at 26C for rearing. On 5 dpf, live larvae were gently moved out of the
plate with test chemicals and into a new 96 well mesh plate (Millipore Corp., Bedford, MA) with just the Hanks'
solution. Once within the mesh well plate, the animals were rinsed with 400 uL of fresh Hanks' three times, and
then the plates were covered with a non-adhesive material (Microseal A, BioRad, Hercules, CA), plate lid added,
and then plate sides wrapped in Parafilm (PM992, Bermis Company, Neenah, Wl), and returned to the incubator.
This rinse on 5 dpf was conducted to lessen the possible exposure of the human assessors to the chemicals
during detailed examination of each larva at 6 dpf. Larval assessments: On 6 dpf between 8 and 10 AM, two
individual assessors, blinded to the chemical treatments, independently examined each larva for mortality,
hatching status, and malformations using an Olympus SZH10 stereo microscope. Mortality was defined as a lack
of heartbeat or presence of coagulation. Malformations were defined as uninflated swim bladder, craniofacial
defects, edema, spinal defects, decreased pigmentation, abnormal position in water column, tail defects, or
blood pooling. If more than 15% of the negative control larvae were abnormal (i.e., dead, not hatched, or
malformed) or if the positive control larvae were not at least 50% abnormal, that plate was not used for any
analysis. Over the course of the experiment there were no plates that needed to be removed from analysis
based on the criteria above. Additionally, the overall rate of normal animals in the controls for the entire study
was 97%. After all plates were assessed, the larvae were anaesthetized using cold shock, and then euthanized
with cold 20% bleach solution

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 16
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of counts of larvae observed at each of the
twelve endpoints: living (dead or alive), hatching (hatched or unhatched), swim bladder (non-inflation),
craniofacial defects (dysmorphology of the head or eyes), edema, spinal defects (curved spine), pigmentation,
position (in water column, either persistent lying on side or upside down), tail defects (e.g., kinks), or blood
pooling. Each assessor also assigned every larva a general ranking for overall condition: normal (no defects
present), abnormal (defects present), or severely abnormal (life-threatening defects present). Observations
were recorded for each chemical and concentration. If any of these defects were present in a larva, an additional
endpoint "any" was set to 1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

100 nM
Key positive control:

Chlorpyrifos

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For endpoint-chemical per concentration index, counts were aggregated to a percentage, called
endpoint scores, with dead larvae excluded. For example, if there were 5 (out of 6) living larvae and 2 had edema
at a tested concentration, the edema score would equal 2/5 * 100 = 40%. No additional normalization was
performed, and outliers were not excluded. For each endpoint-chemical pair, the concentration-response series
was fit to 5 bounded models (constant, hill, gain-loss, exponential 4-5) with the winning model selected by the
lowest Akaike Information Criteria (AIC) score, a statistical calculation to compare model quality. Unbounded
models available in tcpl were excluded given the dichotomous nature of observations. To estimate activity, a
cutoff threshold was set at 16% and a continuous hit call (hitc) value was determined as the product of the
following components: 1) at least one median response greater than the assay cutoff threshold, 2) the maximal
efficacy in the fitted response is larger than the assay cutoff, 3) the AIC score of the winning model is less than
the constant model. Classification criteria for continuous hit calls were set in line with other in vitro screening
efforts as: hitc = 0 as negative, 0 < hitc < 0.9 as equivocal, hitc > 0.9 as positive. In addition to estimated activity
concentrations inducing a specified level of responses (e.g. 10%, 50%, etc.), a benchmark concentration (BMC)
was also derived in tcpl using a specified benchmark response level (BMR) of 1.349 times the standard derivation
of baseline (10%). Given lack of baseline variability given dichotomous observations, the baseline median
absolute deviation was set as 5%.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero

-------
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative
analysis direction. Typically used for endpoints where only positive responses are biologically relevant.),
29: pcl6 (Add a cutoff value of 16. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 185	Number of chemicals tested: 182

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: Oihitc 0.9
185

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

0
0

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

-------
quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

0

100

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

0

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 100.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3213

CCTE_Padilla_ZF_Score.General

1.	General Information

1.1	Assay Title: CCTE's 144 Hour Post-fertilization Zebrafish Assay Score for General Malformation, Padilla Lab

1.2	Assay Summary: CCTE_Padilla_ZF_Phenotype_Score is a whole embryo, multiplexed assay using zebrafish ova
exposed for 144 hours post fertilization on a 96-well plate. CCTE_Padilla_ZF_Score.General is one of one assay
component(s) measured or calculated from the CCTE_Padilla_ZF_Phenotype_Score assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component CCTE_Padilla_ZF_Score.General was analyzed into 1 assay endpoints.
This assay endpoint, CCTE_Padilla_ZF_Score.General, was analyzed in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of viability reporter, gain-of-
signal activity can be used to understand changes in developmental as they relate to the whole embryo. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is embryonic morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Padilla laboratory at the EPA Center for Computational Toxicology focuses on the
development and screening of zebrafish assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; larva were assessed under a dissection microscope (Olympus
SZH10 Research Stereo).

1.9	Assay Throughput: 96-well plate. Zebrafish embryos plated on 96-well plates and exposed from 6 hours post-
fertilization to 144 hours post-fertilization to ToxCast chemicals.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in

-------
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Wild-type zebrafish (Danio rerio) embryos placed 1
embryo per well in a 96-well plate. All research and breeding procedures in this study were reviewed and
approved by the Office of Research and Development's Health Institutional Animal Care and Use Committee
(IACUC) at the U.S. EPA in Research Triangle Park, NC (Protocol 21-08-003; approved Aug. 8, 2018 and Protocol
24-09-002; approved Sept. 2, 2021). The animal facility is an internationally accredited Association for
Assessment and Accreditation of Laboratory Animal Care (AAALAC) facility (Unit 000509). The parental fish were
wild-type adult zebrafish (Danio rerio) descended from an undefined outbred stock originally supplied by both
Aquatic Research Organisms, Hampton, NH, and EkkWill Waterlife Resources, Ruskin, FL The adult zebrafish
were maintained at a density of 7 fish/L in 3.5-liter tanks and housed in recirculating zebrafish housing racks
(Tecniplast USA, West Chester, PA) with reverse osmosis purified tap water (Durham, NC) which was buffered
with Instant Ocean Sea Salt (Spectrum Brands, Blacksburg, VA) and sodium bicarbonate (Church & Dwight, Co.,
Ewing, NJ). The water was maintained at 28C, pH 7.4, and conductivity (1000 uS/cm), with ammonia and nitrite
(maintained at 0 ppm) and nitrate (allowed in insignificant amounts). The fish were fed twice a day with
decapsulated artemia (E-Z Egg; Brine Shrimp Direct, Ogden, UT) and Gemma Micro 300 formulated diet
(Skretting, Westbrook, ME). Housing rooms were illuminated on a 14:10 h light: dark cycle (lights on at 07:00 h).
For embryo production, groups of approximately 150 same age mixed sex zebrafish (ages ranging from 3 to 15
months old) were moved into 16-L on rack recirculating spawning tanks (Z-Park tanks, Tecniplast USA, West
Chester, PA) about one week before embryos were needed. Then on the afternoon before embryos were
needed, mesh spawning platform inserts were added. Embryos were collected the following morning
approximately 45 minutes after the room lights came on (07:45 h) and were maintained at 28C for 1 to 2 hours
until washing.

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from wild type adult zebrafish (undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842). Zebrafish are a good model in which to study
metabolism because they possess all the key organs required for metabolic control in humans, from the appetite
circuits that are present in the hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle
and white adipose tissue (WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Wild type adult zebrafish (Danio rerio; undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842) were housed in an AAALAC-approved animal
facility at 28C with a 14:10 h light: dark cycle (lights on at 08:30 h). Adult fish (2-3 females per male; density =
15-20 adults per tank) were kept in one of several 9-liter (L) flow-through colony tanks (Aquaneering Inc., San
Diego, CA). All adults in a colony tank were placed in a 2 L (static) breeding tank (Aquatic Habitats, Apopka, FL)
one hour prior to light onset. Typically, adults from two to three colony tanks were mated on the same day. Two
hours after light onset the adults were returned to the colony tank. All embryos were gathered from each
breeder tank, pooled, and placed in a 28C water bath for 2 h, followed by two washes with 0.06% bleach (v/v)
in 10% Hanks' Balanced Salt Solution (13.7 mM NaCI 0.54 mM KCI, 25 uM Na2HP04, 130 uM CaCI2 100 um
MgS04and 420 um NaHC03,), hereafter referred to as Hanks' solution, for 5 min in order to remove any residual
bacteria or fungi. Immediately following washing, the embryos were examined, and healthy embryos were
separated from dead or unfertilized eggs and moved into fresh Hanks'. Chemical exposures: Zebrafish embryos
were exposed in 96-well plates. Between 6- and 8-hours post fertilization (hpf), healthy embryos (with chorions)
were transferred, one embryo per well, into 96 well (0.5 mL) microtiter plates (Cell Culture-Treated, Flat-Bottom
Microplate 96 well [Corning Costar, Kennebunk, ME; Cat 09-761-145]) filled with 200 uL of Hanks' solution. A
random number generator was used to assign the order in which the rows of embryos were plated in 96 well
plates each week. After plating, the embryos were dosed with 0.8 uL of the chemical dosing solution then
immediately sealed with AlumaSeal II (Excel Scientific Inc, Victorville, CA) to prevent volatilization of the

-------
chemicals, and then placed in a leakproof secondary container. These containers were placed in an incubator
maintained on a 14:10 h light: dark cycle at 26C for rearing. On 5 dpf, live larvae were gently moved out of the
plate with test chemicals and into a new 96 well mesh plate (Millipore Corp., Bedford, MA) with just the Hanks'
solution. Once within the mesh well plate, the animals were rinsed with 400 uL of fresh Hanks' three times, and
then the plates were covered with a non-adhesive material (Microseal A, BioRad, Hercules, CA), plate lid added,
and then plate sides wrapped in Parafilm (PM992, Bermis Company, Neenah, Wl), and returned to the incubator.
This rinse on 5 dpf was conducted to lessen the possible exposure of the human assessors to the chemicals
during detailed examination of each larva at 6 dpf. Larval assessments: On 6 dpf between 8 and 10 AM, two
individual assessors, blinded to the chemical treatments, independently examined each larva for mortality,
hatching status, and malformations using an Olympus SZH10 stereo microscope. Mortality was defined as a lack
of heartbeat or presence of coagulation. Malformations were defined as uninflated swim bladder, craniofacial
defects, edema, spinal defects, decreased pigmentation, abnormal position in water column, tail defects, or
blood pooling. If more than 15% of the negative control larvae were abnormal (i.e., dead, not hatched, or
malformed) or if the positive control larvae were not at least 50% abnormal, that plate was not used for any
analysis. Over the course of the experiment there were no plates that needed to be removed from analysis
based on the criteria above. Additionally, the overall rate of normal animals in the controls for the entire study
was 97%. After all plates were assessed, the larvae were anaesthetized using cold shock, and then euthanized
with cold 20% bleach solution

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 16
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of counts of larvae observed at each of the
twelve endpoints: living (dead or alive), hatching (hatched or unhatched), swim bladder (non-inflation),
craniofacial defects (dysmorphology of the head or eyes), edema, spinal defects (curved spine), pigmentation,
position (in water column, either persistent lying on side or upside down), tail defects (e.g., kinks), or blood
pooling. Each assessor also assigned every larva a general ranking for overall condition: normal (no defects
present), abnormal (defects present), or severely abnormal (life-threatening defects present). Observations
were recorded for each chemical and concentration. If any of these defects were present in a larva, an additional
endpoint "any" was set to 1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

100 nM
Key positive control:

Chlorpyrifos

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For endpoint-chemical per concentration index, counts were aggregated to a percentage, called
endpoint scores, with dead larvae excluded. For example, if there were 5 (out of 6) living larvae and 2 had edema
at a tested concentration, the edema score would equal 2/5 * 100 = 40%. No additional normalization was
performed, and outliers were not excluded. For each endpoint-chemical pair, the concentration-response series
was fit to 5 bounded models (constant, hill, gain-loss, exponential 4-5) with the winning model selected by the
lowest Akaike Information Criteria (AIC) score, a statistical calculation to compare model quality. Unbounded
models available in tcpl were excluded given the dichotomous nature of observations. To estimate activity, a
cutoff threshold was set at 16% and a continuous hit call (hitc) value was determined as the product of the
following components: 1) at least one median response greater than the assay cutoff threshold, 2) the maximal
efficacy in the fitted response is larger than the assay cutoff, 3) the AIC score of the winning model is less than
the constant model. Classification criteria for continuous hit calls were set in line with other in vitro screening
efforts as: hitc = 0 as negative, 0 < hitc < 0.9 as equivocal, hitc > 0.9 as positive. In addition to estimated activity
concentrations inducing a specified level of responses (e.g. 10%, 50%, etc.), a benchmark concentration (BMC)
was also derived in tcpl using a specified benchmark response level (BMR) of 1.349 times the standard derivation
of baseline (10%). Given lack of baseline variability given dichotomous observations, the baseline median
absolute deviation was set as 5%.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero

-------
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative
analysis direction. Typically used for endpoints where only positive responses are biologically relevant.),
29: pcl6 (Add a cutoff value of 16. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 315	Number of chemicals tested: 312

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
70

Inactive hit count: Oihitc 0.9
245

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

17
7

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

-------
quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

70

123

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

0

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 123.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3214

CCTE_Padilla_ZF_Score.Swim_bladder

1.	General Information

1.1	Assay Title: CCTE's 144 Hour Post-fertilization Zebrafish Assay Score for Swim Bladder Malformation, Padilla Lab

1.2	Assay Summary: CCTE_Padilla_ZF_Phenotype_Score is a whole embryo, multiplexed assay using zebrafish ova
exposed for 144 hours post fertilization on a 96-well plate. CCTE_Padilla_ZF_Score.Swim_bladder is one of one
assay component(s) measured or calculated from the CCTE_Padilla_ZF_Phenotype_Score assay. It is designed
to make measurements of zebrafish development as detected with brightfield microscopy of developing
zebrafish embryos. Data from the assay component CCTE_Padilla_ZF_Score.Swim_bladder was analyzed into 1
assay endpoints. This assay endpoint, CCTE_Padilla_ZF_Score.Swim_bladder, was analyzed in the positive
analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of viability
reporter, gain-of-signal activity can be used to understand changes in developmental as they relate to the
whole embryo. To generalize the intended target to other relatable targets, this assay endpoint is annotated to
the zebrafish development intended target family, where the subfamily is swim bladder morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Padilla laboratory at the EPA Center for Computational Toxicology focuses on the
development and screening of zebrafish assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; larva were assessed under a dissection microscope (Olympus
SZH10 Research Stereo).

1.9	Assay Throughput: 96-well plate. Zebrafish embryos plated on 96-well plates and exposed from 6 hours post-
fertilization to 144 hours post-fertilization to ToxCast chemicals.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in

-------
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Wild-type zebrafish (Danio rerio) embryos placed 1
embryo per well in a 96-well plate. All research and breeding procedures in this study were reviewed and
approved by the Office of Research and Development's Health Institutional Animal Care and Use Committee
(IACUC) at the U.S. EPA in Research Triangle Park, NC (Protocol 21-08-003; approved Aug. 8, 2018 and Protocol
24-09-002; approved Sept. 2, 2021). The animal facility is an internationally accredited Association for
Assessment and Accreditation of Laboratory Animal Care (AAALAC) facility (Unit 000509). The parental fish were
wild-type adult zebrafish (Danio rerio) descended from an undefined outbred stock originally supplied by both
Aquatic Research Organisms, Hampton, NH, and EkkWill Waterlife Resources, Ruskin, FL The adult zebrafish
were maintained at a density of 7 fish/L in 3.5-liter tanks and housed in recirculating zebrafish housing racks
(Tecniplast USA, West Chester, PA) with reverse osmosis purified tap water (Durham, NC) which was buffered
with Instant Ocean Sea Salt (Spectrum Brands, Blacksburg, VA) and sodium bicarbonate (Church & Dwight, Co.,
Ewing, NJ). The water was maintained at 28C, pH 7.4, and conductivity (1000 uS/cm), with ammonia and nitrite
(maintained at 0 ppm) and nitrate (allowed in insignificant amounts). The fish were fed twice a day with
decapsulated artemia (E-Z Egg; Brine Shrimp Direct, Ogden, UT) and Gemma Micro 300 formulated diet
(Skretting, Westbrook, ME). Housing rooms were illuminated on a 14:10 h light: dark cycle (lights on at 07:00 h).
For embryo production, groups of approximately 150 same age mixed sex zebrafish (ages ranging from 3 to 15
months old) were moved into 16-L on rack recirculating spawning tanks (Z-Park tanks, Tecniplast USA, West
Chester, PA) about one week before embryos were needed. Then on the afternoon before embryos were
needed, mesh spawning platform inserts were added. Embryos were collected the following morning
approximately 45 minutes after the room lights came on (07:45 h) and were maintained at 28C for 1 to 2 hours
until washing.

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from wild type adult zebrafish (undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842). Zebrafish are a good model in which to study
metabolism because they possess all the key organs required for metabolic control in humans, from the appetite
circuits that are present in the hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle
and white adipose tissue (WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Wild type adult zebrafish (Danio rerio; undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842) were housed in an AAALAC-approved animal
facility at 28C with a 14:10 h light: dark cycle (lights on at 08:30 h). Adult fish (2-3 females per male; density =
15-20 adults per tank) were kept in one of several 9-liter (L) flow-through colony tanks (Aquaneering Inc., San
Diego, CA). All adults in a colony tank were placed in a 2 L (static) breeding tank (Aquatic Habitats, Apopka, FL)
one hour prior to light onset. Typically, adults from two to three colony tanks were mated on the same day. Two
hours after light onset the adults were returned to the colony tank. All embryos were gathered from each
breeder tank, pooled, and placed in a 28C water bath for 2 h, followed by two washes with 0.06% bleach (v/v)
in 10% Hanks' Balanced Salt Solution (13.7 mM NaCI 0.54 mM KCI, 25 uM Na2HP04, 130 uM CaCI2 100 um
MgS04and 420 um NaHC03,), hereafter referred to as Hanks' solution, for 5 min in order to remove any residual
bacteria or fungi. Immediately following washing, the embryos were examined, and healthy embryos were
separated from dead or unfertilized eggs and moved into fresh Hanks'. Chemical exposures: Zebrafish embryos
were exposed in 96-well plates. Between 6- and 8-hours post fertilization (hpf), healthy embryos (with chorions)
were transferred, one embryo per well, into 96 well (0.5 mL) microtiter plates (Cell Culture-Treated, Flat-Bottom
Microplate 96 well [Corning Costar, Kennebunk, ME; Cat 09-761-145]) filled with 200 uL of Hanks' solution. A
random number generator was used to assign the order in which the rows of embryos were plated in 96 well
plates each week. After plating, the embryos were dosed with 0.8 uL of the chemical dosing solution then
immediately sealed with AlumaSeal II (Excel Scientific Inc, Victorville, CA) to prevent volatilization of the

-------
chemicals, and then placed in a leakproof secondary container. These containers were placed in an incubator
maintained on a 14:10 h light: dark cycle at 26C for rearing. On 5 dpf, live larvae were gently moved out of the
plate with test chemicals and into a new 96 well mesh plate (Millipore Corp., Bedford, MA) with just the Hanks'
solution. Once within the mesh well plate, the animals were rinsed with 400 uL of fresh Hanks' three times, and
then the plates were covered with a non-adhesive material (Microseal A, BioRad, Hercules, CA), plate lid added,
and then plate sides wrapped in Parafilm (PM992, Bermis Company, Neenah, Wl), and returned to the incubator.
This rinse on 5 dpf was conducted to lessen the possible exposure of the human assessors to the chemicals
during detailed examination of each larva at 6 dpf. Larval assessments: On 6 dpf between 8 and 10 AM, two
individual assessors, blinded to the chemical treatments, independently examined each larva for mortality,
hatching status, and malformations using an Olympus SZH10 stereo microscope. Mortality was defined as a lack
of heartbeat or presence of coagulation. Malformations were defined as uninflated swim bladder, craniofacial
defects, edema, spinal defects, decreased pigmentation, abnormal position in water column, tail defects, or
blood pooling. If more than 15% of the negative control larvae were abnormal (i.e., dead, not hatched, or
malformed) or if the positive control larvae were not at least 50% abnormal, that plate was not used for any
analysis. Over the course of the experiment there were no plates that needed to be removed from analysis
based on the criteria above. Additionally, the overall rate of normal animals in the controls for the entire study
was 97%. After all plates were assessed, the larvae were anaesthetized using cold shock, and then euthanized
with cold 20% bleach solution

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 16
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of counts of larvae observed at each of the
twelve endpoints: living (dead or alive), hatching (hatched or unhatched), swim bladder (non-inflation),
craniofacial defects (dysmorphology of the head or eyes), edema, spinal defects (curved spine), pigmentation,
position (in water column, either persistent lying on side or upside down), tail defects (e.g., kinks), or blood
pooling. Each assessor also assigned every larva a general ranking for overall condition: normal (no defects
present), abnormal (defects present), or severely abnormal (life-threatening defects present). Observations
were recorded for each chemical and concentration. If any of these defects were present in a larva, an additional
endpoint "any" was set to 1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

100 nM
Key positive control:

Chlorpyrifos

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For endpoint-chemical per concentration index, counts were aggregated to a percentage, called
endpoint scores, with dead larvae excluded. For example, if there were 5 (out of 6) living larvae and 2 had edema
at a tested concentration, the edema score would equal 2/5 * 100 = 40%. No additional normalization was
performed, and outliers were not excluded. For each endpoint-chemical pair, the concentration-response series
was fit to 5 bounded models (constant, hill, gain-loss, exponential 4-5) with the winning model selected by the
lowest Akaike Information Criteria (AIC) score, a statistical calculation to compare model quality. Unbounded
models available in tcpl were excluded given the dichotomous nature of observations. To estimate activity, a
cutoff threshold was set at 16% and a continuous hit call (hitc) value was determined as the product of the
following components: 1) at least one median response greater than the assay cutoff threshold, 2) the maximal
efficacy in the fitted response is larger than the assay cutoff, 3) the AIC score of the winning model is less than
the constant model. Classification criteria for continuous hit calls were set in line with other in vitro screening
efforts as: hitc = 0 as negative, 0 < hitc < 0.9 as equivocal, hitc > 0.9 as positive. In addition to estimated activity
concentrations inducing a specified level of responses (e.g. 10%, 50%, etc.), a benchmark concentration (BMC)
was also derived in tcpl using a specified benchmark response level (BMR) of 1.349 times the standard derivation
of baseline (10%). Given lack of baseline variability given dichotomous observations, the baseline median
absolute deviation was set as 5%.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero

-------
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative
analysis direction. Typically used for endpoints where only positive responses are biologically relevant.),
29: pcl6 (Add a cutoff value of 16. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 185	Number of chemicals tested: 182

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
41

Inactive hit count: Oihitc 0.9
144

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

14
7

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

-------
quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

40

39

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

0

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 40.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3215

CCTE_Pad i I la_ZF_Score.Cra n iofacia I

1.	General Information

1.1	Assay Title: CCTE's 144 Hour Post-fertilization Zebrafish Assay Score for Craniofacial Malformation, Padilla Lab

1.2	Assay Summary: CCTE_Padilla_ZF_Phenotype_Score is a whole embryo, multiplexed assay using zebrafish ova
exposed for 144 hours post fertilization on a 96-well plate. CCTE_Padilla_ZF_Score.Craniofacial is one of one
assay component(s) measured or calculated from the CCTE_Padilla_ZF_Phenotype_Score assay. It is designed
to make measurements of zebrafish development as detected with brightfield microscopy of developing
zebrafish embryos. Data from the assay component CCTE_Padilla_ZF_Score.Craniofacial was analyzed into 1
assay endpoints. This assay endpoint, CCTE_Padilla_ZF_Score.Craniofacial, was analyzed in the positive analysis
fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of viability
reporter, gain-of-signal activity can be used to understand changes in developmental as they relate to the
whole embryo. To generalize the intended target to other relatable targets, this assay endpoint is annotated to
the zebrafish development intended target family, where the subfamily is craniofacial morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Padilla laboratory at the EPA Center for Computational Toxicology focuses on the
development and screening of zebrafish assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; larva were assessed under a dissection microscope (Olympus
SZH10 Research Stereo).

1.9	Assay Throughput: 96-well plate. Zebrafish embryos plated on 96-well plates and exposed from 6 hours post-
fertilization to 144 hours post-fertilization to ToxCast chemicals.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in

-------
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Wild-type zebrafish (Danio rerio) embryos placed 1
embryo per well in a 96-well plate. All research and breeding procedures in this study were reviewed and
approved by the Office of Research and Development's Health Institutional Animal Care and Use Committee
(IACUC) at the U.S. EPA in Research Triangle Park, NC (Protocol 21-08-003; approved Aug. 8, 2018 and Protocol
24-09-002; approved Sept. 2, 2021). The animal facility is an internationally accredited Association for
Assessment and Accreditation of Laboratory Animal Care (AAALAC) facility (Unit 000509). The parental fish were
wild-type adult zebrafish (Danio rerio) descended from an undefined outbred stock originally supplied by both
Aquatic Research Organisms, Hampton, NH, and EkkWill Waterlife Resources, Ruskin, FL The adult zebrafish
were maintained at a density of 7 fish/L in 3.5-liter tanks and housed in recirculating zebrafish housing racks
(Tecniplast USA, West Chester, PA) with reverse osmosis purified tap water (Durham, NC) which was buffered
with Instant Ocean Sea Salt (Spectrum Brands, Blacksburg, VA) and sodium bicarbonate (Church & Dwight, Co.,
Ewing, NJ). The water was maintained at 28C, pH 7.4, and conductivity (1000 uS/cm), with ammonia and nitrite
(maintained at 0 ppm) and nitrate (allowed in insignificant amounts). The fish were fed twice a day with
decapsulated artemia (E-Z Egg; Brine Shrimp Direct, Ogden, UT) and Gemma Micro 300 formulated diet
(Skretting, Westbrook, ME). Housing rooms were illuminated on a 14:10 h light: dark cycle (lights on at 07:00 h).
For embryo production, groups of approximately 150 same age mixed sex zebrafish (ages ranging from 3 to 15
months old) were moved into 16-L on rack recirculating spawning tanks (Z-Park tanks, Tecniplast USA, West
Chester, PA) about one week before embryos were needed. Then on the afternoon before embryos were
needed, mesh spawning platform inserts were added. Embryos were collected the following morning
approximately 45 minutes after the room lights came on (07:45 h) and were maintained at 28C for 1 to 2 hours
until washing.

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from wild type adult zebrafish (undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842). Zebrafish are a good model in which to study
metabolism because they possess all the key organs required for metabolic control in humans, from the appetite
circuits that are present in the hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle
and white adipose tissue (WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Wild type adult zebrafish (Danio rerio; undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842) were housed in an AAALAC-approved animal
facility at 28C with a 14:10 h light: dark cycle (lights on at 08:30 h). Adult fish (2-3 females per male; density =
15-20 adults per tank) were kept in one of several 9-liter (L) flow-through colony tanks (Aquaneering Inc., San
Diego, CA). All adults in a colony tank were placed in a 2 L (static) breeding tank (Aquatic Habitats, Apopka, FL)
one hour prior to light onset. Typically, adults from two to three colony tanks were mated on the same day. Two
hours after light onset the adults were returned to the colony tank. All embryos were gathered from each
breeder tank, pooled, and placed in a 28C water bath for 2 h, followed by two washes with 0.06% bleach (v/v)
in 10% Hanks' Balanced Salt Solution (13.7 mM NaCI 0.54 mM KCI, 25 uM Na2HP04, 130 uM CaCI2 100 um
MgS04and 420 um NaHC03,), hereafter referred to as Hanks' solution, for 5 min in order to remove any residual
bacteria or fungi. Immediately following washing, the embryos were examined, and healthy embryos were
separated from dead or unfertilized eggs and moved into fresh Hanks'. Chemical exposures: Zebrafish embryos
were exposed in 96-well plates. Between 6- and 8-hours post fertilization (hpf), healthy embryos (with chorions)
were transferred, one embryo per well, into 96 well (0.5 mL) microtiter plates (Cell Culture-Treated, Flat-Bottom
Microplate 96 well [Corning Costar, Kennebunk, ME; Cat 09-761-145]) filled with 200 uL of Hanks' solution. A
random number generator was used to assign the order in which the rows of embryos were plated in 96 well
plates each week. After plating, the embryos were dosed with 0.8 uL of the chemical dosing solution then
immediately sealed with AlumaSeal II (Excel Scientific Inc, Victorville, CA) to prevent volatilization of the

-------
chemicals, and then placed in a leakproof secondary container. These containers were placed in an incubator
maintained on a 14:10 h light: dark cycle at 26C for rearing. On 5 dpf, live larvae were gently moved out of the
plate with test chemicals and into a new 96 well mesh plate (Millipore Corp., Bedford, MA) with just the Hanks'
solution. Once within the mesh well plate, the animals were rinsed with 400 uL of fresh Hanks' three times, and
then the plates were covered with a non-adhesive material (Microseal A, BioRad, Hercules, CA), plate lid added,
and then plate sides wrapped in Parafilm (PM992, Bermis Company, Neenah, Wl), and returned to the incubator.
This rinse on 5 dpf was conducted to lessen the possible exposure of the human assessors to the chemicals
during detailed examination of each larva at 6 dpf. Larval assessments: On 6 dpf between 8 and 10 AM, two
individual assessors, blinded to the chemical treatments, independently examined each larva for mortality,
hatching status, and malformations using an Olympus SZH10 stereo microscope. Mortality was defined as a lack
of heartbeat or presence of coagulation. Malformations were defined as uninflated swim bladder, craniofacial
defects, edema, spinal defects, decreased pigmentation, abnormal position in water column, tail defects, or
blood pooling. If more than 15% of the negative control larvae were abnormal (i.e., dead, not hatched, or
malformed) or if the positive control larvae were not at least 50% abnormal, that plate was not used for any
analysis. Over the course of the experiment there were no plates that needed to be removed from analysis
based on the criteria above. Additionally, the overall rate of normal animals in the controls for the entire study
was 97%. After all plates were assessed, the larvae were anaesthetized using cold shock, and then euthanized
with cold 20% bleach solution

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 16
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of counts of larvae observed at each of the
twelve endpoints: living (dead or alive), hatching (hatched or unhatched), swim bladder (non-inflation),
craniofacial defects (dysmorphology of the head or eyes), edema, spinal defects (curved spine), pigmentation,
position (in water column, either persistent lying on side or upside down), tail defects (e.g., kinks), or blood
pooling. Each assessor also assigned every larva a general ranking for overall condition: normal (no defects
present), abnormal (defects present), or severely abnormal (life-threatening defects present). Observations
were recorded for each chemical and concentration. If any of these defects were present in a larva, an additional
endpoint "any" was set to 1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

100 nM
Key positive control:

Chlorpyrifos

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For endpoint-chemical per concentration index, counts were aggregated to a percentage, called
endpoint scores, with dead larvae excluded. For example, if there were 5 (out of 6) living larvae and 2 had edema
at a tested concentration, the edema score would equal 2/5 * 100 = 40%. No additional normalization was
performed, and outliers were not excluded. For each endpoint-chemical pair, the concentration-response series
was fit to 5 bounded models (constant, hill, gain-loss, exponential 4-5) with the winning model selected by the
lowest Akaike Information Criteria (AIC) score, a statistical calculation to compare model quality. Unbounded
models available in tcpl were excluded given the dichotomous nature of observations. To estimate activity, a
cutoff threshold was set at 16% and a continuous hit call (hitc) value was determined as the product of the
following components: 1) at least one median response greater than the assay cutoff threshold, 2) the maximal
efficacy in the fitted response is larger than the assay cutoff, 3) the AIC score of the winning model is less than
the constant model. Classification criteria for continuous hit calls were set in line with other in vitro screening
efforts as: hitc = 0 as negative, 0 < hitc < 0.9 as equivocal, hitc > 0.9 as positive. In addition to estimated activity
concentrations inducing a specified level of responses (e.g. 10%, 50%, etc.), a benchmark concentration (BMC)
was also derived in tcpl using a specified benchmark response level (BMR) of 1.349 times the standard derivation
of baseline (10%). Given lack of baseline variability given dichotomous observations, the baseline median
absolute deviation was set as 5%.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero

-------
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative
analysis direction. Typically used for endpoints where only positive responses are biologically relevant.),
29: pcl6 (Add a cutoff value of 16. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 185	Number of chemicals tested: 182

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
21

Inactive hit count: Oihitc 0.9
164

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

9
3

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

-------
quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

32

56

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

0

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 56.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3216

CCTE_Padilla_ZF_Score.Edema

1.	General Information

1.1	Assay Title: CCTE's 144 Hour Post-fertilization Zebrafish Assay Score for Edema Malformation, Padilla Lab

1.2	Assay Summary: CCTE_Padilla_ZF_Phenotype_Score is a whole embryo, multiplexed assay using zebrafish ova
exposed for 144 hours post fertilization on a 96-well plate. CCTE_Padilla_ZF_Score.Edema is one of one assay
component(s) measured or calculated from the CCTE_Padilla_ZF_Phenotype_Score assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component CCTE_Padilla_ZF_Score.Edema was analyzed into 1 assay endpoints.
This assay endpoint, CCTE_Padilla_ZF_Score.Edema, was analyzed in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of viability reporter, gain-of-
signal activity can be used to understand changes in developmental as they relate to the whole embryo. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is embryonic edema.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Padilla laboratory at the EPA Center for Computational Toxicology focuses on the
development and screening of zebrafish assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; larva were assessed under a dissection microscope (Olympus
SZH10 Research Stereo).

1.9	Assay Throughput: 96-well plate. Zebrafish embryos plated on 96-well plates and exposed from 6 hours post-
fertilization to 144 hours post-fertilization to ToxCast chemicals.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in

-------
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Wild-type zebrafish (Danio rerio) embryos placed 1
embryo per well in a 96-well plate. All research and breeding procedures in this study were reviewed and
approved by the Office of Research and Development's Health Institutional Animal Care and Use Committee
(IACUC) at the U.S. EPA in Research Triangle Park, NC (Protocol 21-08-003; approved Aug. 8, 2018 and Protocol
24-09-002; approved Sept. 2, 2021). The animal facility is an internationally accredited Association for
Assessment and Accreditation of Laboratory Animal Care (AAALAC) facility (Unit 000509). The parental fish were
wild-type adult zebrafish (Danio rerio) descended from an undefined outbred stock originally supplied by both
Aquatic Research Organisms, Hampton, NH, and EkkWill Waterlife Resources, Ruskin, FL The adult zebrafish
were maintained at a density of 7 fish/L in 3.5-liter tanks and housed in recirculating zebrafish housing racks
(Tecniplast USA, West Chester, PA) with reverse osmosis purified tap water (Durham, NC) which was buffered
with Instant Ocean Sea Salt (Spectrum Brands, Blacksburg, VA) and sodium bicarbonate (Church & Dwight, Co.,
Ewing, NJ). The water was maintained at 28C, pH 7.4, and conductivity (1000 uS/cm), with ammonia and nitrite
(maintained at 0 ppm) and nitrate (allowed in insignificant amounts). The fish were fed twice a day with
decapsulated artemia (E-Z Egg; Brine Shrimp Direct, Ogden, UT) and Gemma Micro 300 formulated diet
(Skretting, Westbrook, ME). Housing rooms were illuminated on a 14:10 h light: dark cycle (lights on at 07:00 h).
For embryo production, groups of approximately 150 same age mixed sex zebrafish (ages ranging from 3 to 15
months old) were moved into 16-L on rack recirculating spawning tanks (Z-Park tanks, Tecniplast USA, West
Chester, PA) about one week before embryos were needed. Then on the afternoon before embryos were
needed, mesh spawning platform inserts were added. Embryos were collected the following morning
approximately 45 minutes after the room lights came on (07:45 h) and were maintained at 28C for 1 to 2 hours
until washing.

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from wild type adult zebrafish (undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842). Zebrafish are a good model in which to study
metabolism because they possess all the key organs required for metabolic control in humans, from the appetite
circuits that are present in the hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle
and white adipose tissue (WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Wild type adult zebrafish (Danio rerio; undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842) were housed in an AAALAC-approved animal
facility at 28C with a 14:10 h light: dark cycle (lights on at 08:30 h). Adult fish (2-3 females per male; density =
15-20 adults per tank) were kept in one of several 9-liter (L) flow-through colony tanks (Aquaneering Inc., San
Diego, CA). All adults in a colony tank were placed in a 2 L (static) breeding tank (Aquatic Habitats, Apopka, FL)
one hour prior to light onset. Typically, adults from two to three colony tanks were mated on the same day. Two
hours after light onset the adults were returned to the colony tank. All embryos were gathered from each
breeder tank, pooled, and placed in a 28C water bath for 2 h, followed by two washes with 0.06% bleach (v/v)
in 10% Hanks' Balanced Salt Solution (13.7 mM NaCI 0.54 mM KCI, 25 uM Na2HP04, 130 uM CaCI2 100 um
MgS04and 420 um NaHC03,), hereafter referred to as Hanks' solution, for 5 min in order to remove any residual
bacteria or fungi. Immediately following washing, the embryos were examined, and healthy embryos were
separated from dead or unfertilized eggs and moved into fresh Hanks'. Chemical exposures: Zebrafish embryos
were exposed in 96-well plates. Between 6- and 8-hours post fertilization (hpf), healthy embryos (with chorions)
were transferred, one embryo per well, into 96 well (0.5 mL) microtiter plates (Cell Culture-Treated, Flat-Bottom
Microplate 96 well [Corning Costar, Kennebunk, ME; Cat 09-761-145]) filled with 200 uL of Hanks' solution. A
random number generator was used to assign the order in which the rows of embryos were plated in 96 well
plates each week. After plating, the embryos were dosed with 0.8 uL of the chemical dosing solution then
immediately sealed with AlumaSeal II (Excel Scientific Inc, Victorville, CA) to prevent volatilization of the

-------
chemicals, and then placed in a leakproof secondary container. These containers were placed in an incubator
maintained on a 14:10 h light: dark cycle at 26C for rearing. On 5 dpf, live larvae were gently moved out of the
plate with test chemicals and into a new 96 well mesh plate (Millipore Corp., Bedford, MA) with just the Hanks'
solution. Once within the mesh well plate, the animals were rinsed with 400 uL of fresh Hanks' three times, and
then the plates were covered with a non-adhesive material (Microseal A, BioRad, Hercules, CA), plate lid added,
and then plate sides wrapped in Parafilm (PM992, Bermis Company, Neenah, Wl), and returned to the incubator.
This rinse on 5 dpf was conducted to lessen the possible exposure of the human assessors to the chemicals
during detailed examination of each larva at 6 dpf. Larval assessments: On 6 dpf between 8 and 10 AM, two
individual assessors, blinded to the chemical treatments, independently examined each larva for mortality,
hatching status, and malformations using an Olympus SZH10 stereo microscope. Mortality was defined as a lack
of heartbeat or presence of coagulation. Malformations were defined as uninflated swim bladder, craniofacial
defects, edema, spinal defects, decreased pigmentation, abnormal position in water column, tail defects, or
blood pooling. If more than 15% of the negative control larvae were abnormal (i.e., dead, not hatched, or
malformed) or if the positive control larvae were not at least 50% abnormal, that plate was not used for any
analysis. Over the course of the experiment there were no plates that needed to be removed from analysis
based on the criteria above. Additionally, the overall rate of normal animals in the controls for the entire study
was 97%. After all plates were assessed, the larvae were anaesthetized using cold shock, and then euthanized
with cold 20% bleach solution

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 16
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of counts of larvae observed at each of the
twelve endpoints: living (dead or alive), hatching (hatched or unhatched), swim bladder (non-inflation),
craniofacial defects (dysmorphology of the head or eyes), edema, spinal defects (curved spine), pigmentation,
position (in water column, either persistent lying on side or upside down), tail defects (e.g., kinks), or blood
pooling. Each assessor also assigned every larva a general ranking for overall condition: normal (no defects
present), abnormal (defects present), or severely abnormal (life-threatening defects present). Observations
were recorded for each chemical and concentration. If any of these defects were present in a larva, an additional
endpoint "any" was set to 1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

100 nM
Key positive control:

Chlorpyrifos

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For endpoint-chemical per concentration index, counts were aggregated to a percentage, called
endpoint scores, with dead larvae excluded. For example, if there were 5 (out of 6) living larvae and 2 had edema
at a tested concentration, the edema score would equal 2/5 * 100 = 40%. No additional normalization was
performed, and outliers were not excluded. For each endpoint-chemical pair, the concentration-response series
was fit to 5 bounded models (constant, hill, gain-loss, exponential 4-5) with the winning model selected by the
lowest Akaike Information Criteria (AIC) score, a statistical calculation to compare model quality. Unbounded
models available in tcpl were excluded given the dichotomous nature of observations. To estimate activity, a
cutoff threshold was set at 16% and a continuous hit call (hitc) value was determined as the product of the
following components: 1) at least one median response greater than the assay cutoff threshold, 2) the maximal
efficacy in the fitted response is larger than the assay cutoff, 3) the AIC score of the winning model is less than
the constant model. Classification criteria for continuous hit calls were set in line with other in vitro screening
efforts as: hitc = 0 as negative, 0 < hitc < 0.9 as equivocal, hitc > 0.9 as positive. In addition to estimated activity
concentrations inducing a specified level of responses (e.g. 10%, 50%, etc.), a benchmark concentration (BMC)
was also derived in tcpl using a specified benchmark response level (BMR) of 1.349 times the standard derivation
of baseline (10%). Given lack of baseline variability given dichotomous observations, the baseline median
absolute deviation was set as 5%.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero

-------
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative
analysis direction. Typically used for endpoints where only positive responses are biologically relevant.),
29: pcl6 (Add a cutoff value of 16. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 185	Number of chemicals tested: 182

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
40

Inactive hit count: Oihitc 0.9
145

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

17
2

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

-------
quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

48

33

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

0

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 48.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3217

CCTE_Padilla_ZF_Score.Spine

1.	General Information

1.1	Assay Title: CCTE's 144 Hour Post-fertilization Zebrafish Assay Score for Spine Malformation, Padilla Lab

1.2	Assay Summary: CCTE_Padilla_ZF_Phenotype_Score is a whole embryo, multiplexed assay using zebrafish ova
exposed for 144 hours post fertilization on a 96-well plate. CCTE_Padilla_ZF_Score.Spine is one of one assay
component(s) measured or calculated from the CCTE_Padilla_ZF_Phenotype_Score assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component CCTE_Padilla_ZF_Score.Spine was analyzed into 1 assay endpoints.
This assay endpoint, CCTE_Padilla_ZF_Score.Spine, was analyzed in the positive analysis fitting direction relative
to DMSO as the negative control and baseline of activity. Using a type of viability reporter, gain-of-signal activity
can be used to understand changes in developmental as they relate to the whole embryo. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the zebrafish development
intended target family, where the subfamily is body axis morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Padilla laboratory at the EPA Center for Computational Toxicology focuses on the
development and screening of zebrafish assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; larva were assessed under a dissection microscope (Olympus
SZH10 Research Stereo).

1.9	Assay Throughput: 96-well plate. Zebrafish embryos plated on 96-well plates and exposed from 6 hours post-
fertilization to 144 hours post-fertilization to ToxCast chemicals.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in

-------
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Wild-type zebrafish (Danio rerio) embryos placed 1
embryo per well in a 96-well plate. All research and breeding procedures in this study were reviewed and
approved by the Office of Research and Development's Health Institutional Animal Care and Use Committee
(IACUC) at the U.S. EPA in Research Triangle Park, NC (Protocol 21-08-003; approved Aug. 8, 2018 and Protocol
24-09-002; approved Sept. 2, 2021). The animal facility is an internationally accredited Association for
Assessment and Accreditation of Laboratory Animal Care (AAALAC) facility (Unit 000509). The parental fish were
wild-type adult zebrafish (Danio rerio) descended from an undefined outbred stock originally supplied by both
Aquatic Research Organisms, Hampton, NH, and EkkWill Waterlife Resources, Ruskin, FL The adult zebrafish
were maintained at a density of 7 fish/L in 3.5-liter tanks and housed in recirculating zebrafish housing racks
(Tecniplast USA, West Chester, PA) with reverse osmosis purified tap water (Durham, NC) which was buffered
with Instant Ocean Sea Salt (Spectrum Brands, Blacksburg, VA) and sodium bicarbonate (Church & Dwight, Co.,
Ewing, NJ). The water was maintained at 28C, pH 7.4, and conductivity (1000 uS/cm), with ammonia and nitrite
(maintained at 0 ppm) and nitrate (allowed in insignificant amounts). The fish were fed twice a day with
decapsulated artemia (E-Z Egg; Brine Shrimp Direct, Ogden, UT) and Gemma Micro 300 formulated diet
(Skretting, Westbrook, ME). Housing rooms were illuminated on a 14:10 h light: dark cycle (lights on at 07:00 h).
For embryo production, groups of approximately 150 same age mixed sex zebrafish (ages ranging from 3 to 15
months old) were moved into 16-L on rack recirculating spawning tanks (Z-Park tanks, Tecniplast USA, West
Chester, PA) about one week before embryos were needed. Then on the afternoon before embryos were
needed, mesh spawning platform inserts were added. Embryos were collected the following morning
approximately 45 minutes after the room lights came on (07:45 h) and were maintained at 28C for 1 to 2 hours
until washing.

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from wild type adult zebrafish (undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842). Zebrafish are a good model in which to study
metabolism because they possess all the key organs required for metabolic control in humans, from the appetite
circuits that are present in the hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle
and white adipose tissue (WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Wild type adult zebrafish (Danio rerio; undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842) were housed in an AAALAC-approved animal
facility at 28C with a 14:10 h light: dark cycle (lights on at 08:30 h). Adult fish (2-3 females per male; density =
15-20 adults per tank) were kept in one of several 9-liter (L) flow-through colony tanks (Aquaneering Inc., San
Diego, CA). All adults in a colony tank were placed in a 2 L (static) breeding tank (Aquatic Habitats, Apopka, FL)
one hour prior to light onset. Typically, adults from two to three colony tanks were mated on the same day. Two
hours after light onset the adults were returned to the colony tank. All embryos were gathered from each
breeder tank, pooled, and placed in a 28C water bath for 2 h, followed by two washes with 0.06% bleach (v/v)
in 10% Hanks' Balanced Salt Solution (13.7 mM NaCI 0.54 mM KCI, 25 uM Na2HP04, 130 uM CaCI2 100 um
MgS04and 420 um NaHC03,), hereafter referred to as Hanks' solution, for 5 min in order to remove any residual
bacteria or fungi. Immediately following washing, the embryos were examined, and healthy embryos were
separated from dead or unfertilized eggs and moved into fresh Hanks'. Chemical exposures: Zebrafish embryos
were exposed in 96-well plates. Between 6- and 8-hours post fertilization (hpf), healthy embryos (with chorions)
were transferred, one embryo per well, into 96 well (0.5 mL) microtiter plates (Cell Culture-Treated, Flat-Bottom
Microplate 96 well [Corning Costar, Kennebunk, ME; Cat 09-761-145]) filled with 200 uL of Hanks' solution. A
random number generator was used to assign the order in which the rows of embryos were plated in 96 well
plates each week. After plating, the embryos were dosed with 0.8 uL of the chemical dosing solution then
immediately sealed with AlumaSeal II (Excel Scientific Inc, Victorville, CA) to prevent volatilization of the

-------
chemicals, and then placed in a leakproof secondary container. These containers were placed in an incubator
maintained on a 14:10 h light: dark cycle at 26C for rearing. On 5 dpf, live larvae were gently moved out of the
plate with test chemicals and into a new 96 well mesh plate (Millipore Corp., Bedford, MA) with just the Hanks'
solution. Once within the mesh well plate, the animals were rinsed with 400 uL of fresh Hanks' three times, and
then the plates were covered with a non-adhesive material (Microseal A, BioRad, Hercules, CA), plate lid added,
and then plate sides wrapped in Parafilm (PM992, Bermis Company, Neenah, Wl), and returned to the incubator.
This rinse on 5 dpf was conducted to lessen the possible exposure of the human assessors to the chemicals
during detailed examination of each larva at 6 dpf. Larval assessments: On 6 dpf between 8 and 10 AM, two
individual assessors, blinded to the chemical treatments, independently examined each larva for mortality,
hatching status, and malformations using an Olympus SZH10 stereo microscope. Mortality was defined as a lack
of heartbeat or presence of coagulation. Malformations were defined as uninflated swim bladder, craniofacial
defects, edema, spinal defects, decreased pigmentation, abnormal position in water column, tail defects, or
blood pooling. If more than 15% of the negative control larvae were abnormal (i.e., dead, not hatched, or
malformed) or if the positive control larvae were not at least 50% abnormal, that plate was not used for any
analysis. Over the course of the experiment there were no plates that needed to be removed from analysis
based on the criteria above. Additionally, the overall rate of normal animals in the controls for the entire study
was 97%. After all plates were assessed, the larvae were anaesthetized using cold shock, and then euthanized
with cold 20% bleach solution

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 16
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of counts of larvae observed at each of the
twelve endpoints: living (dead or alive), hatching (hatched or unhatched), swim bladder (non-inflation),
craniofacial defects (dysmorphology of the head or eyes), edema, spinal defects (curved spine), pigmentation,
position (in water column, either persistent lying on side or upside down), tail defects (e.g., kinks), or blood
pooling. Each assessor also assigned every larva a general ranking for overall condition: normal (no defects
present), abnormal (defects present), or severely abnormal (life-threatening defects present). Observations
were recorded for each chemical and concentration. If any of these defects were present in a larva, an additional
endpoint "any" was set to 1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

100 nM
Key positive control:

Chlorpyrifos

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For endpoint-chemical per concentration index, counts were aggregated to a percentage, called
endpoint scores, with dead larvae excluded. For example, if there were 5 (out of 6) living larvae and 2 had edema
at a tested concentration, the edema score would equal 2/5 * 100 = 40%. No additional normalization was
performed, and outliers were not excluded. For each endpoint-chemical pair, the concentration-response series
was fit to 5 bounded models (constant, hill, gain-loss, exponential 4-5) with the winning model selected by the
lowest Akaike Information Criteria (AIC) score, a statistical calculation to compare model quality. Unbounded
models available in tcpl were excluded given the dichotomous nature of observations. To estimate activity, a
cutoff threshold was set at 16% and a continuous hit call (hitc) value was determined as the product of the
following components: 1) at least one median response greater than the assay cutoff threshold, 2) the maximal
efficacy in the fitted response is larger than the assay cutoff, 3) the AIC score of the winning model is less than
the constant model. Classification criteria for continuous hit calls were set in line with other in vitro screening
efforts as: hitc = 0 as negative, 0 < hitc < 0.9 as equivocal, hitc > 0.9 as positive. In addition to estimated activity
concentrations inducing a specified level of responses (e.g. 10%, 50%, etc.), a benchmark concentration (BMC)
was also derived in tcpl using a specified benchmark response level (BMR) of 1.349 times the standard derivation
of baseline (10%). Given lack of baseline variability given dichotomous observations, the baseline median
absolute deviation was set as 5%.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero

-------
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative
analysis direction. Typically used for endpoints where only positive responses are biologically relevant.),
29: pcl6 (Add a cutoff value of 16. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 185	Number of chemicals tested: 182

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
3

Inactive hit count: Oihitc 0.9
182

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

1
1

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

-------
quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

23

75

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

0

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 75.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3218

CCTE_Padilla_ZF_Score.Pigmentation

1.	General Information

1.1	Assay Title: CCTE's 144 Hour Post-fertilization Zebrafish Assay Score for Pigmentation Malformation, Padilla Lab

1.2	Assay Summary: CCTE_Padilla_ZF_Phenotype_Score is a whole embryo, multiplexed assay using zebrafish ova
exposed for 144 hours post fertilization on a 96-well plate. CCTE_Padilla_ZF_Score.Pigmentation is one of one
assay component(s) measured or calculated from the CCTE_Padilla_ZF_Phenotype_Score assay. It is designed
to make measurements of zebrafish development as detected with brightfield microscopy of developing
zebrafish embryos. Data from the assay component CCTE_Padilla_ZF_Score.Pigmentation was analyzed into 1
assay endpoints. This assay endpoint, CCTE_Padilla_ZF_Score.Pigmentation, was analyzed in the positive
analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of viability
reporter, gain-of-signal activity can be used to understand changes in developmental as they relate to the
whole embryo. To generalize the intended target to other relatable targets, this assay endpoint is annotated to
the zebrafish development intended target family, where the subfamily is embryonic pigmentation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Padilla laboratory at the EPA Center for Computational Toxicology focuses on the
development and screening of zebrafish assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; larva were assessed under a dissection microscope (Olympus
SZH10 Research Stereo).

1.9	Assay Throughput: 96-well plate. Zebrafish embryos plated on 96-well plates and exposed from 6 hours post-
fertilization to 144 hours post-fertilization to ToxCast chemicals.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in

-------
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Wild-type zebrafish (Danio rerio) embryos placed 1
embryo per well in a 96-well plate. All research and breeding procedures in this study were reviewed and
approved by the Office of Research and Development's Health Institutional Animal Care and Use Committee
(IACUC) at the U.S. EPA in Research Triangle Park, NC (Protocol 21-08-003; approved Aug. 8, 2018 and Protocol
24-09-002; approved Sept. 2, 2021). The animal facility is an internationally accredited Association for
Assessment and Accreditation of Laboratory Animal Care (AAALAC) facility (Unit 000509). The parental fish were
wild-type adult zebrafish (Danio rerio) descended from an undefined outbred stock originally supplied by both
Aquatic Research Organisms, Hampton, NH, and EkkWill Waterlife Resources, Ruskin, FL The adult zebrafish
were maintained at a density of 7 fish/L in 3.5-liter tanks and housed in recirculating zebrafish housing racks
(Tecniplast USA, West Chester, PA) with reverse osmosis purified tap water (Durham, NC) which was buffered
with Instant Ocean Sea Salt (Spectrum Brands, Blacksburg, VA) and sodium bicarbonate (Church & Dwight, Co.,
Ewing, NJ). The water was maintained at 28C, pH 7.4, and conductivity (1000 uS/cm), with ammonia and nitrite
(maintained at 0 ppm) and nitrate (allowed in insignificant amounts). The fish were fed twice a day with
decapsulated artemia (E-Z Egg; Brine Shrimp Direct, Ogden, UT) and Gemma Micro 300 formulated diet
(Skretting, Westbrook, ME). Housing rooms were illuminated on a 14:10 h light: dark cycle (lights on at 07:00 h).
For embryo production, groups of approximately 150 same age mixed sex zebrafish (ages ranging from 3 to 15
months old) were moved into 16-L on rack recirculating spawning tanks (Z-Park tanks, Tecniplast USA, West
Chester, PA) about one week before embryos were needed. Then on the afternoon before embryos were
needed, mesh spawning platform inserts were added. Embryos were collected the following morning
approximately 45 minutes after the room lights came on (07:45 h) and were maintained at 28C for 1 to 2 hours
until washing.

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from wild type adult zebrafish (undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842). Zebrafish are a good model in which to study
metabolism because they possess all the key organs required for metabolic control in humans, from the appetite
circuits that are present in the hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle
and white adipose tissue (WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Wild type adult zebrafish (Danio rerio; undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842) were housed in an AAALAC-approved animal
facility at 28C with a 14:10 h light: dark cycle (lights on at 08:30 h). Adult fish (2-3 females per male; density =
15-20 adults per tank) were kept in one of several 9-liter (L) flow-through colony tanks (Aquaneering Inc., San
Diego, CA). All adults in a colony tank were placed in a 2 L (static) breeding tank (Aquatic Habitats, Apopka, FL)
one hour prior to light onset. Typically, adults from two to three colony tanks were mated on the same day. Two
hours after light onset the adults were returned to the colony tank. All embryos were gathered from each
breeder tank, pooled, and placed in a 28C water bath for 2 h, followed by two washes with 0.06% bleach (v/v)
in 10% Hanks' Balanced Salt Solution (13.7 mM NaCI 0.54 mM KCI, 25 uM Na2HP04, 130 uM CaCI2 100 um
MgS04and 420 um NaHC03,), hereafter referred to as Hanks' solution, for 5 min in order to remove any residual
bacteria or fungi. Immediately following washing, the embryos were examined, and healthy embryos were
separated from dead or unfertilized eggs and moved into fresh Hanks'. Chemical exposures: Zebrafish embryos
were exposed in 96-well plates. Between 6- and 8-hours post fertilization (hpf), healthy embryos (with chorions)
were transferred, one embryo per well, into 96 well (0.5 mL) microtiter plates (Cell Culture-Treated, Flat-Bottom
Microplate 96 well [Corning Costar, Kennebunk, ME; Cat 09-761-145]) filled with 200 uL of Hanks' solution. A
random number generator was used to assign the order in which the rows of embryos were plated in 96 well
plates each week. After plating, the embryos were dosed with 0.8 uL of the chemical dosing solution then
immediately sealed with AlumaSeal II (Excel Scientific Inc, Victorville, CA) to prevent volatilization of the

-------
chemicals, and then placed in a leakproof secondary container. These containers were placed in an incubator
maintained on a 14:10 h light: dark cycle at 26C for rearing. On 5 dpf, live larvae were gently moved out of the
plate with test chemicals and into a new 96 well mesh plate (Millipore Corp., Bedford, MA) with just the Hanks'
solution. Once within the mesh well plate, the animals were rinsed with 400 uL of fresh Hanks' three times, and
then the plates were covered with a non-adhesive material (Microseal A, BioRad, Hercules, CA), plate lid added,
and then plate sides wrapped in Parafilm (PM992, Bermis Company, Neenah, Wl), and returned to the incubator.
This rinse on 5 dpf was conducted to lessen the possible exposure of the human assessors to the chemicals
during detailed examination of each larva at 6 dpf. Larval assessments: On 6 dpf between 8 and 10 AM, two
individual assessors, blinded to the chemical treatments, independently examined each larva for mortality,
hatching status, and malformations using an Olympus SZH10 stereo microscope. Mortality was defined as a lack
of heartbeat or presence of coagulation. Malformations were defined as uninflated swim bladder, craniofacial
defects, edema, spinal defects, decreased pigmentation, abnormal position in water column, tail defects, or
blood pooling. If more than 15% of the negative control larvae were abnormal (i.e., dead, not hatched, or
malformed) or if the positive control larvae were not at least 50% abnormal, that plate was not used for any
analysis. Over the course of the experiment there were no plates that needed to be removed from analysis
based on the criteria above. Additionally, the overall rate of normal animals in the controls for the entire study
was 97%. After all plates were assessed, the larvae were anaesthetized using cold shock, and then euthanized
with cold 20% bleach solution

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 16
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of counts of larvae observed at each of the
twelve endpoints: living (dead or alive), hatching (hatched or unhatched), swim bladder (non-inflation),
craniofacial defects (dysmorphology of the head or eyes), edema, spinal defects (curved spine), pigmentation,
position (in water column, either persistent lying on side or upside down), tail defects (e.g., kinks), or blood
pooling. Each assessor also assigned every larva a general ranking for overall condition: normal (no defects
present), abnormal (defects present), or severely abnormal (life-threatening defects present). Observations
were recorded for each chemical and concentration. If any of these defects were present in a larva, an additional
endpoint "any" was set to 1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

100 nM
Key positive control:

Chlorpyrifos

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For endpoint-chemical per concentration index, counts were aggregated to a percentage, called
endpoint scores, with dead larvae excluded. For example, if there were 5 (out of 6) living larvae and 2 had edema
at a tested concentration, the edema score would equal 2/5 * 100 = 40%. No additional normalization was
performed, and outliers were not excluded. For each endpoint-chemical pair, the concentration-response series
was fit to 5 bounded models (constant, hill, gain-loss, exponential 4-5) with the winning model selected by the
lowest Akaike Information Criteria (AIC) score, a statistical calculation to compare model quality. Unbounded
models available in tcpl were excluded given the dichotomous nature of observations. To estimate activity, a
cutoff threshold was set at 16% and a continuous hit call (hitc) value was determined as the product of the
following components: 1) at least one median response greater than the assay cutoff threshold, 2) the maximal
efficacy in the fitted response is larger than the assay cutoff, 3) the AIC score of the winning model is less than
the constant model. Classification criteria for continuous hit calls were set in line with other in vitro screening
efforts as: hitc = 0 as negative, 0 < hitc < 0.9 as equivocal, hitc > 0.9 as positive. In addition to estimated activity
concentrations inducing a specified level of responses (e.g. 10%, 50%, etc.), a benchmark concentration (BMC)
was also derived in tcpl using a specified benchmark response level (BMR) of 1.349 times the standard derivation
of baseline (10%). Given lack of baseline variability given dichotomous observations, the baseline median
absolute deviation was set as 5%.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero

-------
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative
analysis direction. Typically used for endpoints where only positive responses are biologically relevant.),
29: pcl6 (Add a cutoff value of 16. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 185	Number of chemicals tested: 182

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: Oihitc 0.9
185

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

0
0

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

-------
quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

3

97

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

0

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 97.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3219

CCTE_Padilla_ZF_Score.Position

1.	General Information

1.1	Assay Title: CCTE's 144 Hour Post-fertilization Zebrafish Assay Score for Position, Padilla Lab

1.2	Assay Summary: CCTE_Padilla_ZF_Phenotype_Score is a whole embryo, multiplexed assay using zebrafish ova
exposed for 144 hours post fertilization on a 96-well plate. CCTE_Padilla_ZF_Score.Position is one of one assay
component(s) measured or calculated from the CCTE_Padilla_ZF_Phenotype_Score assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component CCTE_Padilla_ZF_Score.Position was analyzed into 1 assay endpoints.
This assay endpoint, CCTE_Padilla_ZF_Score.Position, was analyzed in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of viability reporter, gain-of-
signal activity can be used to understand changes in developmental as they relate to the whole embryo. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is embryonic mobility.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Padilla laboratory at the EPA Center for Computational Toxicology focuses on the
development and screening of zebrafish assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; larva were assessed under a dissection microscope (Olympus
SZH10 Research Stereo).

1.9	Assay Throughput: 96-well plate. Zebrafish embryos plated on 96-well plates and exposed from 6 hours post-
fertilization to 144 hours post-fertilization to ToxCast chemicals.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in

-------
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Wild-type zebrafish (Danio rerio) embryos placed 1
embryo per well in a 96-well plate. All research and breeding procedures in this study were reviewed and
approved by the Office of Research and Development's Health Institutional Animal Care and Use Committee
(IACUC) at the U.S. EPA in Research Triangle Park, NC (Protocol 21-08-003; approved Aug. 8, 2018 and Protocol
24-09-002; approved Sept. 2, 2021). The animal facility is an internationally accredited Association for
Assessment and Accreditation of Laboratory Animal Care (AAALAC) facility (Unit 000509). The parental fish were
wild-type adult zebrafish (Danio rerio) descended from an undefined outbred stock originally supplied by both
Aquatic Research Organisms, Hampton, NH, and EkkWill Waterlife Resources, Ruskin, FL The adult zebrafish
were maintained at a density of 7 fish/L in 3.5-liter tanks and housed in recirculating zebrafish housing racks
(Tecniplast USA, West Chester, PA) with reverse osmosis purified tap water (Durham, NC) which was buffered
with Instant Ocean Sea Salt (Spectrum Brands, Blacksburg, VA) and sodium bicarbonate (Church & Dwight, Co.,
Ewing, NJ). The water was maintained at 28C, pH 7.4, and conductivity (1000 uS/cm), with ammonia and nitrite
(maintained at 0 ppm) and nitrate (allowed in insignificant amounts). The fish were fed twice a day with
decapsulated artemia (E-Z Egg; Brine Shrimp Direct, Ogden, UT) and Gemma Micro 300 formulated diet
(Skretting, Westbrook, ME). Housing rooms were illuminated on a 14:10 h light: dark cycle (lights on at 07:00 h).
For embryo production, groups of approximately 150 same age mixed sex zebrafish (ages ranging from 3 to 15
months old) were moved into 16-L on rack recirculating spawning tanks (Z-Park tanks, Tecniplast USA, West
Chester, PA) about one week before embryos were needed. Then on the afternoon before embryos were
needed, mesh spawning platform inserts were added. Embryos were collected the following morning
approximately 45 minutes after the room lights came on (07:45 h) and were maintained at 28C for 1 to 2 hours
until washing.

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from wild type adult zebrafish (undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842). Zebrafish are a good model in which to study
metabolism because they possess all the key organs required for metabolic control in humans, from the appetite
circuits that are present in the hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle
and white adipose tissue (WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Wild type adult zebrafish (Danio rerio; undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842) were housed in an AAALAC-approved animal
facility at 28C with a 14:10 h light: dark cycle (lights on at 08:30 h). Adult fish (2-3 females per male; density =
15-20 adults per tank) were kept in one of several 9-liter (L) flow-through colony tanks (Aquaneering Inc., San
Diego, CA). All adults in a colony tank were placed in a 2 L (static) breeding tank (Aquatic Habitats, Apopka, FL)
one hour prior to light onset. Typically, adults from two to three colony tanks were mated on the same day. Two
hours after light onset the adults were returned to the colony tank. All embryos were gathered from each
breeder tank, pooled, and placed in a 28C water bath for 2 h, followed by two washes with 0.06% bleach (v/v)
in 10% Hanks' Balanced Salt Solution (13.7 mM NaCI 0.54 mM KCI, 25 uM Na2HP04, 130 uM CaCI2 100 um
MgS04and 420 um NaHC03,), hereafter referred to as Hanks' solution, for 5 min in order to remove any residual
bacteria or fungi. Immediately following washing, the embryos were examined, and healthy embryos were
separated from dead or unfertilized eggs and moved into fresh Hanks'. Chemical exposures: Zebrafish embryos
were exposed in 96-well plates. Between 6- and 8-hours post fertilization (hpf), healthy embryos (with chorions)
were transferred, one embryo per well, into 96 well (0.5 mL) microtiter plates (Cell Culture-Treated, Flat-Bottom
Microplate 96 well [Corning Costar, Kennebunk, ME; Cat 09-761-145]) filled with 200 uL of Hanks' solution. A
random number generator was used to assign the order in which the rows of embryos were plated in 96 well
plates each week. After plating, the embryos were dosed with 0.8 uL of the chemical dosing solution then
immediately sealed with AlumaSeal II (Excel Scientific Inc, Victorville, CA) to prevent volatilization of the

-------
chemicals, and then placed in a leakproof secondary container. These containers were placed in an incubator
maintained on a 14:10 h light: dark cycle at 26C for rearing. On 5 dpf, live larvae were gently moved out of the
plate with test chemicals and into a new 96 well mesh plate (Millipore Corp., Bedford, MA) with just the Hanks'
solution. Once within the mesh well plate, the animals were rinsed with 400 uL of fresh Hanks' three times, and
then the plates were covered with a non-adhesive material (Microseal A, BioRad, Hercules, CA), plate lid added,
and then plate sides wrapped in Parafilm (PM992, Bermis Company, Neenah, Wl), and returned to the incubator.
This rinse on 5 dpf was conducted to lessen the possible exposure of the human assessors to the chemicals
during detailed examination of each larva at 6 dpf. Larval assessments: On 6 dpf between 8 and 10 AM, two
individual assessors, blinded to the chemical treatments, independently examined each larva for mortality,
hatching status, and malformations using an Olympus SZH10 stereo microscope. Mortality was defined as a lack
of heartbeat or presence of coagulation. Malformations were defined as uninflated swim bladder, craniofacial
defects, edema, spinal defects, decreased pigmentation, abnormal position in water column, tail defects, or
blood pooling. If more than 15% of the negative control larvae were abnormal (i.e., dead, not hatched, or
malformed) or if the positive control larvae were not at least 50% abnormal, that plate was not used for any
analysis. Over the course of the experiment there were no plates that needed to be removed from analysis
based on the criteria above. Additionally, the overall rate of normal animals in the controls for the entire study
was 97%. After all plates were assessed, the larvae were anaesthetized using cold shock, and then euthanized
with cold 20% bleach solution

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 16
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of counts of larvae observed at each of the
twelve endpoints: living (dead or alive), hatching (hatched or unhatched), swim bladder (non-inflation),
craniofacial defects (dysmorphology of the head or eyes), edema, spinal defects (curved spine), pigmentation,
position (in water column, either persistent lying on side or upside down), tail defects (e.g., kinks), or blood
pooling. Each assessor also assigned every larva a general ranking for overall condition: normal (no defects
present), abnormal (defects present), or severely abnormal (life-threatening defects present). Observations
were recorded for each chemical and concentration. If any of these defects were present in a larva, an additional
endpoint "any" was set to 1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

100 nM
Key positive control:

Chlorpyrifos

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For endpoint-chemical per concentration index, counts were aggregated to a percentage, called
endpoint scores, with dead larvae excluded. For example, if there were 5 (out of 6) living larvae and 2 had edema
at a tested concentration, the edema score would equal 2/5 * 100 = 40%. No additional normalization was
performed, and outliers were not excluded. For each endpoint-chemical pair, the concentration-response series
was fit to 5 bounded models (constant, hill, gain-loss, exponential 4-5) with the winning model selected by the
lowest Akaike Information Criteria (AIC) score, a statistical calculation to compare model quality. Unbounded
models available in tcpl were excluded given the dichotomous nature of observations. To estimate activity, a
cutoff threshold was set at 16% and a continuous hit call (hitc) value was determined as the product of the
following components: 1) at least one median response greater than the assay cutoff threshold, 2) the maximal
efficacy in the fitted response is larger than the assay cutoff, 3) the AIC score of the winning model is less than
the constant model. Classification criteria for continuous hit calls were set in line with other in vitro screening
efforts as: hitc = 0 as negative, 0 < hitc < 0.9 as equivocal, hitc > 0.9 as positive. In addition to estimated activity
concentrations inducing a specified level of responses (e.g. 10%, 50%, etc.), a benchmark concentration (BMC)
was also derived in tcpl using a specified benchmark response level (BMR) of 1.349 times the standard derivation
of baseline (10%). Given lack of baseline variability given dichotomous observations, the baseline median
absolute deviation was set as 5%.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero

-------
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative
analysis direction. Typically used for endpoints where only positive responses are biologically relevant.),
29: pcl6 (Add a cutoff value of 16. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 185	Number of chemicals tested: 182

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
18

Inactive hit count: Oihitc 0.9
167

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

8
6

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

-------
quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

29

57

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

0

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 57.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3220

CCTE_Padilla_ZF_ScoreIail

1.	General Information

1.1	Assay Title: CCTE's 144 Hour Post-fertilization Zebrafish Assay Score for Tail Malformation, Padilla Lab

1.2	Assay Summary: CCTE_Padilla_ZF_Phenotype_Score is a whole embryo, multiplexed assay using zebrafish ova
exposed for 144 hours post fertilization on a 96-well plate. CCTE_Padilla_ZF_Score.Tail is one of one assay
component(s) measured or calculated from the CCTE_Padilla_ZF_Phenotype_Score assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component CCTE_Padilla_ZF_Score.Tail was analyzed into 1 assay endpoints. This
assay endpoint, CCTE_Padilla_ZF_Score.Tail, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of viability reporter, gain-of-signal activity
can be used to understand changes in developmental as they relate to the whole embryo. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the zebrafish development
intended target family, where the subfamily is lower trunk morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Padilla laboratory at the EPA Center for Computational Toxicology focuses on the
development and screening of zebrafish assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; larva were assessed under a dissection microscope (Olympus
SZH10 Research Stereo).

1.9	Assay Throughput: 96-well plate. Zebrafish embryos plated on 96-well plates and exposed from 6 hours post-
fertilization to 144 hours post-fertilization to ToxCast chemicals.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in

-------
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Wild-type zebrafish (Danio rerio) embryos placed 1
embryo per well in a 96-well plate. All research and breeding procedures in this study were reviewed and
approved by the Office of Research and Development's Health Institutional Animal Care and Use Committee
(IACUC) at the U.S. EPA in Research Triangle Park, NC (Protocol 21-08-003; approved Aug. 8, 2018 and Protocol
24-09-002; approved Sept. 2, 2021). The animal facility is an internationally accredited Association for
Assessment and Accreditation of Laboratory Animal Care (AAALAC) facility (Unit 000509). The parental fish were
wild-type adult zebrafish (Danio rerio) descended from an undefined outbred stock originally supplied by both
Aquatic Research Organisms, Hampton, NH, and EkkWill Waterlife Resources, Ruskin, FL The adult zebrafish
were maintained at a density of 7 fish/L in 3.5-liter tanks and housed in recirculating zebrafish housing racks
(Tecniplast USA, West Chester, PA) with reverse osmosis purified tap water (Durham, NC) which was buffered
with Instant Ocean Sea Salt (Spectrum Brands, Blacksburg, VA) and sodium bicarbonate (Church & Dwight, Co.,
Ewing, NJ). The water was maintained at 28C, pH 7.4, and conductivity (1000 uS/cm), with ammonia and nitrite
(maintained at 0 ppm) and nitrate (allowed in insignificant amounts). The fish were fed twice a day with
decapsulated artemia (E-Z Egg; Brine Shrimp Direct, Ogden, UT) and Gemma Micro 300 formulated diet
(Skretting, Westbrook, ME). Housing rooms were illuminated on a 14:10 h light: dark cycle (lights on at 07:00 h).
For embryo production, groups of approximately 150 same age mixed sex zebrafish (ages ranging from 3 to 15
months old) were moved into 16-L on rack recirculating spawning tanks (Z-Park tanks, Tecniplast USA, West
Chester, PA) about one week before embryos were needed. Then on the afternoon before embryos were
needed, mesh spawning platform inserts were added. Embryos were collected the following morning
approximately 45 minutes after the room lights came on (07:45 h) and were maintained at 28C for 1 to 2 hours
until washing.

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from wild type adult zebrafish (undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842). Zebrafish are a good model in which to study
metabolism because they possess all the key organs required for metabolic control in humans, from the appetite
circuits that are present in the hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle
and white adipose tissue (WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Wild type adult zebrafish (Danio rerio; undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842) were housed in an AAALAC-approved animal
facility at 28C with a 14:10 h light: dark cycle (lights on at 08:30 h). Adult fish (2-3 females per male; density =
15-20 adults per tank) were kept in one of several 9-liter (L) flow-through colony tanks (Aquaneering Inc., San
Diego, CA). All adults in a colony tank were placed in a 2 L (static) breeding tank (Aquatic Habitats, Apopka, FL)
one hour prior to light onset. Typically, adults from two to three colony tanks were mated on the same day. Two
hours after light onset the adults were returned to the colony tank. All embryos were gathered from each
breeder tank, pooled, and placed in a 28C water bath for 2 h, followed by two washes with 0.06% bleach (v/v)
in 10% Hanks' Balanced Salt Solution (13.7 mM NaCI 0.54 mM KCI, 25 uM Na2HP04, 130 uM CaCI2 100 um
MgS04and 420 um NaHC03,), hereafter referred to as Hanks' solution, for 5 min in order to remove any residual
bacteria or fungi. Immediately following washing, the embryos were examined, and healthy embryos were
separated from dead or unfertilized eggs and moved into fresh Hanks'. Chemical exposures: Zebrafish embryos
were exposed in 96-well plates. Between 6- and 8-hours post fertilization (hpf), healthy embryos (with chorions)
were transferred, one embryo per well, into 96 well (0.5 mL) microtiter plates (Cell Culture-Treated, Flat-Bottom
Microplate 96 well [Corning Costar, Kennebunk, ME; Cat 09-761-145]) filled with 200 uL of Hanks' solution. A
random number generator was used to assign the order in which the rows of embryos were plated in 96 well
plates each week. After plating, the embryos were dosed with 0.8 uL of the chemical dosing solution then
immediately sealed with AlumaSeal II (Excel Scientific Inc, Victorville, CA) to prevent volatilization of the

-------
chemicals, and then placed in a leakproof secondary container. These containers were placed in an incubator
maintained on a 14:10 h light: dark cycle at 26C for rearing. On 5 dpf, live larvae were gently moved out of the
plate with test chemicals and into a new 96 well mesh plate (Millipore Corp., Bedford, MA) with just the Hanks'
solution. Once within the mesh well plate, the animals were rinsed with 400 uL of fresh Hanks' three times, and
then the plates were covered with a non-adhesive material (Microseal A, BioRad, Hercules, CA), plate lid added,
and then plate sides wrapped in Parafilm (PM992, Bermis Company, Neenah, Wl), and returned to the incubator.
This rinse on 5 dpf was conducted to lessen the possible exposure of the human assessors to the chemicals
during detailed examination of each larva at 6 dpf. Larval assessments: On 6 dpf between 8 and 10 AM, two
individual assessors, blinded to the chemical treatments, independently examined each larva for mortality,
hatching status, and malformations using an Olympus SZH10 stereo microscope. Mortality was defined as a lack
of heartbeat or presence of coagulation. Malformations were defined as uninflated swim bladder, craniofacial
defects, edema, spinal defects, decreased pigmentation, abnormal position in water column, tail defects, or
blood pooling. If more than 15% of the negative control larvae were abnormal (i.e., dead, not hatched, or
malformed) or if the positive control larvae were not at least 50% abnormal, that plate was not used for any
analysis. Over the course of the experiment there were no plates that needed to be removed from analysis
based on the criteria above. Additionally, the overall rate of normal animals in the controls for the entire study
was 97%. After all plates were assessed, the larvae were anaesthetized using cold shock, and then euthanized
with cold 20% bleach solution

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 16
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of counts of larvae observed at each of the
twelve endpoints: living (dead or alive), hatching (hatched or unhatched), swim bladder (non-inflation),
craniofacial defects (dysmorphology of the head or eyes), edema, spinal defects (curved spine), pigmentation,
position (in water column, either persistent lying on side or upside down), tail defects (e.g., kinks), or blood
pooling. Each assessor also assigned every larva a general ranking for overall condition: normal (no defects
present), abnormal (defects present), or severely abnormal (life-threatening defects present). Observations
were recorded for each chemical and concentration. If any of these defects were present in a larva, an additional
endpoint "any" was set to 1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

100 nM
Key positive control:

Chlorpyrifos

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For endpoint-chemical per concentration index, counts were aggregated to a percentage, called
endpoint scores, with dead larvae excluded. For example, if there were 5 (out of 6) living larvae and 2 had edema
at a tested concentration, the edema score would equal 2/5 * 100 = 40%. No additional normalization was
performed, and outliers were not excluded. For each endpoint-chemical pair, the concentration-response series
was fit to 5 bounded models (constant, hill, gain-loss, exponential 4-5) with the winning model selected by the
lowest Akaike Information Criteria (AIC) score, a statistical calculation to compare model quality. Unbounded
models available in tcpl were excluded given the dichotomous nature of observations. To estimate activity, a
cutoff threshold was set at 16% and a continuous hit call (hitc) value was determined as the product of the
following components: 1) at least one median response greater than the assay cutoff threshold, 2) the maximal
efficacy in the fitted response is larger than the assay cutoff, 3) the AIC score of the winning model is less than
the constant model. Classification criteria for continuous hit calls were set in line with other in vitro screening
efforts as: hitc = 0 as negative, 0 < hitc < 0.9 as equivocal, hitc > 0.9 as positive. In addition to estimated activity
concentrations inducing a specified level of responses (e.g. 10%, 50%, etc.), a benchmark concentration (BMC)
was also derived in tcpl using a specified benchmark response level (BMR) of 1.349 times the standard derivation
of baseline (10%). Given lack of baseline variability given dichotomous observations, the baseline median
absolute deviation was set as 5%.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero

-------
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative
analysis direction. Typically used for endpoints where only positive responses are biologically relevant.),
29: pcl6 (Add a cutoff value of 16. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 185	Number of chemicals tested: 182

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
5

Inactive hit count: Oihitc 0.9
180

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

4
2

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

-------
quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

33

61

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

0

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 61.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3221

CCTE_Padi lla_ZF_Score. Blood_pooli ng

1.	General Information

1.1	Assay Title: CCTE's 144 Hour Post-fertilization Zebrafish Assay Score for Blood Pooling, Padilla Lab

1.2	Assay Summary: CCTE_Padilla_ZF_Phenotype_Score is a whole embryo, multiplexed assay using zebrafish ova
exposed for 144 hours post fertilization on a 96-well plate. CCTE_Padilla_ZF_Score.Blood_pooling is one of one
assay component(s) measured or calculated from the CCTE_Padilla_ZF_Phenotype_Score assay. It is designed
to make measurements of zebrafish development as detected with brightfield microscopy of developing
zebrafish embryos. Data from the assay component CCTE_Padilla_ZF_Score.Blood_pooling was analyzed into 1
assay endpoints. This assay endpoint, CCTE_Padilla_ZF_Score.Blood_pooling, was analyzed in the positive
analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of viability
reporter, gain-of-signal activity can be used to understand changes in developmental as they relate to the
whole embryo. To generalize the intended target to other relatable targets, this assay endpoint is annotated to
the zebrafish development intended target family, where the subfamily is swim bladder morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Padilla laboratory at the EPA Center for Computational Toxicology focuses on the
development and screening of zebrafish assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; larva were assessed under a dissection microscope (Olympus
SZH10 Research Stereo).

1.9	Assay Throughput: 96-well plate. Zebrafish embryos plated on 96-well plates and exposed from 6 hours post-
fertilization to 144 hours post-fertilization to ToxCast chemicals.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in

-------
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Wild-type zebrafish (Danio rerio) embryos placed 1
embryo per well in a 96-well plate. All research and breeding procedures in this study were reviewed and
approved by the Office of Research and Development's Health Institutional Animal Care and Use Committee
(IACUC) at the U.S. EPA in Research Triangle Park, NC (Protocol 21-08-003; approved Aug. 8, 2018 and Protocol
24-09-002; approved Sept. 2, 2021). The animal facility is an internationally accredited Association for
Assessment and Accreditation of Laboratory Animal Care (AAALAC) facility (Unit 000509). The parental fish were
wild-type adult zebrafish (Danio rerio) descended from an undefined outbred stock originally supplied by both
Aquatic Research Organisms, Hampton, NH, and EkkWill Waterlife Resources, Ruskin, FL The adult zebrafish
were maintained at a density of 7 fish/L in 3.5-liter tanks and housed in recirculating zebrafish housing racks
(Tecniplast USA, West Chester, PA) with reverse osmosis purified tap water (Durham, NC) which was buffered
with Instant Ocean Sea Salt (Spectrum Brands, Blacksburg, VA) and sodium bicarbonate (Church & Dwight, Co.,
Ewing, NJ). The water was maintained at 28C, pH 7.4, and conductivity (1000 uS/cm), with ammonia and nitrite
(maintained at 0 ppm) and nitrate (allowed in insignificant amounts). The fish were fed twice a day with
decapsulated artemia (E-Z Egg; Brine Shrimp Direct, Ogden, UT) and Gemma Micro 300 formulated diet
(Skretting, Westbrook, ME). Housing rooms were illuminated on a 14:10 h light: dark cycle (lights on at 07:00 h).
For embryo production, groups of approximately 150 same age mixed sex zebrafish (ages ranging from 3 to 15
months old) were moved into 16-L on rack recirculating spawning tanks (Z-Park tanks, Tecniplast USA, West
Chester, PA) about one week before embryos were needed. Then on the afternoon before embryos were
needed, mesh spawning platform inserts were added. Embryos were collected the following morning
approximately 45 minutes after the room lights came on (07:45 h) and were maintained at 28C for 1 to 2 hours
until washing.

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from wild type adult zebrafish (undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842). Zebrafish are a good model in which to study
metabolism because they possess all the key organs required for metabolic control in humans, from the appetite
circuits that are present in the hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle
and white adipose tissue (WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Wild type adult zebrafish (Danio rerio; undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842) were housed in an AAALAC-approved animal
facility at 28C with a 14:10 h light: dark cycle (lights on at 08:30 h). Adult fish (2-3 females per male; density =
15-20 adults per tank) were kept in one of several 9-liter (L) flow-through colony tanks (Aquaneering Inc., San
Diego, CA). All adults in a colony tank were placed in a 2 L (static) breeding tank (Aquatic Habitats, Apopka, FL)
one hour prior to light onset. Typically, adults from two to three colony tanks were mated on the same day. Two
hours after light onset the adults were returned to the colony tank. All embryos were gathered from each
breeder tank, pooled, and placed in a 28C water bath for 2 h, followed by two washes with 0.06% bleach (v/v)
in 10% Hanks' Balanced Salt Solution (13.7 mM NaCI 0.54 mM KCI, 25 uM Na2HP04, 130 uM CaCI2 100 um
MgS04and 420 um NaHC03,), hereafter referred to as Hanks' solution, for 5 min in order to remove any residual
bacteria or fungi. Immediately following washing, the embryos were examined, and healthy embryos were
separated from dead or unfertilized eggs and moved into fresh Hanks'. Chemical exposures: Zebrafish embryos
were exposed in 96-well plates. Between 6- and 8-hours post fertilization (hpf), healthy embryos (with chorions)
were transferred, one embryo per well, into 96 well (0.5 mL) microtiter plates (Cell Culture-Treated, Flat-Bottom
Microplate 96 well [Corning Costar, Kennebunk, ME; Cat 09-761-145]) filled with 200 uL of Hanks' solution. A
random number generator was used to assign the order in which the rows of embryos were plated in 96 well
plates each week. After plating, the embryos were dosed with 0.8 uL of the chemical dosing solution then
immediately sealed with AlumaSeal II (Excel Scientific Inc, Victorville, CA) to prevent volatilization of the

-------
chemicals, and then placed in a leakproof secondary container. These containers were placed in an incubator
maintained on a 14:10 h light: dark cycle at 26C for rearing. On 5 dpf, live larvae were gently moved out of the
plate with test chemicals and into a new 96 well mesh plate (Millipore Corp., Bedford, MA) with just the Hanks'
solution. Once within the mesh well plate, the animals were rinsed with 400 uL of fresh Hanks' three times, and
then the plates were covered with a non-adhesive material (Microseal A, BioRad, Hercules, CA), plate lid added,
and then plate sides wrapped in Parafilm (PM992, Bermis Company, Neenah, Wl), and returned to the incubator.
This rinse on 5 dpf was conducted to lessen the possible exposure of the human assessors to the chemicals
during detailed examination of each larva at 6 dpf. Larval assessments: On 6 dpf between 8 and 10 AM, two
individual assessors, blinded to the chemical treatments, independently examined each larva for mortality,
hatching status, and malformations using an Olympus SZH10 stereo microscope. Mortality was defined as a lack
of heartbeat or presence of coagulation. Malformations were defined as uninflated swim bladder, craniofacial
defects, edema, spinal defects, decreased pigmentation, abnormal position in water column, tail defects, or
blood pooling. If more than 15% of the negative control larvae were abnormal (i.e., dead, not hatched, or
malformed) or if the positive control larvae were not at least 50% abnormal, that plate was not used for any
analysis. Over the course of the experiment there were no plates that needed to be removed from analysis
based on the criteria above. Additionally, the overall rate of normal animals in the controls for the entire study
was 97%. After all plates were assessed, the larvae were anaesthetized using cold shock, and then euthanized
with cold 20% bleach solution

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 16
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of counts of larvae observed at each of the
twelve endpoints: living (dead or alive), hatching (hatched or unhatched), swim bladder (non-inflation),
craniofacial defects (dysmorphology of the head or eyes), edema, spinal defects (curved spine), pigmentation,
position (in water column, either persistent lying on side or upside down), tail defects (e.g., kinks), or blood
pooling. Each assessor also assigned every larva a general ranking for overall condition: normal (no defects
present), abnormal (defects present), or severely abnormal (life-threatening defects present). Observations
were recorded for each chemical and concentration. If any of these defects were present in a larva, an additional
endpoint "any" was set to 1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

100 nM
Key positive control:

Chlorpyrifos

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For endpoint-chemical per concentration index, counts were aggregated to a percentage, called
endpoint scores, with dead larvae excluded. For example, if there were 5 (out of 6) living larvae and 2 had edema
at a tested concentration, the edema score would equal 2/5 * 100 = 40%. No additional normalization was
performed, and outliers were not excluded. For each endpoint-chemical pair, the concentration-response series
was fit to 5 bounded models (constant, hill, gain-loss, exponential 4-5) with the winning model selected by the
lowest Akaike Information Criteria (AIC) score, a statistical calculation to compare model quality. Unbounded
models available in tcpl were excluded given the dichotomous nature of observations. To estimate activity, a
cutoff threshold was set at 16% and a continuous hit call (hitc) value was determined as the product of the
following components: 1) at least one median response greater than the assay cutoff threshold, 2) the maximal
efficacy in the fitted response is larger than the assay cutoff, 3) the AIC score of the winning model is less than
the constant model. Classification criteria for continuous hit calls were set in line with other in vitro screening
efforts as: hitc = 0 as negative, 0 < hitc < 0.9 as equivocal, hitc > 0.9 as positive. In addition to estimated activity
concentrations inducing a specified level of responses (e.g. 10%, 50%, etc.), a benchmark concentration (BMC)
was also derived in tcpl using a specified benchmark response level (BMR) of 1.349 times the standard derivation
of baseline (10%). Given lack of baseline variability given dichotomous observations, the baseline median
absolute deviation was set as 5%.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero

-------
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative
analysis direction. Typically used for endpoints where only positive responses are biologically relevant.),
29: pcl6 (Add a cutoff value of 16. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 185	Number of chemicals tested: 182

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
7

Inactive hit count: Oihitc 0.9
178

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

5
3

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

-------
quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

30

62

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

0

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 62.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3222

CCTE_Padilla_ZF_Score.Any

1.	General Information

1.1	Assay Title: CCTE's 144 Hour Post-fertilization Zebrafish Assay Score for Any Mortality or Malformation, Padilla
Lab

1.2	Assay Summary: CCTE_Padilla_ZF_Phenotype_Score is a whole embryo, multiplexed assay using zebrafish ova
exposed for 144 hours post fertilization on a 96-well plate. CCTE_Padilla_ZF_Score.Any is one of one assay
component(s) measured or calculated from the CCTE_Padilla_ZF_Phenotype_Score assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component CCTE_Padilla_ZF_Score.Any was analyzed into 1 assay endpoints. This
assay endpoint, CCTE_Padilla_ZF_Score.Any, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of viability reporter, gain-of-signal activity
can be used to understand changes in developmental as they relate to the whole embryo. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the zebrafish development
intended target family, where the subfamily is embryonic morphogenesis and mortality.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Padilla laboratory at the EPA Center for Computational Toxicology focuses on the
development and screening of zebrafish assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; larva were assessed under a dissection microscope (Olympus
SZH10 Research Stereo).

1.9	Assay Throughput: 96-well plate. Zebrafish embryos plated on 96-well plates and exposed from 6 hours post-
fertilization to 144 hours post-fertilization to ToxCast chemicals.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: An algorithm was used to combine results from 12 CCTE_Padilla_ZF_score assay components.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2

Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Wild-type zebrafish (Danio rerio) embryos placed 1
embryo per well in a 96-well plate. All research and breeding procedures in this study were reviewed and
approved by the Office of Research and Development's Health Institutional Animal Care and Use Committee
(IACUC) at the U.S. EPA in Research Triangle Park, NC (Protocol 21-08-003; approved Aug. 8, 2018 and Protocol
24-09-002; approved Sept. 2, 2021). The animal facility is an internationally accredited Association for
Assessment and Accreditation of Laboratory Animal Care (AAALAC) facility (Unit 000509). The parental fish were
wild-type adult zebrafish (Danio rerio) descended from an undefined outbred stock originally supplied by both
Aquatic Research Organisms, Hampton, NH, and EkkWill Waterlife Resources, Ruskin, FL The adult zebrafish
were maintained at a density of 7 fish/L in 3.5-liter tanks and housed in recirculating zebrafish housing racks
(Tecniplast USA, West Chester, PA) with reverse osmosis purified tap water (Durham, NC) which was buffered
with Instant Ocean Sea Salt (Spectrum Brands, Blacksburg, VA) and sodium bicarbonate (Church & Dwight, Co.,
Ewing, NJ). The water was maintained at 28C, pH 7.4, and conductivity (1000 uS/cm), with ammonia and nitrite
(maintained at 0 ppm) and nitrate (allowed in insignificant amounts). The fish were fed twice a day with
decapsulated artemia (E-Z Egg; Brine Shrimp Direct, Ogden, UT) and Gemma Micro 300 formulated diet
(Skretting, Westbrook, ME). Housing rooms were illuminated on a 14:10 h light: dark cycle (lights on at 07:00 h).
For embryo production, groups of approximately 150 same age mixed sex zebrafish (ages ranging from 3 to 15
months old) were moved into 16-L on rack recirculating spawning tanks (Z-Park tanks, Tecniplast USA, West
Chester, PA) about one week before embryos were needed. Then on the afternoon before embryos were
needed, mesh spawning platform inserts were added. Embryos were collected the following morning
approximately 45 minutes after the room lights came on (07:45 h) and were maintained at 28C for 1 to 2 hours
until washing.

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from wild type adult zebrafish (undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842). Zebrafish are a good model in which to study
metabolism because they possess all the key organs required for metabolic control in humans, from the appetite
circuits that are present in the hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle
and white adipose tissue (WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Wild type adult zebrafish (Danio rerio; undefined outbred stock
obtained from Aquatic Research Organisms, Hampton, NH, 03842) were housed in an AAALAC-approved animal
facility at 28C with a 14:10 h light: dark cycle (lights on at 08:30 h). Adult fish (2-3 females per male; density =
15-20 adults per tank) were kept in one of several 9-liter (L) flow-through colony tanks (Aquaneering Inc., San
Diego, CA). All adults in a colony tank were placed in a 2 L (static) breeding tank (Aquatic Habitats, Apopka, FL)
one hour prior to light onset. Typically, adults from two to three colony tanks were mated on the same day. Two
hours after light onset the adults were returned to the colony tank. All embryos were gathered from each
breeder tank, pooled, and placed in a 28C water bath for 2 h, followed by two washes with 0.06% bleach (v/v)
in 10% Hanks' Balanced Salt Solution (13.7 mM NaCI 0.54 mM KCI, 25 uM Na2HP04, 130 uM CaCI2 100 um
MgS04and 420 um NaHC03,), hereafter referred to as Hanks' solution, for 5 min in order to remove any residual
bacteria or fungi. Immediately following washing, the embryos were examined, and healthy embryos were
separated from dead or unfertilized eggs and moved into fresh Hanks'. Chemical exposures: Zebrafish embryos
were exposed in 96-well plates. Between 6- and 8-hours post fertilization (hpf), healthy embryos (with chorions)
were transferred, one embryo per well, into 96 well (0.5 mL) microtiter plates (Cell Culture-Treated, Flat-Bottom
Microplate 96 well [Corning Costar, Kennebunk, ME; Cat 09-761-145]) filled with 200 uL of Hanks' solution. A
random number generator was used to assign the order in which the rows of embryos were plated in 96 well
plates each week. After plating, the embryos were dosed with 0.8 uL of the chemical dosing solution then

-------
immediately sealed with AlumaSeal II (Excel Scientific Inc, Victorville, CA) to prevent volatilization of the
chemicals, and then placed in a leakproof secondary container. These containers were placed in an incubator
maintained on a 14:10 h light: dark cycle at 26C for rearing. On 5 dpf, live larvae were gently moved out of the
plate with test chemicals and into a new 96 well mesh plate (Millipore Corp., Bedford, MA) with just the Hanks'
solution. Once within the mesh well plate, the animals were rinsed with 400 uL of fresh Hanks' three times, and
then the plates were covered with a non-adhesive material (Microseal A, BioRad, Hercules, CA), plate lid added,
and then plate sides wrapped in Parafilm (PM992, Bermis Company, Neenah, Wl), and returned to the incubator.
This rinse on 5 dpf was conducted to lessen the possible exposure of the human assessors to the chemicals
during detailed examination of each larva at 6 dpf. Larval assessments: On 6 dpf between 8 and 10 AM, two
individual assessors, blinded to the chemical treatments, independently examined each larva for mortality,
hatching status, and malformations using an Olympus SZH10 stereo microscope. Mortality was defined as a lack
of heartbeat or presence of coagulation. Malformations were defined as uninflated swim bladder, craniofacial
defects, edema, spinal defects, decreased pigmentation, abnormal position in water column, tail defects, or
blood pooling. If more than 15% of the negative control larvae were abnormal (i.e., dead, not hatched, or
malformed) or if the positive control larvae were not at least 50% abnormal, that plate was not used for any
analysis. Over the course of the experiment there were no plates that needed to be removed from analysis
based on the criteria above. Additionally, the overall rate of normal animals in the controls for the entire study
was 97%. After all plates were assessed, the larvae were anaesthetized using cold shock, and then euthanized
with cold 20% bleach solution

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 16
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of counts of larvae observed at each of the
twelve endpoints: living (dead or alive), hatching (hatched or unhatched), swim bladder (non-inflation),
craniofacial defects (dysmorphology of the head or eyes), edema, spinal defects (curved spine), pigmentation,
position (in water column, either persistent lying on side or upside down), tail defects (e.g., kinks), or blood
pooling. Each assessor also assigned every larva a general ranking for overall condition: normal (no defects
present), abnormal (defects present), or severely abnormal (life-threatening defects present). Observations
were recorded for each chemical and concentration. If any of these defects were present in a larva, an additional
endpoint "any" was set to 1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

100 nM
Key positive control:

Chlorpyrifos

Target (nominal) number of replicates:

1

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For endpoint-chemical per concentration index, counts were aggregated to a percentage, called
endpoint scores, with dead larvae excluded. For example, if there were 5 (out of 6) living larvae and 2 had edema
at a tested concentration, the edema score would equal 2/5 * 100 = 40%. No additional normalization was
performed, and outliers were not excluded. For each endpoint-chemical pair, the concentration-response series
was fit to 5 bounded models (constant, hill, gain-loss, exponential 4-5) with the winning model selected by the
lowest Akaike Information Criteria (AIC) score, a statistical calculation to compare model quality. Unbounded
models available in tcpl were excluded given the dichotomous nature of observations. To estimate activity, a
cutoff threshold was set at 16% and a continuous hit call (hitc) value was determined as the product of the
following components: 1) at least one median response greater than the assay cutoff threshold, 2) the maximal
efficacy in the fitted response is larger than the assay cutoff, 3) the AIC score of the winning model is less than
the constant model. Classification criteria for continuous hit calls were set in line with other in vitro screening
efforts as: hitc = 0 as negative, 0 < hitc < 0.9 as equivocal, hitc > 0.9 as positive. In addition to estimated activity
concentrations inducing a specified level of responses (e.g. 10%, 50%, etc.), a benchmark concentration (BMC)
was also derived in tcpl using a specified benchmark response level (BMR) of 1.349 times the standard derivation
of baseline (10%). Given lack of baseline variability given dichotomous observations, the baseline median
absolute deviation was set as 5%.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero

-------
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the negative
analysis direction. Typically used for endpoints where only positive responses are biologically relevant.),
29: pcl6 (Add a cutoff value of 16. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 185	Number of chemicals tested: 182

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
60

Inactive hit count: Oihitc 0.9
125

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

16
7

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

-------
quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

48

29

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

0

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 48.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2454

CCTE_Shafer_M EA_acute_spike_n u mber

1.	General Information

1.1	Assay Title: CCTE's Neural Acute Assay for Total Number of Spikes, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_acute assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_acute assay uses primary cultures
of rat cortical neurons. As these cultures mature over time, neural networks develop functional activity and
form cohesive networks in which electrical activity can be highly coordinated. Baseline 40-minute recordings
are made on days post-plating 13 or 15, followed by a secondary 40-minute recording after chemical exposure.
CCTE_Shafer_MEA_acute_spike_number is a component of the CCTE_Shafer_MEA_acute assay. It measures the
total number of spikes (action potential firings) in a 40-minute recording in a microelectrode array (MEA) using
Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike detector. The recordings before
and after chemical treatment are used to calculate a percent change in the spike number in each well. Data
from the assay component CCTE_Shafer_MEA_acute_spike_number was analyzed into 1 assay endpoint. This
assay endpoint, CCTE_Shafer_MEA_acute_spike_number, was analyzed in the negative analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of functional reporter, gain or
loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: The number of spikes is a measure of general activity in the network. Changes in electrical activity
are indicative of effects on the spontaneous neural activity.

-------
The CCTE_Shafer_MEA_acute assay is designed to investigate changes in neural network activity in response to
chemical exposure in rat cortical neurons using a microelectrode array (MEA) technology. This endpoint
measures the total number of spikes observed across the well over the recording period.

2.2	Scientific Principles: Nervous system function is sensitive to chemical perturbation. The microelectrode arrays
(MEAs) offer an efficient approach to screen compounds for neuroactivity and distinguish between compound
effects on firing, bursting, and connectivity patterns.

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: Spontaneous activity in the cortical cultures was recorded using an Axion Biosystems Maestro
768 channel amplifier and Axion Integrated Studios (AxIS) vl.8 (or later) software. The amplifier recorded from
all channels simultaneously (gain = 1200 x; sampling rate = 12.5 kHz/channel): raw signals were filtered with a
Butterworth band-pass filter (300-5000 Hz), which filters out slower local field potentials leaving only fast
potentials, i.e., "spikes", arising from extracellular currents associated with action potentials (Pine 2006;
Wheeler and Nam 2011). On-line spike detection of filtered signals was conducted with the AxIS adaptive spike
detector, using a threshold of 8 x the root mean squared (rms) noise on each channel. Any electrodes with rms
noise levels greater than 5 nV were grounded (e.g., no data were recorded). Once grounded, an electrode was
grounded for the duration of the experiment. All recordings were conducted at 37 C. Wells were deemed usable
if on the day of the exposure > 10 electrodes were active (defined as > 5 spikes/min). On DIV 13 or 15, a
minimum of three wells from one cortical culture preparation were treated with each compound (0.03-40 nM).
Prior to recording baseline activity, each mwMEA plate was placed in the Maestro at 37 C and allowed to sit for
20 min while firing rates stabilized. Baseline activity (40 min) was recorded before the addition of each
compound. An additional 40 min of spontaneous activity was recorded in the presence of each compound.
Changes in network parameters relative to baseline were assessed following compound treatment.

Baseline median absolute deviation for the assay (bmad): 23.128
Response cutoff threshold used to determine hit calls: 69.384

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

3 nM
Key positive control:
tetrodotoxin

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

2.6 Response: This endpoint measures the total number of spikes over the duration of the analysis.

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neuroactivity.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw recordings were re-played and analyzed with the AxIS 2.3 Neural Statistics Compiler.
(Operating Procedure for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 49: pval.neg.100 (Calculate positive control value (pval) as -
100 for endpoints in the down direction; pval = -100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp

-------
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 525	Number of chemicals tested: 508

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
195

Inactive hit count: Oihitc 0.9
330

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

35
38

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

56

122

quadratic-polynomialfpoly2) model: 75

exponential-2 (exp2) model:

10

-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

49

3

130

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-35.927

Neutral control median absolute deviation, by plate: nmad

13.882

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-39.97%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

32.6
25.575

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	2.357

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 49.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2456

CCTE_Shafer_M EA_acute_fi ri ng_rate_mea n

1.	General Information

1.1	Assay Title: CCTE's Neural Acute Assay for Mean Firing Rate, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_acute assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_acute assay uses primary cultures
of rat cortical neurons. As these cultures mature over time, neural networks develop functional activity and
form cohesive networks in which electrical activity can be highly coordinated. Baseline 40-minute recordings
are made on days post-plating 13 or 15, followed by a secondary 40-minute recording after chemical exposure.
CCTE_Shafer_MEA_acute_firing_rate_mean is a component of the CCTE_Shafer_MEA_acute assay. It measures
the frequency number of spikes (action potential firings) as the total number of spikes divided by the length of
the recording in a microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the
AxIS adaptive spike detector. The firing rate is divided by the number of electrodes to report a well-mean value.
The recordings before and after chemical treatment are used to calculate a percent change in the mean firing
rate in each well. Data from the assay component CCTE_Shafer_MEA_acute_firing_rate_mean was analyzed
into 1 assay endpoint. This assay endpoint, CCTE_Shafer_MEA_acute_firing_rate_mean, was analyzed in the
negative analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of functional reporter, gain or loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: The mean firing rate is a measure of general activity in the network. Changes in electrical activity
are indicative of effects on the spontaneous neural activity.

-------
The CCTE_Shafer_MEA_acute assay is designed to investigate changes in neural network activity in response to
chemical exposure in rat cortical neurons using a microelectrode array (MEA) technology. This endpoint
measures the mean firing rate across all electrodes in the well.

2.2	Scientific Principles: Nervous system function is sensitive to chemical perturbation. The microelectrode arrays
(MEAs) offer an efficient approach to screen compounds for neuroactivity and distinguish between compound
effects on firing, bursting, and connectivity patterns.

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: Spontaneous activity in the cortical cultures was recorded using an Axion Biosystems Maestro
768 channel amplifier and Axion Integrated Studios (AxIS) vl.8 (or later) software. The amplifier recorded from
all channels simultaneously (gain = 1200 x; sampling rate = 12.5 kHz/channel): raw signals were filtered with a
Butterworth band-pass filter (300-5000 Hz), which filters out slower local field potentials leaving only fast
potentials, i.e., "spikes", arising from extracellular currents associated with action potentials (Pine 2006;
Wheeler and Nam 2011). On-line spike detection of filtered signals was conducted with the AxIS adaptive spike
detector, using a threshold of 8 x the root mean squared (rms) noise on each channel. Any electrodes with rms
noise levels greater than 5 nV were grounded (e.g., no data were recorded). Once grounded, an electrode was
grounded for the duration of the experiment. All recordings were conducted at 37 C. Wells were deemed usable
if on the day of the exposure > 10 electrodes were active (defined as > 5 spikes/min). On DIV 13 or 15, a
minimum of three wells from one cortical culture preparation were treated with each compound (0.03-40 nM).
Prior to recording baseline activity, each mwMEA plate was placed in the Maestro at 37 C and allowed to sit for
20 min while firing rates stabilized. Baseline activity (40 min) was recorded before the addition of each
compound. An additional 40 min of spontaneous activity was recorded in the presence of each compound.
Changes in network parameters relative to baseline were assessed following compound treatment.

Baseline median absolute deviation for the assay (bmad): 23.657
Response cutoff threshold used to determine hit calls: 70.97

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

3 nM
Key positive control:
tetrodotoxin

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

2.6 Response: This endpoint measures the total number of spikes divided by the duration of the analysis in Hz.

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neuroactivity.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw recordings were re-played and analyzed with the AxIS 2.3 Neural Statistics Compiler.
(Operating Procedure for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 49: pval.neg.100 (Calculate positive control value (pval) as -
100 for endpoints in the down direction; pval = -100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp

-------
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 525	Number of chemicals tested: 508

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
188

Inactive hit count: Oihitc 0.9
337

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

35
38

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

54

123

quadratic-polynomialfpoly2) model: 76

exponential-2 (exp2) model:

9

-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

45

2

136

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-35.914

Neutral control median absolute deviation, by plate: nmad

14.533

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-38.22%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

32.465
24.904

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	2.304

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 45.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2458

CCTE_Shafer_M EA_acute_bu rst_n umber

1.	General Information

1.1	Assay Title: CCTE's Neural Acute Assay for Single-Electrode Bursts, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_acute assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_acute assay uses primary cultures
of rat cortical neurons. As these cultures mature over time, neural networks develop functional activity and
form cohesive networks in which electrical activity can be highly coordinated. Baseline 40-minute recordings
are made on days post-plating 13 or 15, followed by a secondary 40-minute recording after chemical exposure.
CCTE_Shafer_MEA_acute_burst_number is a component of the CCTE_Shafer_MEA_acute assay. It measures the
total number of single-electrode bursts (temporally-clustered groups of action potential firing) during a
recording in a microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS
adaptive spike detector. The recordings before and after chemical treatment are used to calculate a percent
change in the burst number in each well. Data from the assay component
CCTE_Shafer_MEA_acute_burst_number was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Shafer_MEA_acute_burst_number, was analyzed in the negative analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of functional reporter, gain or loss-of-signal
activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: The number of bursts is a measure of general activity in the network. Changes in electrical activity
are indicative of effects on the spontaneous neural activity.

-------
The CCTE_Shafer_MEA_acute assay is designed to investigate changes in neural network activity in response to
chemical exposure in rat cortical neurons using a microelectrode array (MEA) technology.

2.2	Scientific Principles: Nervous system function is sensitive to chemical perturbation. The microelectrode arrays
(MEAs) offer an efficient approach to screen compounds for neuroactivity and distinguish between compound
effects on firing, bursting, and connectivity patterns.

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: Spontaneous activity in the cortical cultures was recorded using an Axion Biosystems Maestro
768 channel amplifier and Axion Integrated Studios (AxIS) vl.8 (or later) software. The amplifier recorded from
all channels simultaneously (gain = 1200 x; sampling rate = 12.5 kHz/channel): raw signals were filtered with a
Butterworth band-pass filter (300-5000 Hz), which filters out slower local field potentials leaving only fast
potentials, i.e., "spikes", arising from extracellular currents associated with action potentials (Pine 2006;
Wheeler and Nam 2011). On-line spike detection of filtered signals was conducted with the AxIS adaptive spike
detector, using a threshold of 8 x the root mean squared (rms) noise on each channel. Any electrodes with rms
noise levels greater than 5 nV were grounded (e.g., no data were recorded). Once grounded, an electrode was
grounded for the duration of the experiment. All recordings were conducted at 37 C. Wells were deemed usable
if on the day of the exposure > 10 electrodes were active (defined as > 5 spikes/min). On DIV 13 or 15, a
minimum of three wells from one cortical culture preparation were treated with each compound (0.03-40 nM).
Prior to recording baseline activity, each mwMEA plate was placed in the Maestro at 37 C and allowed to sit for
20 min while firing rates stabilized. Baseline activity (40 min) was recorded before the addition of each
compound. An additional 40 min of spontaneous activity was recorded in the presence of each compound.
Changes in network parameters relative to baseline were assessed following compound treatment.

Baseline median absolute deviation for the assay (bmad): 27.31
Response cutoff threshold used to determine hit calls: 81.929

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

3 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

2.6 Response: This endpoint measures the total number of single-electrode bursts over the duration of the analysis.
For a well, the total number of electrode bursts across all electrodes in a well is reported.

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neuroactivity.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw recordings were re-played and analyzed with the AxIS 2.3 Neural Statistics Compiler.
(Operating Procedure for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 49: pval.neg.100 (Calculate positive control value (pval) as -
100 for endpoints in the down direction; pval = -100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp

-------
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 525	Number of chemicals tested: 508

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
115

Inactive hit count: Oihitc 0.9
410

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

37
30

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

48

151

quadratic-polynomialfpoly2) model: 62

exponential-2 (exp2) model:

4

-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

46

3

137

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-36.445

Neutral control median absolute deviation, by plate: nmad

13.538

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-39.58%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

-23.961
18.762

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	0.338

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 46.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2460

CCTE_Shafer_MEA_acute_burst_duration_mean

1.	General Information

1.1	Assay Title: CCTE's Neural Acute Assay for Mean Burst Duration, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_acute assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_acute assay uses primary cultures
of rat cortical neurons. As these cultures mature over time, neural networks develop functional activity and
form cohesive networks in which electrical activity can be highly coordinated. Baseline 40-minute recordings
are made on days post-plating 13 or 15, followed by a secondary 40-minute recording after chemical exposure.
CCTE_Shafer_MEA_acute_burst_duration_mean is a component of the CCTE_Shafer_MEA_acute assay. It
measures the average time from the first spike (action potential firing) to the last spike in a single-electrode
burst (temporally-clustered group of spikes) during a recording in a microelectrode array (MEA) using Axion
Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike detector. The recordings before and
after chemical treatment are used to calculate a percent change in the mean burst duration in each well. Data
from the assay component CCTE_Shafer_MEA_acute_burst_duration_mean was analyzed into 1 assay
endpoint. This assay endpoint, CCTE_Shafer_MEA_acute_burst_duration_mean, was analyzed in the negative
analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of
functional reporter, gain or loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: The mean burst duration is a measure of bursting activity in a neural network. Changes in electrical
activity are indicative of effects on the spontaneous neural activity.

-------
The CCTE_Shafer_MEA_acute assay is designed to investigate changes in neural network activity in response to
chemical exposure in rat cortical neurons using a microelectrode array (MEA) technology. This endpoint
measures the average duration of bursts across all electrodes in the well.

2.2	Scientific Principles: Nervous system function is sensitive to chemical perturbation. The microelectrode arrays
(MEAs) offer an efficient approach to screen compounds for neuroactivity and distinguish between compound
effects on firing, bursting, and connectivity patterns.

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: Spontaneous activity in the cortical cultures was recorded using an Axion Biosystems Maestro
768 channel amplifier and Axion Integrated Studios (AxIS) vl.8 (or later) software. The amplifier recorded from
all channels simultaneously (gain = 1200 x; sampling rate = 12.5 kHz/channel): raw signals were filtered with a
Butterworth band-pass filter (300-5000 Hz), which filters out slower local field potentials leaving only fast
potentials, i.e., "spikes", arising from extracellular currents associated with action potentials (Pine 2006;
Wheeler and Nam 2011). On-line spike detection of filtered signals was conducted with the AxIS adaptive spike
detector, using a threshold of 8 x the root mean squared (rms) noise on each channel. Any electrodes with rms
noise levels greater than 5 nV were grounded (e.g., no data were recorded). Once grounded, an electrode was
grounded for the duration of the experiment. All recordings were conducted at 37 C. Wells were deemed usable
if on the day of the exposure > 10 electrodes were active (defined as > 5 spikes/min). On DIV 13 or 15, a
minimum of three wells from one cortical culture preparation were treated with each compound (0.03-40 nM).
Prior to recording baseline activity, each mwMEA plate was placed in the Maestro at 37 C and allowed to sit for
20 min while firing rates stabilized. Baseline activity (40 min) was recorded before the addition of each
compound. An additional 40 min of spontaneous activity was recorded in the presence of each compound.
Changes in network parameters relative to baseline were assessed following compound treatment.

Baseline median absolute deviation for the assay (bmad): 11.855
Response cutoff threshold used to determine hit calls: 35.564

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

3 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

2.6 Response: This endpoint measures the average time from the first spike to last spike in a single-electrode burst.
For an electrode, the average across bursts is reported. For a well, the average across electrode averages is
reported. Longer bursts indicate more excitation, less inhibition, as it takes longer to shut down a burst.

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neuroactivity.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw recordings were re-played and analyzed with the AxIS 2.3 Neural Statistics Compiler.
(Operating Procedure for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 49: pval.neg.100 (Calculate positive control value (pval) as -
100 for endpoints in the down direction; pval = -100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp

-------
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 525	Number of chemicals tested: 508

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
66

Inactive hit count: Oihitc 0.9
459

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

4
18

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

35

248

quadratic-polynomialfpoly2) model: 90

exponential-2 (exp2) model:

11

-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

94

1

16

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

4.126

Neutral control median absolute deviation, by plate: nmad

9.834

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

50.92%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

93.654
31.554

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	2.192

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 16.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2462

CCTE_Shafer_M EA_acute_per_bu rst_spi ke_n u m ber_mea n

1.	General Information

1.1	Assay Title: CCTE's Neural Acute Assay for Average Number of Spikes Per Burst, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_acute assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_acute assay uses primary cultures
of rat cortical neurons. As these cultures mature over time, neural networks develop functional activity and
form cohesive networks in which electrical activity can be highly coordinated. Baseline 40-minute recordings
are made on days post-plating 13 or 15, followed by a secondary 40-minute recording after chemical exposure.
CCTE_Shafer_MEA_acute_per_burst_spike_number_mean is a component of the CCTE_Shafer_MEA_acute
assay. It measures the average number of spikes (action potential firings) in a single-electrode burst (temporally-
clustered group of spikes) during a recording in a microelectrode array (MEA) using Axion Biosystems Maestro
768 channel amplifier with the AxIS adaptive spike detector. The recordings before and after chemical treatment
are used to calculate a percent change in the mean number of spikes per burst in each well. Data from the assay
component CCTE_Shafer_MEA_acute_per_burst_spike_number_mean was analyzed into 1 assay endpoint.
This assay endpoint, CCTE_Shafer_MEA_acute_per_burst_spike_number_mean, was analyzed in the negative
analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of
functional reporter, gain or loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: The mean number of spikes in a burst is a measure of bursting activity in a neural network. Changes
in electrical activity are indicative of effects on the spontaneous neural activity.

-------
The CCTE_Shafer_MEA_acute assay is designed to investigate changes in neural network activity in response to
chemical exposure in rat cortical neurons using a microelectrode array (MEA) technology. This endpoint
measures the average number of spikes per burst in the well.

2.2	Scientific Principles: Nervous system function is sensitive to chemical perturbation. The microelectrode arrays
(MEAs) offer an efficient approach to screen compounds for neuroactivity and distinguish between compound
effects on firing, bursting, and connectivity patterns.

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: Spontaneous activity in the cortical cultures was recorded using an Axion Biosystems Maestro
768 channel amplifier and Axion Integrated Studios (AxIS) vl.8 (or later) software. The amplifier recorded from
all channels simultaneously (gain = 1200 x; sampling rate = 12.5 kHz/channel): raw signals were filtered with a
Butterworth band-pass filter (300-5000 Hz), which filters out slower local field potentials leaving only fast
potentials, i.e., "spikes", arising from extracellular currents associated with action potentials (Pine 2006;
Wheeler and Nam 2011). On-line spike detection of filtered signals was conducted with the AxIS adaptive spike
detector, using a threshold of 8 x the root mean squared (rms) noise on each channel. Any electrodes with rms
noise levels greater than 5 nV were grounded (e.g., no data were recorded). Once grounded, an electrode was
grounded for the duration of the experiment. All recordings were conducted at 37 C. Wells were deemed usable
if on the day of the exposure > 10 electrodes were active (defined as > 5 spikes/min). On DIV 13 or 15, a
minimum of three wells from one cortical culture preparation were treated with each compound (0.03-40 nM).
Prior to recording baseline activity, each mwMEA plate was placed in the Maestro at 37 C and allowed to sit for
20 min while firing rates stabilized. Baseline activity (40 min) was recorded before the addition of each
compound. An additional 40 min of spontaneous activity was recorded in the presence of each compound.
Changes in network parameters relative to baseline were assessed following compound treatment.

Baseline median absolute deviation for the assay (bmad): 10.392
Response cutoff threshold used to determine hit calls: 31.175

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

3 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

2.6 Response: This endpoint measures the average number of spikes in a single-electrode burst. For an electrode,
the average across bursts is reported. For a well, the average across electrode averages is reported.

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neuroactivity.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw recordings were re-played and analyzed with the AxIS 2.3 Neural Statistics Compiler.
(Operating Procedure for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 49: pval.neg.100 (Calculate positive control value (pval) as -
100 for endpoints in the down direction; pval = -100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp

-------
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 525	Number of chemicals tested: 508

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
119

Inactive hit count: Oihitc 0.9
406

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

15
27

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

35

225

quadratic-polynomialfpoly2) model: 87

exponential-2 (exp2) model:

9

-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

95

24

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-4.769

Neutral control median absolute deviation, by plate: nmad

7.495

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-63.82%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

89.386
27.215

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	2.772

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 24.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2464

CCTE_Shafer_M EA_acute_i nterbu rstj nterva l_mea n

1.	General Information

1.1	Assay Title: CCTE's Neural Acute Assay for Mean Interburst Interval, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_acute assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_acute assay uses primary cultures
of rat cortical neurons. As these cultures mature over time, neural networks develop functional activity and
form cohesive networks in which electrical activity can be highly coordinated. Baseline 40-minute recordings
are made on days post-plating 13 or 15, followed by a secondary 40-minute recording after chemical exposure.
CCTE_Shafer_MEA_acute_interburst_interval_mean is a component of the CCTE_Shafer_MEA_acute assay. It
measures the average time between the start of consecutive single-electrode bursts (temporally-clustered
groups of action potential firing) during a recording in a microelectrode array (MEA) using Axion Biosystems
Maestro 768 channel amplifier with the AxIS adaptive spike detector. The recordings before and after chemical
treatment are used to calculate a percent change in the mean inter-burst interval in each well. Data from the
assay component CCTE_Shafer_MEA_acute_interburst_interval_mean was analyzed into 1 assay endpoint. This
assay endpoint, CCTE_Shafer_MEA_acute_interburst_interval_mean, was analyzed in the negative analysis
fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of functional
reporter, gain or loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: The mean inter-burst interval is a measure of bursting activity in a neural network. Changes in
electrical activity are indicative of effects on the spontaneous neural activity.

-------
The CCTE_Shafer_MEA_acute assay is designed to investigate changes in neural network activity in response to
chemical exposure in rat cortical neurons using a microelectrode array (MEA) technology. This endpoint
measures the interval between bursts within a well.

2.2	Scientific Principles: Nervous system function is sensitive to chemical perturbation. The microelectrode arrays
(MEAs) offer an efficient approach to screen compounds for neuroactivity and distinguish between compound
effects on firing, bursting, and connectivity patterns.

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: Spontaneous activity in the cortical cultures was recorded using an Axion Biosystems Maestro
768 channel amplifier and Axion Integrated Studios (AxIS) vl.8 (or later) software. The amplifier recorded from
all channels simultaneously (gain = 1200 x; sampling rate = 12.5 kHz/channel): raw signals were filtered with a
Butterworth band-pass filter (300-5000 Hz), which filters out slower local field potentials leaving only fast
potentials, i.e., "spikes", arising from extracellular currents associated with action potentials (Pine 2006;
Wheeler and Nam 2011). On-line spike detection of filtered signals was conducted with the AxIS adaptive spike
detector, using a threshold of 8 x the root mean squared (rms) noise on each channel. Any electrodes with rms
noise levels greater than 5 nV were grounded (e.g., no data were recorded). Once grounded, an electrode was
grounded for the duration of the experiment. All recordings were conducted at 37 C. Wells were deemed usable
if on the day of the exposure > 10 electrodes were active (defined as > 5 spikes/min). On DIV 13 or 15, a
minimum of three wells from one cortical culture preparation were treated with each compound (0.03-40 nM).
Prior to recording baseline activity, each mwMEA plate was placed in the Maestro at 37 C and allowed to sit for
20 min while firing rates stabilized. Baseline activity (40 min) was recorded before the addition of each
compound. An additional 40 min of spontaneous activity was recorded in the presence of each compound.
Changes in network parameters relative to baseline were assessed following compound treatment.

Baseline median absolute deviation for the assay (bmad): 34.057
Response cutoff threshold used to determine hit calls: 102.172

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

3 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

2.6 Response: This endpoint measures the average time between the start of single-electrode bursts. For an
electrode, the average across bursts is reported. For a well, the average across electrode averages is reported.

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neuroactivity.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw recordings were re-played and analyzed with the AxIS 2.3 Neural Statistics Compiler.
(Operating Procedure for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 49: pval.neg.100 (Calculate positive control value (pval) as -
100 for endpoints in the down direction; pval = -100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp

-------
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 525	Number of chemicals tested: 508

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
110

Inactive hit count: Oihitc 0.9
415

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

9

28

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

21

191

quadratic-polynomialfpoly2) model:	101

exponential-2 (exp2) model:

12

-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

1

136

18

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

73.234

Neutral control median absolute deviation, by plate: nmad

48.719

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

73.38%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

8.043
28.397

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.882

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 18.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2466

CCTE_Shafer_M EA_acute_bu rst_percentage_mea n

1. General Information

1.1	Assay Title: CCTE's Neural Acute Assay for Percentage of Spikes in a Burst, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_acute assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_acute assay uses primary cultures
of rat cortical neurons. As these cultures mature over time, neural networks develop functional activity and
form cohesive networks in which electrical activity can be highly coordinated. Baseline 40-minute recordings
are made on days post-plating 13 or 15, followed by a secondary 40-minute recording after chemical exposure.
CCTE_Shafer_MEA_acute_burst_percentage_mean is a component of the CCTE_Shafer_MEA_acute assay. It
measures the percentage of spikes (action potential firing) in a single-electrode burst (temporally-clustered
groups of spikes) as the number of spikes in bursts divided by the total number of spikes during a recording in a
microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector. The average of the burst percentages across electrodes is calculated to report a well-level value. The
recordings before and after chemical treatment are used to calculate a percent change in the mean burst
percentage in each well. Data from the assay component CCTE_Shafer_MEA_acute_burst_percentage_mean
was analyzed into 1 assay endpoint. This assay endpoint, CCTE_Shafer_MEA_acute_burst_percentage_mean,
was analyzed in the negative analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of functional reporter, gain or loss-of-signal activity can be used to understand electrical
activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1 Purpose: The mean percent of spikes in bursts is a measure of bursting activity in a neural network. Changes
in electrical activity are indicative of effects on the spontaneous neural activity.

The CCTE_Shafer_MEA_acute assay is designed to investigate changes in neural network activity in response to
chemical exposure in rat cortical neurons using a microelectrode array (MEA) technology. This endpoint
measures the percentage of spikes that occur in a network burst in the well.

2.2	Scientific Principles: Nervous system function is sensitive to chemical perturbation. The microelectrode arrays
(MEAs) offer an efficient approach to screen compounds for neuroactivity and distinguish between compound
effects on firing, bursting, and connectivity patterns.

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: Spontaneous activity in the cortical cultures was recorded using an Axion Biosystems Maestro
768 channel amplifier and Axion Integrated Studios (AxIS) vl.8 (or later) software. The amplifier recorded from
all channels simultaneously (gain = 1200 x; sampling rate = 12.5 kHz/channel): raw signals were filtered with a
Butterworth band-pass filter (300-5000 Hz), which filters out slower local field potentials leaving only fast
potentials, i.e., "spikes", arising from extracellular currents associated with action potentials (Pine 2006;
Wheeler and Nam 2011). On-line spike detection of filtered signals was conducted with the AxIS adaptive spike
detector, using a threshold of 8 x the root mean squared (rms) noise on each channel. Any electrodes with rms
noise levels greater than 5 nV were grounded (e.g., no data were recorded). Once grounded, an electrode was
grounded for the duration of the experiment. All recordings were conducted at 37 C. Wells were deemed usable
if on the day of the exposure > 10 electrodes were active (defined as > 5 spikes/min). On DIV 13 or 15, a
minimum of three wells from one cortical culture preparation were treated with each compound (0.03-40 nM).
Prior to recording baseline activity, each mwMEA plate was placed in the Maestro at 37 C and allowed to sit for
20 min while firing rates stabilized. Baseline activity (40 min) was recorded before the addition of each
compound. An additional 40 min of spontaneous activity was recorded in the presence of each compound.
Changes in network parameters relative to baseline were assessed following compound treatment.

Baseline median absolute deviation for the assay (bmad): 4.859
Response cutoff threshold used to determine hit calls: 14.577

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

3 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

-------
2.6 Response: This endpoint measures the number of spikes in single-electrode bursts divided by the total number
of spikes, multiplied by 100. For a well, the average across electrode burst percentages is reported.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neuroactivity.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw recordings were re-played and analyzed with the AxIS 2.3 Neural Statistics Compiler.
(Operating Procedure for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 49: pval.neg.100 (Calculate positive control value (pval) as -
100 for endpoints in the down direction; pval = -100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 525	Number of chemicals tested: 508

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
235

Inactive hit count: 0
-------
quadratic-polynomialfpoly2) model: 94

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

49

26

2

104

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-6.163

Neutral control median absolute deviation, by plate: nmad

3.783

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-58.22%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	3.295

Positive control well median absolute deviation, by plate: pmad	6.381

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	1.431

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 49.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2468

CCTE_Shafer_M EA_acute_bu rst_percentage_std

1. General Information

1.1	Assay Title: CCTE's Neural Acute Assay for Standard Deviation of Burst Percentage, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_acute assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_acute assay uses primary cultures
of rat cortical neurons. As these cultures mature over time, neural networks develop functional activity and
form cohesive networks in which electrical activity can be highly coordinated. Baseline 40-minute recordings
are made on days post-plating 13 or 15, followed by a secondary 40-minute recording after chemical exposure.
CCTE_Shafer_MEA_acute_burst_percentage_std is a component of the CCTE_Shafer_MEA_acute assay. It
measures the percentage of spikes (action potential firing) in a single-electrode burst (temporally-clustered
groups of spikes) as the number of spikes in bursts divided by the total number of spikes during a recording in a
microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector. The standard deviation of the burst percentages across electrodes is calculated to report a well-level
value. The recordings before and after chemical treatment are used to calculate a percent change in the
standard deviation of the burst percentage in each well. Data from the assay component
CCTE_Shafer_MEA_acute_burst_percentage_std was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Shafer_MEA_acute_burst_percentage_std, was analyzed in the negative analysis fitting direction relative
to DMSO as the negative control and baseline of activity. Using a type of functional reporter, gain or loss-of-
signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1 Purpose: The standard deviation of the percent of spikes in bursts is a measure of bursting activity in a neural
network. Changes in electrical activity are indicative of effects on the spontaneous neural activity.

The CCTE_Shafer_MEA_acute assay is designed to investigate changes in neural network activity in response to
chemical exposure in rat cortical neurons using a microelectrode array (MEA) technology. This endpoint
measures the standard deviation of the burst percentage in the well.

2.2	Scientific Principles: Nervous system function is sensitive to chemical perturbation. The microelectrode arrays
(MEAs) offer an efficient approach to screen compounds for neuroactivity and distinguish between compound
effects on firing, bursting, and connectivity patterns.

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: Spontaneous activity in the cortical cultures was recorded using an Axion Biosystems Maestro
768 channel amplifier and Axion Integrated Studios (AxIS) vl.8 (or later) software. The amplifier recorded from
all channels simultaneously (gain = 1200 x; sampling rate = 12.5 kHz/channel): raw signals were filtered with a
Butterworth band-pass filter (300-5000 Hz), which filters out slower local field potentials leaving only fast
potentials, i.e., "spikes", arising from extracellular currents associated with action potentials (Pine 2006;
Wheeler and Nam 2011). On-line spike detection of filtered signals was conducted with the AxIS adaptive spike
detector, using a threshold of 8 x the root mean squared (rms) noise on each channel. Any electrodes with rms
noise levels greater than 5 nV were grounded (e.g., no data were recorded). Once grounded, an electrode was
grounded for the duration of the experiment. All recordings were conducted at 37 C. Wells were deemed usable
if on the day of the exposure > 10 electrodes were active (defined as > 5 spikes/min). On DIV 13 or 15, a
minimum of three wells from one cortical culture preparation were treated with each compound (0.03-40 nM).
Prior to recording baseline activity, each mwMEA plate was placed in the Maestro at 37 C and allowed to sit for
20 min while firing rates stabilized. Baseline activity (40 min) was recorded before the addition of each
compound. An additional 40 min of spontaneous activity was recorded in the presence of each compound.
Changes in network parameters relative to baseline were assessed following compound treatment.

Baseline median absolute deviation for the assay (bmad): 18.069
Response cutoff threshold used to determine hit calls: 54.207

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

3 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

2.6 Response: This endpoint measures the standard deviation of the burst percentage.

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neuroactivity.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw recordings were re-played and analyzed with the AxIS 2.3 Neural Statistics Compiler.
(Operating Procedure for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 49: pval.neg.100 (Calculate positive control value (pval) as -
100 for endpoints in the down direction; pval = -100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one

-------
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 525	Number of chemicals tested: 508

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
105

Inactive hit count: 0
-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

73

16

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

2.195

Neutral control median absolute deviation, by plate: nmad

16.594

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

73.68%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

0.537
31.739

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.161

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 16.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2470

CCTE_Shafer_MEA_acute_per_network_burst_spike_number_mean

1. General Information

1.1	Assay Title: CCTE's Neural Acute Assay for Average Number of Spikes in a Network Burst, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_acute assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_acute assay uses primary cultures
of rat cortical neurons. As these cultures mature over time, neural networks develop functional activity and
form cohesive networks in which electrical activity can be highly coordinated. Baseline 40-minute recordings
are made on days post-plating 13 or 15, followed by a secondary 40-minute recording after chemical exposure.
CCTE_Shafer_MEA_acute_per_network_burst_spike_number_mean is a component of the
CCTE_Shafer_MEA_acute assay. It measures the average number of spikes (action potential firings) in a network
burst (temporally-clustered group of spikes across multiple electrodes) as the total number of spikes in a
network burst divided by the number of network bursts during a recording in a microelectrode array (MEA)
using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike detector. The recordings
before and after chemical treatment are used to calculate a percent change in the mean number of spikes in a
network burst in each well. Data from the assay component
CCTE_Shafer_MEA_acute_per_network_burst_spike_number_mean was analyzed into 1 assay endpoint. This
assay endpoint, CCTE_Shafer_MEA_acute_per_network_burst_spike_number_mean, was analyzed in the
negative analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of functional reporter, gain or loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1 Purpose: Networks bursts occur as the result of network connectivity. Changes in the mean number of spikes
in a network burst are indicative of changes in coordinated neural network activity.

The CCTE_Shafer_MEA_acute assay is designed to investigate changes in neural network activity in response to
chemical exposure in rat cortical neurons using a microelectrode array (MEA) technology. This endpoint
measures the mean number of spiks in a network burst in the well.

2.2	Scientific Principles: Nervous system function is sensitive to chemical perturbation. The microelectrode arrays
(MEAs) offer an efficient approach to screen compounds for neuroactivity and distinguish between compound
effects on firing, bursting, and connectivity patterns.

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: Spontaneous activity in the cortical cultures was recorded using an Axion Biosystems Maestro
768 channel amplifier and Axion Integrated Studios (AxIS) vl.8 (or later) software. The amplifier recorded from
all channels simultaneously (gain = 1200 x; sampling rate = 12.5 kHz/channel): raw signals were filtered with a
Butterworth band-pass filter (300-5000 Hz), which filters out slower local field potentials leaving only fast
potentials, i.e., "spikes", arising from extracellular currents associated with action potentials (Pine 2006;
Wheeler and Nam 2011). On-line spike detection of filtered signals was conducted with the AxIS adaptive spike
detector, using a threshold of 8 x the root mean squared (rms) noise on each channel. Any electrodes with rms
noise levels greater than 5 nV were grounded (e.g., no data were recorded). Once grounded, an electrode was
grounded for the duration of the experiment. All recordings were conducted at 37 C. Wells were deemed usable
if on the day of the exposure > 10 electrodes were active (defined as > 5 spikes/min). On DIV 13 or 15, a
minimum of three wells from one cortical culture preparation were treated with each compound (0.03-40 nM).
Prior to recording baseline activity, each mwMEA plate was placed in the Maestro at 37 C and allowed to sit for
20 min while firing rates stabilized. Baseline activity (40 min) was recorded before the addition of each
compound. An additional 40 min of spontaneous activity was recorded in the presence of each compound.
Changes in network parameters relative to baseline were assessed following compound treatment.

Baseline median absolute deviation for the assay (bmad): 20.737
Response cutoff threshold used to determine hit calls: 62.212

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

3 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

2.6 Response: This endpoint measures the average number of spikes in a network burst.

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neuroactivity.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw recordings were re-played and analyzed with the AxIS 2.3 Neural Statistics Compiler.
(Operating Procedure for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 49: pval.neg.100 (Calculate positive control value (pval) as -
100 for endpoints in the down direction; pval = -100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one

-------
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 525	Number of chemicals tested: 508

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
99

Inactive hit count: 0
-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

21

125

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-24.058

Neutral control median absolute deviation, by plate: nmad

13.212

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-43.05%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

199.014
63.921

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	3.529

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 21.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2472

CCTE_Shafer_MEA_acute_per_network_burst_spike_number_std

1. General Information

1.1	Assay Title: CCTE's Neural Acute Assay for Standard Deviation of Number of Spikes in a Network Burst, Shafer
Lab

1.2	Assay Summary: CCTE_Shafer_MEA_acute assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_acute assay uses primary cultures
of rat cortical neurons. As these cultures mature over time, neural networks develop functional activity and
form cohesive networks in which electrical activity can be highly coordinated. Baseline 40-minute recordings
are made on days post-plating 13 or 15, followed by a secondary 40-minute recording after chemical exposure.
CCTE_Shafer_MEA_acute_per_network_burst_spike_number_std is a component of the
CCTE_Shafer_MEA_acute assay. It measures the standard deviation of the number of spikes (action potential
firings) in a network burst (temporally-clustered group of spikes across multiple electrodes) during a recording
in a microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive
spike detector. The recordings before and after chemical treatment are used to calculate a percent change in
the standard deviation of the number of spikes in network bursts in each well. Data from the assay component
CCTE_Shafer_MEA_acute_burst_spike_number_std was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Shafer_MEA_acute_burst_spike_number_std, was analyzed in the negative analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of functional reporter, gain or
loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1 Purpose: Networks bursts occur as the result of network connectivity. Changes in the standard deviation of
the number of spikes in a network burst are indicative of changes in coordinated neural network activity.

The CCTE_Shafer_MEA_acute assay is designed to investigate changes in neural network activity in response to
chemical exposure in rat cortical neurons using a microelectrode array (MEA) technology. This endpoint
measures the standard deviation of the number of spikes in a network burst in a well.

2.2	Scientific Principles: Nervous system function is sensitive to chemical perturbation. The microelectrode arrays
(MEAs) offer an efficient approach to screen compounds for neuroactivity and distinguish between compound
effects on firing, bursting, and connectivity patterns.

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: Spontaneous activity in the cortical cultures was recorded using an Axion Biosystems Maestro
768 channel amplifier and Axion Integrated Studios (AxIS) vl.8 (or later) software. The amplifier recorded from
all channels simultaneously (gain = 1200 x; sampling rate = 12.5 kHz/channel): raw signals were filtered with a
Butterworth band-pass filter (300-5000 Hz), which filters out slower local field potentials leaving only fast
potentials, i.e., "spikes", arising from extracellular currents associated with action potentials (Pine 2006;
Wheeler and Nam 2011). On-line spike detection of filtered signals was conducted with the AxIS adaptive spike
detector, using a threshold of 8 x the root mean squared (rms) noise on each channel. Any electrodes with rms
noise levels greater than 5 nV were grounded (e.g., no data were recorded). Once grounded, an electrode was
grounded for the duration of the experiment. All recordings were conducted at 37 C. Wells were deemed usable
if on the day of the exposure > 10 electrodes were active (defined as > 5 spikes/min). On DIV 13 or 15, a
minimum of three wells from one cortical culture preparation were treated with each compound (0.03-40 nM).
Prior to recording baseline activity, each mwMEA plate was placed in the Maestro at 37 C and allowed to sit for
20 min while firing rates stabilized. Baseline activity (40 min) was recorded before the addition of each
compound. An additional 40 min of spontaneous activity was recorded in the presence of each compound.
Changes in network parameters relative to baseline were assessed following compound treatment.

Baseline median absolute deviation for the assay (bmad): 17.371
Response cutoff threshold used to determine hit calls: 52.112

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

3 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

2.6 Response: The endpoint measures the standard deviation of the number of spikes in a network burst.

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neuroactivity.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw recordings were re-played and analyzed with the AxIS 2.3 Neural Statistics Compiler.
(Operating Procedure for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 49: pval.neg.100 (Calculate positive control value (pval) as -
100 for endpoints in the down direction; pval = -100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one

-------
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 525	Number of chemicals tested: 508

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
121

Inactive hit count: 0
-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

2

23

112

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

3.576

Neutral control median absolute deviation, by plate: nmad

13.52

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

48.36%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

-41.373
14.568

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.653

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 23.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2474

CCTE_Shafer_M EA_acute_per_network_bu rst_electrodes_nu m ber_mea

n

1. General Information

1.1	Assay Title: CCTE's Neural Acute Assay for Mean Number of Electrodes in a Network Burst, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_acute assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_acute assay uses primary cultures
of rat cortical neurons. As these cultures mature over time, neural networks develop functional activity and
form cohesive networks in which electrical activity can be highly coordinated. Baseline 40-minute recordings
are made on days post-plating 13 or 15, followed by a secondary 40-minute recording after chemical exposure.
CCTE_Shafer_MEA_acute_per_network_burst_electrodes_number_mean is a component of the
CCTE_Shafer_MEA_acute assay. It measures the average number of electrodes with activity during a network
burst (a temporally-clustered group of action potential firing across multiple electrodes) in a recording in a
microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector. The recordings before and after chemical treatment are used to calculate a percent change in the
mean number of electrodes participating in network bursts in each well. Data from the assay component
CCTE_Shafer_MEA_acute_per_network_burst_electrodes_number_mean was analyzed into 1 assay endpoint.
This assay endpoint, CCTE_Shafer_MEA_acute_per_network_burst_electrodes_number_mean, was analyzed in
the negative analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a
type of functional reporter, gain or loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1 Purpose: Networks bursts occur as the result of network connectivity. Changes in the average number of
electrodes participating in a network burst are indicative of changes in coordinated neural network activity.

The CCTE_Shafer_MEA_acute assay is designed to investigate changes in neural network activity in response to
chemical exposure in rat cortical neurons using a microelectrode array (MEA) technology. This endpoint
measures the average number electrodes participating in a network burst in the well.

2.2	Scientific Principles: Nervous system function is sensitive to chemical perturbation. The microelectrode arrays
(MEAs) offer an efficient approach to screen compounds for neuroactivity and distinguish between compound
effects on firing, bursting, and connectivity patterns.

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: Spontaneous activity in the cortical cultures was recorded using an Axion Biosystems Maestro
768 channel amplifier and Axion Integrated Studios (AxIS) vl.8 (or later) software. The amplifier recorded from
all channels simultaneously (gain = 1200 x; sampling rate = 12.5 kHz/channel): raw signals were filtered with a
Butterworth band-pass filter (300-5000 Hz), which filters out slower local field potentials leaving only fast
potentials, i.e., "spikes", arising from extracellular currents associated with action potentials (Pine 2006;
Wheeler and Nam 2011). On-line spike detection of filtered signals was conducted with the AxIS adaptive spike
detector, using a threshold of 8 x the root mean squared (rms) noise on each channel. Any electrodes with rms
noise levels greater than 5 nV were grounded (e.g., no data were recorded). Once grounded, an electrode was
grounded for the duration of the experiment. All recordings were conducted at 37 C. Wells were deemed usable
if on the day of the exposure > 10 electrodes were active (defined as > 5 spikes/min). On DIV 13 or 15, a
minimum of three wells from one cortical culture preparation were treated with each compound (0.03-40 nM).
Prior to recording baseline activity, each mwMEA plate was placed in the Maestro at 37 C and allowed to sit for
20 min while firing rates stabilized. Baseline activity (40 min) was recorded before the addition of each
compound. An additional 40 min of spontaneous activity was recorded in the presence of each compound.
Changes in network parameters relative to baseline were assessed following compound treatment.

Baseline median absolute deviation for the assay (bmad): 6.926
Response cutoff threshold used to determine hit calls: 20.777

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

3 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

-------
2.6 Response: This endpoint measures the average number of electrodes participating in a network burst, if it's
greater than the burst electrode criterion.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neuroactivity.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw recordings were re-played and analyzed with the AxIS 2.3 Neural Statistics Compiler.
(Operating Procedure for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 49: pval.neg.100 (Calculate positive control value (pval) as -
100 for endpoints in the down direction; pval = -100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 525	Number of chemicals tested: 508

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
105

Inactive hit count: 0
-------
quadratic-polynomialfpoly2) model: 81

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

8

102

13

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-5.165

Neutral control median absolute deviation, by plate: nmad

5.192

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-69.61%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	11.635

Positive control well median absolute deviation, by plate: pmad	7.247

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	2.073

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 13.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2476

CCTE_Shafer_M EA_acute_network_bu rst_percentage

1.	General Information

1.1	Assay Title: CCTE's Neural Acute Assay for Percentage of Network Bursts, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_acute assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_acute assay uses primary cultures
of rat cortical neurons. As these cultures mature over time, neural networks develop functional activity and
form cohesive networks in which electrical activity can be highly coordinated. Baseline 40-minute recordings
are made on days post-plating 13 or 15, followed by a secondary 40-minute recording after chemical exposure.
CCTE_Shafer_MEA_acute_network_burst_percentage is a component of the CCTE_Shafer_MEA_acute assay. It
measures the percent of spikes (action potential firings) in network bursts (temporally-clustered groups of
spikes across multiple electrodes) as the number of spikes occuring in network bursts divided by the total
number of spikes during a recording in a microelectrode array (MEA) using Axion Biosystems Maestro 768
channel amplifier with the AxIS adaptive spike detector. The recordings before and after chemical treatment
are used to calculate a percent change in the network burst percentage in each well. Data from the assay
component CCTE_Shafer_MEA_acute_network_burst_percentage was analyzed into 1 assay endpoint. This
assay endpoint, CCTE_Shafer_MEA_acute_network_burst_percentage, was analyzed in the negative analysis
fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of functional
reporter, gain or loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Networks bursts occur as the result of network connectivity. Changes in the percent of spikes in a
network burst are indicative of changes in coordinated neural network activity.

-------
The CCTE_Shafer_MEA_acute assay is designed to investigate changes in neural network activity in response to
chemical exposure in rat cortical neurons using a microelectrode array (MEA) technology. This endpoint
measures the percentage of network bursts in a well.

2.2	Scientific Principles: Nervous system function is sensitive to chemical perturbation. The microelectrode arrays
(MEAs) offer an efficient approach to screen compounds for neuroactivity and distinguish between compound
effects on firing, bursting, and connectivity patterns.

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: Spontaneous activity in the cortical cultures was recorded using an Axion Biosystems Maestro
768 channel amplifier and Axion Integrated Studios (AxIS) vl.8 (or later) software. The amplifier recorded from
all channels simultaneously (gain = 1200 x; sampling rate = 12.5 kHz/channel): raw signals were filtered with a
Butterworth band-pass filter (300-5000 Hz), which filters out slower local field potentials leaving only fast
potentials, i.e., "spikes", arising from extracellular currents associated with action potentials (Pine 2006;
Wheeler and Nam 2011). On-line spike detection of filtered signals was conducted with the AxIS adaptive spike
detector, using a threshold of 8 x the root mean squared (rms) noise on each channel. Any electrodes with rms
noise levels greater than 5 nV were grounded (e.g., no data were recorded). Once grounded, an electrode was
grounded for the duration of the experiment. All recordings were conducted at 37 C. Wells were deemed usable
if on the day of the exposure > 10 electrodes were active (defined as > 5 spikes/min). On DIV 13 or 15, a
minimum of three wells from one cortical culture preparation were treated with each compound (0.03-40 nM).
Prior to recording baseline activity, each mwMEA plate was placed in the Maestro at 37 C and allowed to sit for
20 min while firing rates stabilized. Baseline activity (40 min) was recorded before the addition of each
compound. An additional 40 min of spontaneous activity was recorded in the presence of each compound.
Changes in network parameters relative to baseline were assessed following compound treatment.

Baseline median absolute deviation for the assay (bmad): 1.773
Response cutoff threshold used to determine hit calls: 5.319

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

3 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

2.6 Response: This endpoint measures the number of spikes in network bursts divided by the total number of
spikes, multiplied by 100.

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neuroactivity.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw recordings were re-played and analyzed with the AxIS 2.3 Neural Statistics Compiler.
(Operating Procedure for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 49: pval.neg.100 (Calculate positive control value (pval) as -
100 for endpoints in the down direction; pval = -100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp

-------
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 525	Number of chemicals tested: 508

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
148

Inactive hit count: Oihitc 0.9
377

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

5

13

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

27

279

quadratic-polynomialfpoly2) model: 62

exponential-2 (exp2) model:

33

-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

2

79

17

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-0.943

Neutral control median absolute deviation, by plate: nmad

1.604

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-117.78%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

2.482
1.653

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	1.422

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 17.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2478

CCTE_Shafer_MEA_acute_cross_correlation_area

1.	General Information

1.1	Assay Title: CCTE's Neural Acute Assay for Area of Cross Correlation, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_acute assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_acute assay uses primary cultures
of rat cortical neurons. As these cultures mature over time, neural networks develop functional activity and
form cohesive networks in which electrical activity can be highly coordinated. Baseline 40-minute recordings
are made on days post-plating 13 or 15, followed by a secondary 40-minute recording after chemical exposure.
CCTE_Shafer_MEA_acute_cross_correlation_area is a component of the CCTE_Shafer_MEA_acute assay. It
measures the area under the well-wide pooled inter-electrode cross-correlation of action potential firing (not
normalized to auto-correlations) within a window centered around 0 for the phase lag during a recording in a
microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector. The recordings before and after chemical treatment are used to calculate a percent change in the
cross correlation area in each well. Data from the assay component
CCTE_Shafer_MEA_acute_cross_correlation_area was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Shafer_MEA_acute_cross_correlation_area, was analyzed in the negative analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of functional reporter, gain or
loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: The cross correlation area is a measure of the synchrony of a neural network. Changes in the
synchrony are indicative of changes in coordinated neural network activity.

-------
The CCTE_Shafer_MEA_acute assay is designed to investigate changes in neural network activity in response to
chemical exposure in rat cortical neurons using a microelectrode array (MEA) technology. This endpoint
measures the area of the cross correlegram of activity across all electrodes in the well. Smaller values indicate
higher levels of correlation of activity.

2.2	Scientific Principles: Nervous system function is sensitive to chemical perturbation. The microelectrode arrays
(MEAs) offer an efficient approach to screen compounds for neuroactivity and distinguish between compound
effects on firing, bursting, and connectivity patterns.

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: Spontaneous activity in the cortical cultures was recorded using an Axion Biosystems Maestro
768 channel amplifier and Axion Integrated Studios (AxIS) vl.8 (or later) software. The amplifier recorded from
all channels simultaneously (gain = 1200 x; sampling rate = 12.5 kHz/channel): raw signals were filtered with a
Butterworth band-pass filter (300-5000 Hz), which filters out slower local field potentials leaving only fast
potentials, i.e., "spikes", arising from extracellular currents associated with action potentials (Pine 2006;
Wheeler and Nam 2011). On-line spike detection of filtered signals was conducted with the AxIS adaptive spike
detector, using a threshold of 8 x the root mean squared (rms) noise on each channel. Any electrodes with rms
noise levels greater than 5 nV were grounded (e.g., no data were recorded). Once grounded, an electrode was
grounded for the duration of the experiment. All recordings were conducted at 37 C. Wells were deemed usable
if on the day of the exposure > 10 electrodes were active (defined as > 5 spikes/min). On DIV 13 or 15, a
minimum of three wells from one cortical culture preparation were treated with each compound (0.03-40 nM).
Prior to recording baseline activity, each mwMEA plate was placed in the Maestro at 37 C and allowed to sit for
20 min while firing rates stabilized. Baseline activity (40 min) was recorded before the addition of each
compound. An additional 40 min of spontaneous activity was recorded in the presence of each compound.
Changes in network parameters relative to baseline were assessed following compound treatment.

Baseline median absolute deviation for the assay (bmad): 31.013
Response cutoff threshold used to determine hit calls: 93.04

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

3 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

2.6 Response: This endpoint measures the area under the well-wide pooled inter-electrode cross-correlation of
action potential firing (not normalized to auto-correlations) within a window centered around 0 for the phase

-------
lag during a recording. The recordings before and after chemical treatment are used to calculate a percent
change in the cross correlation area in each well. This endpoint can also be normalized to the auto-correlations,
where higher areas indicate greater synchrony.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neuroactivity.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw recordings were re-played and analyzed with the AxIS 2.3 Neural Statistics Compiler.
(Operating Procedure for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 49: pval.neg.100 (Calculate positive control value (pval) as -
100 for endpoints in the down direction; pval = -100.)

-------
Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 525	Number of chemicals tested: 508

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
67

Inactive hit count: Oihitc 0.9
458

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

48
47

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

46

138

-------
quadratic-polynomialfpoly2) model: 62

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

4

54

1

118

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-45.992

Neutral control median absolute deviation, by plate: nmad

14.285

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-31.06%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	23.221

Positive control well median absolute deviation, by plate: pmad	30.255

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	1.712

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 54.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2480

CCTE_Shafer_M EA_acute_cross_correlation_H WH M

1.	General Information

1.1	Assay Title: CCTE's Neural Acute Assay for Cross Correlation at Half Width at Half Maximum, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_acute assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_acute assay uses primary cultures
of rat cortical neurons. As these cultures mature over time, neural networks develop functional activity and
form cohesive networks in which electrical activity can be highly coordinated. Baseline 40-minute recordings
are made on days post-plating 13 or 15, followed by a secondary 40-minute recording after chemical exposure.
CCTE_Shafer_MEA_acute_cross_correlation_HWHM is a component of the CCTE_Shafer_MEA_acute assay. It
measures the distance along the x-axis (phase lag) from the left half maximum height to the center of the cross-
correlogram of action potential firing during a recording in a microelectrode array (MEA) using Axion Biosystems
Maestro 768 channel amplifier with the AxIS adaptive spike detector. Smaller values indicate a narrower
correlogram (greater synchrony). The recordings before and after chemical treatment are used to calculate a
percent change in the half width at half maximum of the cross-correlogram in each well. Data from the assay
component CCTE_Shafer_MEA_acute_cross_correlation_HWHM was analyzed into 1 assay endpoint. This assay
endpoint, CCTE_Shafer_MEA_acute_cross_correlation_HWHM, was analyzed in the negative analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of functional reporter,
gain or loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: The half width at half maximum of the cross-correlogram is a measure of the synchrony of a neural
network. Changes in the synchrony are indicative of changes in coordinated neural network activity.

-------
The CCTE_Shafer_MEA_acute assay is designed to investigate changes in neural network activity in response to
chemical exposure in rat cortical neurons using a microelectrode array (MEA) technology. This endpoint
measures the half width of the cross correlation at half maximum across all electrodes in the well. The smaller
the value, the more highly correlated the activity is across electrodes.

2.2	Scientific Principles: Nervous system function is sensitive to chemical perturbation. The microelectrode arrays
(MEAs) offer an efficient approach to screen compounds for neuroactivity and distinguish between compound
effects on firing, bursting, and connectivity patterns.

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: Spontaneous activity in the cortical cultures was recorded using an Axion Biosystems Maestro
768 channel amplifier and Axion Integrated Studios (AxIS) vl.8 (or later) software. The amplifier recorded from
all channels simultaneously (gain = 1200 x; sampling rate = 12.5 kHz/channel): raw signals were filtered with a
Butterworth band-pass filter (300-5000 Hz), which filters out slower local field potentials leaving only fast
potentials, i.e., "spikes", arising from extracellular currents associated with action potentials (Pine 2006;
Wheeler and Nam 2011). On-line spike detection of filtered signals was conducted with the AxIS adaptive spike
detector, using a threshold of 8 x the root mean squared (rms) noise on each channel. Any electrodes with rms
noise levels greater than 5 nV were grounded (e.g., no data were recorded). Once grounded, an electrode was
grounded for the duration of the experiment. All recordings were conducted at 37 C. Wells were deemed usable
if on the day of the exposure > 10 electrodes were active (defined as > 5 spikes/min). On DIV 13 or 15, a
minimum of three wells from one cortical culture preparation were treated with each compound (0.03-40 nM).
Prior to recording baseline activity, each mwMEA plate was placed in the Maestro at 37 C and allowed to sit for
20 min while firing rates stabilized. Baseline activity (40 min) was recorded before the addition of each
compound. An additional 40 min of spontaneous activity was recorded in the presence of each compound.
Changes in network parameters relative to baseline were assessed following compound treatment.

Baseline median absolute deviation for the assay (bmad): 10.18
Response cutoff threshold used to determine hit calls: 30.541

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

3 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

2.6 Response: This endpoint measures the distance along the x-axis (phase lag) from the left half maximum height
to the center of the cross-correlogram of action potential firing during a recording. This endpoint is a measure

-------
of network synchrony; higher half widths indicate a wider correlogram (less synchrony) whereas lower half
widths indicate a taller/more narrow correlogram (greater synchrony). The recordings before and after chemical
treatment are used to calculate a percent change in the half width at half maximum of the cross-correlogram in
each well.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neuroactivity.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw recordings were re-played and analyzed with the AxIS 2.3 Neural Statistics Compiler.
(Operating Procedure for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)

-------
of 1 or 2 or neutral control wells (wilt = n).), 49: pval.neg.100 (Calculate positive control value (pval) as -
100 for endpoints in the down direction; pval = -100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 525	Number of chemicals tested: 508

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
199

Inactive hit count: 0
-------
power(pow) model:
linear-polynomial (polyl) model:

38

197

quadratic-polynomialfpoly2) model: 96

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

25

2

97

39

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-------
-0.744

Neutral control median absolute deviation, by plate: nmad	8.578

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-23.889

Positive control well median absolute deviation, by plate: pmad	37.75

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.396

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 39.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2482

CCTE_Shafer_M EA_acute_synch ronyjndex

1.	General Information

1.1	Assay Title: CCTE's Neural Acute Assay for Synchrony, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_acute assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_acute assay uses primary cultures
of rat cortical neurons. As these cultures mature over time, neural networks develop functional activity and
form cohesive networks in which electrical activity can be highly coordinated. Baseline 40-minute recordings
are made on days post-plating 13 or 15, followed by a secondary 40-minute recording after chemical exposure.
CCTE_Shafer_MEA_acute_synchrony_index is a component of the CCTE_Shafer_MEA_acute assay. It measures
the synchrony of the neural network as a unitless measure between 0 (low synchrony) to 1 (high synchrony)
during a recording in a microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with
the AxIS adaptive spike detector. The recordings before and after chemical treatment are used to calculate a
percent change in the synchrony index in each well. Data from the assay component
CCTE_Shafer_MEA_acute_synchrony_index was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Shafer_MEA_acute_synchrony_index, was analyzed in the negative analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of functional reporter, gain or loss-of-signal
activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: The synchrony index is a measure of the synchrony of a neural network. Changes in the synchrony
are indicative of changes in coordinated neural network activity.

-------
The CCTE_Shafer_MEA_acute assay is designed to investigate changes in neural network activity in response to
chemical exposure in rat cortical neurons using a microelectrode array (MEA) technology. This endpoint
measures the synchrony of activity across electrodes within a well.

2.2	Scientific Principles: Nervous system function is sensitive to chemical perturbation. The microelectrode arrays
(MEAs) offer an efficient approach to screen compounds for neuroactivity and distinguish between compound
effects on firing, bursting, and connectivity patterns.

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: Spontaneous activity in the cortical cultures was recorded using an Axion Biosystems Maestro
768 channel amplifier and Axion Integrated Studios (AxIS) vl.8 (or later) software. The amplifier recorded from
all channels simultaneously (gain = 1200 x; sampling rate = 12.5 kHz/channel): raw signals were filtered with a
Butterworth band-pass filter (300-5000 Hz), which filters out slower local field potentials leaving only fast
potentials, i.e., "spikes", arising from extracellular currents associated with action potentials (Pine 2006;
Wheeler and Nam 2011). On-line spike detection of filtered signals was conducted with the AxIS adaptive spike
detector, using a threshold of 8 x the root mean squared (rms) noise on each channel. Any electrodes with rms
noise levels greater than 5 nV were grounded (e.g., no data were recorded). Once grounded, an electrode was
grounded for the duration of the experiment. All recordings were conducted at 37 C. Wells were deemed usable
if on the day of the exposure > 10 electrodes were active (defined as > 5 spikes/min). On DIV 13 or 15, a
minimum of three wells from one cortical culture preparation were treated with each compound (0.03-40 nM).
Prior to recording baseline activity, each mwMEA plate was placed in the Maestro at 37 C and allowed to sit for
20 min while firing rates stabilized. Baseline activity (40 min) was recorded before the addition of each
compound. An additional 40 min of spontaneous activity was recorded in the presence of each compound.
Changes in network parameters relative to baseline were assessed following compound treatment.

Baseline median absolute deviation for the assay (bmad): 3.447
Response cutoff threshold used to determine hit calls: 10.34

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

3 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

2.6 Response: This endpoint measures a unitless measure of synchrony between 0 and 1 (Paiva et al 2010), where
values closer to 1 indicate higher synchrony.

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neuroactivity.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Raw recordings were re-played and analyzed with the AxIS 2.3 Neural Statistics Compiler.
(Operating Procedure for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 49: pval.neg.100 (Calculate positive control value (pval) as -
100 for endpoints in the down direction; pval = -100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp

-------
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 525	Number of chemicals tested: 508

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
208

Inactive hit count: Oihitc 0.9
317

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

11
14

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

30

214

quadratic-polynomialfpoly2) model: 85

exponential-2 (exp2) model:

29

-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

46

87

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-3.074

Neutral control median absolute deviation, by plate: nmad

2.525

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-59.78%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

-0.333
3.909

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	0.449

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 46.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2494

CCTE_Sh afer_M EA_d ev_fi ri ng_rate_mea n

1. General Information

1.1	Assay Title: CCTE's Neural Network Formation Assay for Mean Firing Rate, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_firing_rate_mean is 1 of 19 assay components of the
CCTE_Shafer_MEA_dev assay. It measures the frequency of spikes (action potential firings) as the total number
of spikes divided by the length of the recording in a microelectrode array (MEA) using Axion Biosystems Maestro
768 channel amplifier with the AxIS adaptive spike detector. The firing rate is averaged across the active
electrodes to report a well-mean value. To collapse across multiple recordings made on days post-plating 5, 7,
9 and 12, the trapezoidal area-under-the-curve of the component is calculated. Data from the assay component
CCTE_Shafer_MEA_dev_firing_rate_mean was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Shafer_MEA_dev_firing_rate_mean, was analyzed in the negative analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of functional reporter, gain or loss-of-signal
activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	DMSO: Dimethyl Sulfoxide

AOP: Adverse Outcome Pathway	ToxCast: US EPA's Toxicity Forecaster Program

CV: Coefficient of Variation	tcpl: ToxCast Data Analysis Pipeline R Package

-------
SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: Mean firing rate is a measure of general activity in the network. Changes in electrical activity are
indicative of effects on the spontaneous neural activity.

The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures the mean
firing rate across electrodes in the well.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.104 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 23.408
Response cutoff threshold used to determine hit calls: 70.225

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

2.6	Response: This endpoint measures the frequency of spikes (action potential firings) as the total number of
spikes divided by the length of the recording in a microelectrode array (MEA) using Axion Biosystems Maestro
768 channel amplifier with the AxIS adaptive spike detector. The firing rate is averaged across the active
electrodes to report a well-mean value. To collapse across multiple recordings made on days post-plating 5, 7,
9 and 12, the trapezoidal area-under-the-curve of the component is calculated. Mean firing rate is a measure of
general activity in the network. Changes in electrical activity are indicative of effects on the spontaneous neural
activity.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Spiking, bursting, and coordinated network activity were determined by analysis of each
recording on each DIV. Trapezoidal area under the curve (AUC) measurements were used to collapse data across
time and concentration and generate concentration-response curves for each parameter. (Operating Procedure
for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 579	Number of chemicals tested: 460

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	8.873

Neutral control median absolute deviation, by plate: nmad	2.142

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	23.58%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	3.248

Positive control well median absolute deviation, by plate: pmad	0.387

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.88

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	11.096

Negative control well median absolute deviation value, by plate: mmad	2.373

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.238

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 54.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is

-------
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2496

CCTE_Shafer_M EA_dev_bu rst_rate

1. General Information

1.1	Assay Title: CCTE's Neural Network Formation Assay for Burst Rate, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_burst_rate is 1 of 19 assay components of the
CCTE_Shafer_MEA_dev assay. It measures the frequency of single-electrode bursts (temporally-clustered
groups of action potential firing) as the total number of bursts divided by the length of the recording in a
microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector. The mean bursting rate is averaged across the active electrodes to report a well-mean value. To
collapse across multiple recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-under-the-
curve of the component is calculated. Data from the assay component CCTE_Shafer_MEA_dev_burst_rate was
analyzed into 1 assay endpoint. This assay endpoint, CCTE_Shafer_MEA_dev_burst_rate, was analyzed in the
negative analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of functional reporter, gain or loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	DMSO: Dimethyl Sulfoxide

AOP: Adverse Outcome Pathway	ToxCast: US EPA's Toxicity Forecaster Program

CV: Coefficient of Variation	tcpl: ToxCast Data Analysis Pipeline R Package

-------
SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: The mean bursting rate is a measure of general activity in the network. Changes in mean burst rate
are indicative of effects on the spontaneous neural activity.

The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures the burst rate
in the well.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.104 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 25.376
Response cutoff threshold used to determine hit calls: 76.127

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

2.6	Response: This endpoint measures the frequency of single-electrode bursts (temporally-clustered groups of
action potential firing) as the total number of bursts divided by the length of the recording in a microelectrode
array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike detector. The
mean bursting rate is averaged across the active electrodes to report a well-mean value. To collapse across
multiple recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-under-the-curve of the
component is calculated.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Spiking, bursting, and coordinated network activity were determined by analysis of each
recording on each DIV. Trapezoidal area under the curve (AUC) measurements were used to collapse data across
time and concentration and generate concentration-response curves for each parameter. (Operating Procedure
for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 579	Number of chemicals tested: 460

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	13.661

Neutral control median absolute deviation, by plate: nmad	3.734

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	24.37%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	2.828

Positive control well median absolute deviation, by plate: pmad	1.659

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.047

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	13.087

Negative control well median absolute deviation value, by plate: mmad	3.048

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	0.117

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 52.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is

-------
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2498

CCTE_Shafer_M EA_dev_active_electrodes_n u mber

1. General Information

1.1	Assay Title: CCTE's Neural Network Formation Assay for Number of Active Electrodes, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_active_electrodes_number is 1 of 19 assay components of the
CCTE_Shafer_MEA_dev assay. It measures the number of electrodes in a well with a mean firing rate of at least
5 spikes (action potential firings) per minute during a recording in a microelectrode array (MEA) using Axion
Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike detector. To collapse across multiple
recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-under-the-curve of the component is
calculated. Data from the assay component CCTE_Shafer_MEA_dev_active_electrodes_number was analyzed
into 1 assay endpoint. This assay endpoint, CCTE_Shafer_MEA_dev_active_electrodes_number, was analyzed
in the negative analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using
a type of functional reporter, gain or loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: The number of active electrodes is a measure of general activity in the network. Changes in the
number of active electrodes are indicative of effects on the spontaneous neural activity.

The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures the number
of active electrodes in the well

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

ASSAY DESIGN SUMMARY

-------
Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.104 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 8.825
Response cutoff threshold used to determine hit calls: 30

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

2.6	Response: The endpoint measures the number of electrodes in a well with a mean firing rate of at least 5 spikes
(action potential firings) per minute during a recording in a microelectrode array (MEA) using Axion Biosystems
Maestro 768 channel amplifier with the AxIS adaptive spike detector. To collapse across multiple recordings
made on days post-plating 5, 7, 9 and 12, the trapezoidal area-under-the-curve of the component is calculated.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Spiking, bursting, and coordinated network activity were determined by analysis of each
recording on each DIV. Trapezoidal area under the curve (AUC) measurements were used to collapse data across
time and concentration and generate concentration-response curves for each parameter. (Operating Procedure
for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 27: ow_bidirectional_loss
(Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis direction. Typically used
for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 579

Number of chemicals tested: 460

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
265

Inactive hit count: 0
-------
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	86.5

Neutral control median absolute deviation, by plate: nmad	7.413

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.1%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	41.25

Positive control well median absolute deviation, by plate: pmad	4.448

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-3.57

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	102.75

Negative control well median absolute deviation value, by plate: mmad	5.56

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.583

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 66.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2500

CCTE_Shafer_M EA_d ev_b u rsti ng_electrod es_n u mber

1. General Information

1.1	Assay Title: CCTE's Neural Network Formation Assay for Number of Bursting Electrodes, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_bursting_electrodes_number is 1 of 19 assay components of the
CCTE_Shafer_MEA_dev assay. It measures the number of electrodes in a well with a burst rate of at least 1 burst
(temporally-clustered group of action potential firing) every 2 minutes during a recording in a microelectrode
array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike detector. To
collapse across multiple recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-under-the-
curve of the component is calculated. Data from the assay component
CCTE_Shafer_MEA_dev_bursting_electrodes_number was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Shafer_MEA_dev_bursting_electrodes_number, was analyzed in the negative analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of functional reporter, gain or
loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	DMSO: Dimethyl Sulfoxide

AOP: Adverse Outcome Pathway	ToxCast: US EPA's Toxicity Forecaster Program

CV: Coefficient of Variation	tcpl: ToxCast Data Analysis Pipeline R Package

-------
SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: The number of actively bursting electrodes is a measure of general activity in a network. Changes
in the number of actively bursting electrodes are indicative of effects on the spontaneous neural activity.

The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures the number
of bursting electrodes in the well.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.104 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 13.563
Response cutoff threshold used to determine hit calls: 30

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

2.6	Response: This endpoint measures the number of electrodes in a well with a burst rate of at least 1 burst
(temporally-clustered group of action potential firing) every 2 minutes during a recording in a microelectrode
array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike detector. To
collapse across multiple recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-under-the-
curve of the component is calculated.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Spiking, bursting, and coordinated network activity were determined by analysis of each
recording on each DIV. Trapezoidal area under the curve (AUC) measurements were used to collapse data across
time and concentration and generate concentration-response curves for each parameter. (Operating Procedure
for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 27: ow_bidirectional_loss
(Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis direction. Typically used
for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 579	Number of chemicals tested: 460

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	64.5

Neutral control median absolute deviation, by plate: nmad	8.154

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.48%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	17.75

Positive control well median absolute deviation, by plate: pmad	7.413

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-3.057

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	80

Negative control well median absolute deviation value, by plate: mmad	10.749

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.236

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 79.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is

-------
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2502

CCTE_Shafer_MEA_dev_per_burst_interspike_interval

1. General Information

1.1	Assay Title: CCTE's Neural Network Formation Assay for Burst Interspike Interval, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_per_burst_interspike_interval is 1 of 19 assay components of the
CCTE_Shafer_MEA_dev assay. It measures the average time interval between spikes (action potential firings)
within a single-electrode burst (temporally-clustered group of spikes) during a recording in a microelectrode
array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike detector. The
mean inter-spike interval within bursts is averaged across the actively bursting electrodes to report a well-mean
value. To collapse across multiple recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-
under-the-curve of the component is calculated. Data from the assay component
CCTE_Shafer_MEA_dev_per_burst_interspike_interval was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Shafer_MEA_dev_per_burst_interspike_interval, was analyzed in the negative analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of functional reporter, gain or
loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023) 13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	CV: Coefficient of Variation

AOP: Adverse Outcome Pathway	DMSO: Dimethyl Sulfoxide

-------
ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package

SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: The inter-spike interval within bursts is a measure of bursting activity in the network. Changes in
the inter-spike interval within bursts are indicative of effects on the spontaneous neural activity.

The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures the average
interval between spikes within a burst in the well.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.104 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 22.823
Response cutoff threshold used to determine hit calls: 68.469

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

2.6	Response: This endpoint measures the average time interval between spikes (action potential firings) within a
single-electrode burst (temporally-clustered group of spikes) during a recording in a microelectrode array (MEA)
using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike detector. The mean inter-
spike interval within bursts is averaged across the actively bursting electrodes to report a well-mean value. To
collapse across multiple recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-under-the-
curve of the component is calculated.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Spiking, bursting, and coordinated network activity were determined by analysis of each
recording on each DIV. Trapezoidal area under the curve (AUC) measurements were used to collapse data across
time and concentration and generate concentration-response curves for each parameter. (Operating Procedure
for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 579	Number of chemicals tested: 460

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.314

Neutral control median absolute deviation, by plate: nmad	0.079

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	24.7%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	0.137

Positive control well median absolute deviation, by plate: pmad	0.052

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-3.593

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	0.39

Negative control well median absolute deviation value, by plate: mmad	0.107

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.083

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 81.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is

-------
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2504

CCTE_Shafer_MEA_dev_per_burst_spike_percent

1. General Information

1.1	Assay Title: CCTE's Neural Network Formation Assay for Percent of Spikes in a Burst, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_per_burst_spike_percent is 1 of 19 assay components of the
CCTE_Shafer_MEA_dev assay. It measures the percentage of spikes (action potential firings) in a single-
electrode burst (temporally-clustered group of spikes) as the total number spikes in bursts divided by the total
number of spikes during a recording in a microelectrode array (MEA) using Axion Biosystems Maestro 768
channel amplifier with the AxIS adaptive spike detector. The percent of spikes in bursts is averaged across the
actively bursting electrodes to report a well-mean value. To collapse across multiple recordings made on days
post-plating 5, 7, 9 and 12, the trapezoidal area-under-the-curve of the component is calculated. Data from the
assay component CCTE_Shafer_MEA_dev_per_burst_spike_percent was analyzed into 1 assay endpoint. This
assay endpoint, CCTE_Shafer_MEA_dev_per_burst_spike_percent, was analyzed in the negative analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of functional reporter,
gain or loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	CV: Coefficient of Variation

AOP: Adverse Outcome Pathway	DMSO: Dimethyl Sulfoxide

-------
ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package

SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: The percent of spikes in bursts is a measure of bursting activity in a network. Changes in the
percent of spikes in bursts are indicative of effects on the spontaneous neural activity.

The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures the percent
of spikes occurring in a burst.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.104 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 11.804
Response cutoff threshold used to determine hit calls: 35.412

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

2.6	Response: This endpoint measures the percentage of spikes (action potential firings) in a single-electrode burst
(temporally-clustered group of spikes) as the total number spikes in bursts divided by the total number of spikes
during a recording in a microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with
the AxIS adaptive spike detector. The percent of spikes in bursts is averaged across the actively bursting
electrodes to report a well-mean value. To collapse across multiple recordings made on days post-plating 5, 7,
9 and 12, the trapezoidal area-under-the-curve of the component is calculated.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Spiking, bursting, and coordinated network activity were determined by analysis of each
recording on each DIV. Trapezoidal area under the curve (AUC) measurements were used to collapse data across
time and concentration and generate concentration-response curves for each parameter. (Operating Procedure
for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 579	Number of chemicals tested: 460

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	380.305

Neutral control median absolute deviation, by plate: nmad	45.356

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	12.1%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	175.401

Positive control well median absolute deviation, by plate: pmad	36.448

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-2.077

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	393.315

Negative control well median absolute deviation value, by plate: mmad	34.559

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.098

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 82.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is

-------
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2506

CCTE_Shafer_M EA_dev_burst_d u ration_mean

1. General Information

1.1	Assay Title: CCTE's Neural Network Formation Assay for Mean Burst Duration, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_burst_duration_mean is 1 of 19 assay components of the
CCTE_Shafer_MEA_dev assay. It measures the average time interval from the first spike (action potential firing)
to the last spike in a singe-electrode burst (temporally-clustered group of spikes) during a recording in a
microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector. The mean burst duration is averaged across the actively bursting electrodes to report a well-mean
value. To collapse across multiple recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-
under-the-curve of the component is calculated. Data from the assay component
CCTE_Shafer_MEA_dev_burst_duration_mean was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Shafer_MEA_dev_burst_duration_mean, was analyzed in the negative analysis fitting direction relative
to DMSO as the negative control and baseline of activity. Using a type of functional reporter, gain or loss-of-
signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	CV: Coefficient of Variation

AOP: Adverse Outcome Pathway	DMSO: Dimethyl Sulfoxide

-------
ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package

SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: The mean burst duration is a measure of bursting activity in a network. Changes in the mean burst
duration are indicative of effects on the spontaneous neural activity.

The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures the mean
burst duration in the well.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.104 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 26.854
Response cutoff threshold used to determine hit calls: 80.563

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

2.6	Response: This endpoint measures the average time interval from the first spike (action potential firing) to the
last spike in a singe-electrode burst (temporally-clustered group of spikes) during a recording in a
microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector. The mean burst duration is averaged across the actively bursting electrodes to report a well-mean
value. To collapse across multiple recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-
under-the-curve of the component is calculated.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Spiking, bursting, and coordinated network activity were determined by analysis of each
recording on each DIV. Trapezoidal area under the curve (AUC) measurements were used to collapse data across
time and concentration and generate concentration-response curves for each parameter. (Operating Procedure
for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 30: osd_coff_bmr (Overwrite the osd value so that bmr == coff)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 579	Number of chemicals tested: 460

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	4.316

Neutral control median absolute deviation, by plate: nmad	1.014

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	27.3%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	2.588

Positive control well median absolute deviation, by plate: pmad	2.264

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.92

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	7.021

Negative control well median absolute deviation value, by plate: mmad	1.449

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.246

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 64.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is

-------
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2508

CCTE_Shafer_MEA_devJnterburstJnterval_mean

1. General Information

1.1	Assay Title: CCTE's Neural Network Formation Assay for Mean Interburst Interval, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_interburst_interval_mean is 1 of 19 assay components of the
CCTE_Shafer_MEA_dev assay. It measures the average time interval between consecutive single-electrode
bursts (temporally-clustered groups of action potential firing) during a recording in a microelectrode array
(MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike detector. The mean
inter-burst interval is averaged across the actively bursting electrodes to report a well-mean value. To collapse
across multiple recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-under-the-curve of
the component is calculated. Data from the assay component
CCTE_Shafer_MEA_dev_interburst_interval_mean was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Shafer_MEA_dev_interburst_interval_mean, was analyzed in the negative analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of functional reporter, gain or
loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	CV: Coefficient of Variation

AOP: Adverse Outcome Pathway	DMSO: Dimethyl Sulfoxide

-------
ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package

SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: The mean inter-burst interval is a measure of bursting activity in a network. Changes in the mean
inter-burst interval are indicative of effects on the spontaneous neural activity.

The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures the mean
interval between bursts.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.104 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 22.575
Response cutoff threshold used to determine hit calls: 67.725

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

2.6	Response: This endpoint measures the average time interval between consecutive single-electrode bursts
(temporally-clustered groups of action potential firing) during a recording in a microelectrode array (MEA) using
Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike detector. The mean inter-burst
interval is averaged across the actively bursting electrodes to report a well-mean value. To collapse across
multiple recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-under-the-curve of the
component is calculated.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Spiking, bursting, and coordinated network activity were determined by analysis of each
recording on each DIV. Trapezoidal area under the curve (AUC) measurements were used to collapse data across
time and concentration and generate concentration-response curves for each parameter. (Operating Procedure
for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 579	Number of chemicals tested: 460

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	223.502

Neutral control median absolute deviation, by plate: nmad	47.911

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	22.94%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	115.323

Positive control well median absolute deviation, by plate: pmad	21.&01

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-2.428

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	277.896

Negative control well median absolute deviation value, by plate: mmad	51.353

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.089

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 82.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is

-------
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2510

CCTE_Shafer_M EA_dev_network_spike_n u mber

1. General Information

1.1	Assay Title: CCTE's Neural Network Formation Assay for Number of Network Spikes, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_network_spike_number is 1 of 19 assay components of the
CCTE_Shafer_MEA_dev assay. It measures the number of network spikes (temporally-clustered groups of action
potential firing across multiple electrodes) during a recording in a microelectrode array (MEA) using Axion
Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike detector. To collapse across multiple
recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-under-the-curve of the component is
calculated. Data from the assay component CCTE_Shafer_MEA_dev_network_spike_number was analyzed into
1 assay endpoint. This assay endpoint, CCTE_Shafer_MEA_dev_network_spike_number, was analyzed in the
negative analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of functional reporter, gain or loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: The number of network spikes is a measure of the network connectivity. Changes in the number
of network spikes are indicative of changes in coordinated neural network activity.

The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures the number
of network spikes in the well.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

ASSAY DESIGN SUMMARY

-------
Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.104 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 25.468
Response cutoff threshold used to determine hit calls: 76.404

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

2.6	Response: This endpoint measures the number of network spikes (temporally-clustered groups of action
potential firing across multiple electrodes) during a recording in a microelectrode array (MEA) using Axion
Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike detector. To collapse across multiple
recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-under-the-curve of the component is
calculated.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Spiking, bursting, and coordinated network activity were determined by analysis of each
recording on each DIV. Trapezoidal area under the curve (AUC) measurements were used to collapse data across
time and concentration and generate concentration-response curves for each parameter. (Operating Procedure
for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 579

Number of chemicals tested: 460

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
139

Inactive hit count: 0
-------
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	121.25

Neutral control median absolute deviation, by plate: nmad	32.988

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	24.97%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	26.5

Positive control well median absolute deviation, by plate: pmad	19.274

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.486

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	73.25

Negative control well median absolute deviation value, by plate: mmad	16.309

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.3

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 73.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2512

CCTE_Shafer_MEA_dev_network_spike_peak

1. General Information

1.1	Assay Title: CCTE's Neural Network Formation Assay for Mean Number of Electrodes in Network Spike, Shafer
Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_network_spike_peak is 1 of 19 assay components of the
CCTE_Shafer_MEA_dev assay. It measures the average number of electrodes participating at the peak of a
network spike (the peak of a temporally-clustered group of action potential firing across multiple electrodes)
during a recording in a microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with
the AxIS adaptive spike detector. To collapse across multiple recordings made on days post-plating 5, 7, 9 and
12, the trapezoidal area-under-the-curve of the component is calculated. Data from the assay component
CCTE_Shafer_MEA_dev_network_spike_peak was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Shafer_MEA_dev_network_spike_peak, was analyzed in the negative analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of functional reporter, gain or loss-of-signal
activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	CV: Coefficient of Variation

AOP: Adverse Outcome Pathway	DMSO: Dimethyl Sulfoxide

-------
ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package

SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: The network spike peak is a measure of the network connectivity. Changes in the network spike
peak are indicative of changes in coordinated neural network activity.

The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures the mean
number of electrodes participating in network spikes in the well.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.104 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 8.452
Response cutoff threshold used to determine hit calls: 25.357

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

2.6	Response: This endpoint measures the average number of electrodes participating at the peak of a network
spike (the peak of a temporally-clustered group of action potential firing across multiple electrodes) during a
recording in a microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS
adaptive spike detector. To collapse across multiple recordings made on days post-plating 5, 7, 9 and 12, the
trapezoidal area-under-the-curve of the component is calculated.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Spiking, bursting, and coordinated network activity were determined by analysis of each
recording on each DIV. Trapezoidal area under the curve (AUC) measurements were used to collapse data across
time and concentration and generate concentration-response curves for each parameter. (Operating Procedure
for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 579	Number of chemicals tested: 460

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	76.609

Neutral control median absolute deviation, by plate: nmad	6.026

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	8.08%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	32

Positive control well median absolute deviation, by plate: pmad	4.183

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-5.053

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	96.91

Negative control well median absolute deviation value, by plate: mmad	11.214

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.666

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 91.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is

-------
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2514

CCTE_Shafer_M EA_dev_spi ke_du ration_mea n

1. General Information

1.1	Assay Title: CCTE's Neural Network Formation Assay for Mean Spike Duration, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_spike_duration_mean is 1 of 19 assay components of the
CCTE_Shafer_MEA_dev assay. It measures the average duration of network spikes (temporally-clustered groups
of action potential firing across multiple electrodes) during a recording in a microelectrode array (MEA) using
Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike detector. To collapse across
multiple recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-under-the-curve of the
component is calculated. Data from the assay component CCTE_Shafer_MEA_dev_spike_duration_mean was
analyzed into 1 assay endpoint. This assay endpoint, CCTE_Shafer_MEA_dev_spike_duration_mean, was
analyzed in the negative analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of functional reporter, gain or loss-of-signal activity can be used to understand electrical
activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	DMSO: Dimethyl Sulfoxide

AOP: Adverse Outcome Pathway	ToxCast: US EPA's Toxicity Forecaster Program

CV: Coefficient of Variation	tcpl: ToxCast Data Analysis Pipeline R Package

-------
SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: The mean network spike duration is a measure of the network connectivity. Changes in the mean
network spike duration are indicative of changes in coordinated neural network activity.

The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures the mean
spike duration in the well.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.104 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 14.86
Response cutoff threshold used to determine hit calls: 44.581

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

2.6	Response: This endpoint measures the average duration of network spikes (temporally-clustered groups of
action potential firing across multiple electrodes) during a recording in a microelectrode array (MEA) using Axion
Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike detector. To collapse across multiple
recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-under-the-curve of the component is
calculated.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Spiking, bursting, and coordinated network activity were determined by analysis of each
recording on each DIV. Trapezoidal area under the curve (AUC) measurements were used to collapse data across
time and concentration and generate concentration-response curves for each parameter. (Operating Procedure
for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 579	Number of chemicals tested: 460

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.358

Neutral control median absolute deviation, by plate: nmad	0.196

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	15.15%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	0.824

Positive control well median absolute deviation, by plate: pmad	0.171

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-2.788

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	3.243

Negative control well median absolute deviation value, by plate: mmad	0.736

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.117

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 76.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is

-------
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2516

CCTE_Shafer_MEA_dev_network_spike_duration_std

1. General Information

1.1	Assay Title: CCTE's Neural Network Formation Assay for Standard Deviation of Network Spike Duration, Shafer
Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_network_spike_duration_std is 1 of 19 assay components of the
CCTE_Shafer_MEA_dev assay. It measures the standard deviation of the duration of network spikes (temporally-
clustered groups of action potential firing across multiple electrodes) during a recording in a microelectrode
array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike detector. To
collapse across multiple recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-under-the-
curve of the component is calculated. Data from the assay component
CCTE_Shafer_MEA_dev_network_spike_duration_std was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Shafer_MEA_dev_network_spike_duration_std, was analyzed in the negative analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of functional reporter, gain or
loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	CV: Coefficient of Variation

AOP: Adverse Outcome Pathway	DMSO: Dimethyl Sulfoxide

-------
ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package

SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: The standard deviation of network spike duration is a measure of network connectivity. Changes
in the standard deviation of network spike duration are indicative of changes in coordinated neural network
activity.

The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures the standard
deviation of network spike duration in the well.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this

-------
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.104 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 27.656
Response cutoff threshold used to determine hit calls: 82.967

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

2.6	Response: This endpoint measures the standard deviation of the duration of network spikes (temporally-
clustered groups of action potential firing across multiple electrodes) during a recording in a microelectrode
array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike detector. To
collapse across multiple recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-under-the-
curve of the component is calculated.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Spiking, bursting, and coordinated network activity were determined by analysis of each
recording on each DIV. Trapezoidal area under the curve (AUC) measurements were used to collapse data across
time and concentration and generate concentration-response curves for each parameter. (Operating Procedure
for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical

-------
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 579

Number of chemicals tested: 460

Active hit count: hitc>0.9
112

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.403

Neutral control median absolute deviation, by plate: nmad	0.1

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	27.34%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	0.146

Positive control well median absolute deviation, by plate: pmad	0.107

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.157

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	0.5

Negative control well median absolute deviation value, by plate: mmad	0.264

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.428

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 64.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2518

CCTE_Shafer_M EA_dev_i nter_network_spi kej nterva l_mea n

1. General Information

1.1	Assay Title: CCTE's Neural Network Formation Assay for Mean Inter-network Spike Interval, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_inter_network_spike_interval_mean is 1 of 19 assay components
of the CCTE_Shafer_MEA_dev assay. It measures the average time interval between network spikes (temporally-
clustered groups of actional potential firing across multiple electrodes) as the average time between peaks of
consecutive network spikes during a recording in a microelectrode array (MEA) using Axion Biosystems Maestro
768 channel amplifier with the AxIS adaptive spike detector. To collapse across multiple recordings made on
days post-plating 5, 7,9 and 12, the trapezoidal area-under-the-curve of the component is calculated. Data from
the assay component CCTE_Shafer_MEA_dev_inter_network_spike_interval_mean was analyzed into 1 assay
endpoint. This assay endpoint, CCTE_Shafer_MEA_dev_inter_network_spike_interval_mean, was analyzed in
the negative analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a
type of functional reporter, gain or loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	DMSO: Dimethyl Sulfoxide

AOP: Adverse Outcome Pathway	ToxCast: US EPA's Toxicity Forecaster Program

CV: Coefficient of Variation	tcpl: ToxCast Data Analysis Pipeline R Package

-------
SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: The inter-network spike interval is a measure of the network connectivity. Changes in the inter-
network spike interval are indicative of changes in coordinated neural network activity.

The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures the inter-
network spike interval in the well.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.104 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 29.999
Response cutoff threshold used to determine hit calls: 89.996

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

2.6	Response: This endpoint measures the average time interval between network spikes (temporally-clustered
groups of actional potential firing across multiple electrodes) as the average time between peaks of consecutive
network spikes during a recording in a microelectrode array (MEA) using Axion Biosystems Maestro 768 channel
amplifier with the AxIS adaptive spike detector. To collapse across multiple recordings made on days post-
plating 5, 7, 9 and 12, the trapezoidal area-under-the-curve of the component is calculated.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Spiking, bursting, and coordinated network activity were determined by analysis of each
recording on each DIV. Trapezoidal area under the curve (AUC) measurements were used to collapse data across
time and concentration and generate concentration-response curves for each parameter. (Operating Procedure
for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 579	Number of chemicals tested: 460

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	323.877

Neutral control median absolute deviation, by plate: nmad	106.145

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	28.82%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	146.998

Positive control well median absolute deviation, by plate: pmad	111.272

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.067

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	796.529

Negative control well median absolute deviation value, by plate: mmad	285.097

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	0.043

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 69.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is

-------
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2520

CCTE_Shafer_MEA_dev_per_network_spike_spike_number_mean

1. General Information

1.1	Assay Title: CCTE's Neural Network Formation Assay for Mean Number of Spikes in a Network Spike, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_per_network_spike_spike_number_mean is 1 of 19 assay
components of the CCTE_Shafer_MEA_dev assay. It measures the average number of spikes (action potential
firings) in a network spike (temporally-clustered group of spikes across multiple electrodes) as the number of
spikes that occur within a 0.05 s time window at the peak of a network spike divided by the total number of
network spikes during a recording in a microelectrode array (MEA) using Axion Biosystems Maestro 768 channel
amplifier with the AxIS adaptive spike detector. To collapse across multiple recordings made on days post-
plating 5, 7, 9 and 12, the trapezoidal area-under-the-curve of the component is calculated. Data from the assay
component CCTE_Shafer_MEA_dev_per_network_spike_spike_number_mean was analyzed into 1 assay
endpoint. This assay endpoint, CCTE_Shafer_MEA_dev_per_network_spike_spike_number_mean, was
analyzed in the negative analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of functional reporter, gain or loss-of-signal activity can be used to understand electrical
activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	AOP: Adverse Outcome Pathway

-------
CV: Coefficient of Variation
DMSO: Dimethyl Sulfoxide
ToxCast: US EPA's Toxicity Forecaster Program

tcpl: ToxCast Data Analysis Pipeline R Package
SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: The mean number of spikes in network spikes is a measure of the network connectivity. Changes
in the mean number of network spikes are indicative of changes in coordinated neural network activity.

The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures the mean
number of spikes in a network spike in the well.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this

-------
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.104 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 17.889
Response cutoff threshold used to determine hit calls: 53.666

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

2.6	Response: This endpoint measures the average number of spikes (action potential firings) in a network spike
(temporally-clustered group of spikes across multiple electrodes) as the number of spikes that occur within a
0.05 s time window at the peak of a network spike divided by the total number of network spikes during a
recording in a microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS
adaptive spike detector. To collapse across multiple recordings made on days post-plating 5, 7, 9 and 12, the
trapezoidal area-under-the-curve of the component is calculated.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Spiking, bursting, and coordinated network activity were determined by analysis of each
recording on each DIV. Trapezoidal area under the curve (AUC) measurements were used to collapse data across
time and concentration and generate concentration-response curves for each parameter. (Operating Procedure
for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical

-------
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 579

Number of chemicals tested: 460

Active hit count: hitc>0.9
224

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	344.931

Neutral control median absolute deviation, by plate: nmad	58.201

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	17.45%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	317.812

Positive control well median absolute deviation, by plate: pmad	53.772

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.614

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	627.611

Negative control well median absolute deviation value, by plate: mmad	122.581

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	0.074

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 70.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2522

CCTE_Shafer_MEA_dev_per_network_spike_spike_percent

1. General Information

1.1	Assay Title: CCTE's Neural Network Formation Assay for Percent of Spikes in a Network Spike, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_per_network_spike_spike_percent is 1 of 19 assay components of
the CCTE_Shafer_MEA_dev assay. It measures the percent of spikes (action potential firings) in a network spike
(temporally-clustered group of spikes across multiple electrodes) as the number of spikes that occur within a
0.05 s time window at the peak of a network spike divided by the total number of spikes during a recording in a
microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector. To collapse across multiple recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-
under-the-curve of the component is calculated. Data from the assay component
CCTE_Shafer_MEA_dev_per_network_spike_spike_percent was analyzed into 1 assay endpoint. This assay
endpoint, CCTE_Shafer_MEA_dev_per_network_spike_spike_percent, was analyzed in the negative analysis
fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of functional
reporter, gain or loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	CV: Coefficient of Variation

AOP: Adverse Outcome Pathway	DMSO: Dimethyl Sulfoxide

-------
ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package

SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: The percent of spikes in network spikes is a measure of the network connectivity. Changes in the
percent of spikes in network spikes are indicative of changes in coordinated neural network activity.

The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures the percent
of spikes occurring in a network spike (vs those spikes that are not part of the networks spike) in the well.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.104 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 19.231
Response cutoff threshold used to determine hit calls: 57.692

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

2.6	Response: This endpoint measures the percent of spikes (action potential firings) in a network spike
(temporally-clustered group of spikes across multiple electrodes) as the number of spikes that occur within a
0.05 s time window at the peak of a network spike divided by the total number of spikes during a recording in a
microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector. To collapse across multiple recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-
under-the-curve of the component is calculated.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Spiking, bursting, and coordinated network activity were determined by analysis of each
recording on each DIV. Trapezoidal area under the curve (AUC) measurements were used to collapse data across
time and concentration and generate concentration-response curves for each parameter. (Operating Procedure
for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 579	Number of chemicals tested: 460

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	59.583

Neutral control median absolute deviation, by plate: nmad	10.038

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	19.16%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	13.215

Positive control well median absolute deviation, by plate: pmad	10.773

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.333

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	29.18

Negative control well median absolute deviation value, by plate: mmad	4.208

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	0.154

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 77.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is

-------
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2524

CCTE_Shafer_MEA_dev_correlation_coefficient_mean

1. General Information

1.1	Assay Title: CCTE's Neural Network Formation Assay for Mean Correlation Coefficient, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_correlation_coefficient_mean is 1 of 19 assay components of the
CCTE_Shafer_MEA_dev assay. It measures the correlation coefficent of spikes (action potential firings) between
every pair of electrodes in a well during a recording in a microelectrode array (MEA) using Axion Biosystems
Maestro 768 channel amplifier with the AxIS adaptive spike detector. The pairwise correlations are averaged
across active electrodes to report a well-mean value. To collapse across multiple recordings made on days post-
plating 5, 7, 9 and 12, the trapezoidal area-under-the-curve of the component is calculated. Data from the assay
component CCTE_Shafer_MEA_dev_correlation_coefficient_mean was analyzed into 1 assay endpoint. This
assay endpoint, CCTE_Shafer_MEA_dev_correlation_coefficient_mean, was analyzed in the negative analysis
fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of functional
reporter, gain or loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	DMSO: Dimethyl Sulfoxide

AOP: Adverse Outcome Pathway	ToxCast: US EPA's Toxicity Forecaster Program

CV: Coefficient of Variation	tcpl: ToxCast Data Analysis Pipeline R Package

-------
SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: Electrical activity is captured from neurons cultured over electrodes. Increases in correlation
coefficient indicate that the activity of individual neurons in the network is more synchronous, reflecting
increased coordination of activity.

The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures the mean
correlation coefficient across electrodes in the well.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.104 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 16.666
Response cutoff threshold used to determine hit calls: 49.999

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

2.6	Response: This endpoint measures the correlation coefficent of spikes (action potential firings) between every
pair of electrodes in a well during a recording in a microelectrode array (MEA) using Axion Biosystems Maestro
768 channel amplifier with the AxIS adaptive spike detector. The pairwise correlations are averaged across active
electrodes to report a well-mean value. To collapse across multiple recordings made on days post-plating 5, 7,
9 and 12, the trapezoidal area-under-the-curve of the component is calculated.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Spiking, bursting, and coordinated network activity were determined by analysis of each
recording on each DIV. Trapezoidal area under the curve (AUC) measurements were used to collapse data across
time and concentration and generate concentration-response curves for each parameter. (Operating Procedure
for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest

-------
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 579

Active hit count: hitc>0.9
229

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.327

Neutral control median absolute deviation, by plate: nmad	0.181

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	16.5%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	0.724

Positive control well median absolute deviation, by plate: pmad	0.239

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.501

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	1.318

Negative control well median absolute deviation value, by plate: mmad	0.16

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.206

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 75.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2526

CCTE_Shafer_MEA_dev_mutualJnformation_norm

1. General Information

1.1	Assay Title: CCTE's Neural Network Formation Assay for Mutual Information Normalization, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_mutual_information_norm is 1 of 19 assay components of the
CCTE_Shafer_MEA_dev assay. It is a measure of shared information between electrodes during a recording in a
microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector. To collapse across multiple recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-
under-the-curve of the component is calculated. Data from the assay component
CCTE_Shafer_MEA_dev_mutual_information_norm was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Shafer_MEA_dev_mutual_information_norm, was analyzed in the negative analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of functional reporter, gain or
loss-of-signal activity can be used to understand electrical activity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023) 13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: Normalized Mutual Information is a normalized measure of complexity and synchrony in a network
that is robust to changes in network size. It is a scalar (rather than pairwise) measure of mutual information in
a multivariate network (See Ball et al., Neural Networks. 2017. 95, 29-43 for further information).

The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures the
normalized mutual information across electrodes in the well.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.104 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 23.313
Response cutoff threshold used to determine hit calls: 69.938

Detection technology used: Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector (microelectrode array (MEA))

2.6	Response: This endpoint aggregates shared information between electrodes during a recording in a
microelectrode array (MEA) using Axion Biosystems Maestro 768 channel amplifier with the AxIS adaptive spike
detector. To collapse across multiple recordings made on days post-plating 5, 7, 9 and 12, the trapezoidal area-
under-the-curve of the component is calculated.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Spiking, bursting, and coordinated network activity were determined by analysis of each
recording on each DIV. Trapezoidal area under the curve (AUC) measurements were used to collapse data across
time and concentration and generate concentration-response curves for each parameter. (Operating Procedure
for Axion Maestro Pro Microelectrode Array (MEA) system: l-BCTD-RADB-SOP-5037-0)

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 579

Number of chemicals tested: 460

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
197

Inactive hit count: 0
-------
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.037

Neutral control median absolute deviation, by plate: nmad	0.007

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	21.83%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	0.022

Positive control well median absolute deviation, by plate: pmad	0.003

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.887

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	0.049

Negative control well median absolute deviation value, by plate: mmad	0.007

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	0.129

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 74.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2529

CCTE_Shafer_MEA_dev_LDH

1. General Information

1.1	Assay Title: CCTE's Lactate Dehydrogenase Assay for Cytotoxicity in the Neural Network Formation Assay, Shafer
Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_LDH is 1 of 19 assay components of the CCTE_Shafer_MEA_dev
assay. It measures the lactate dehydrogenase enzyme activity using spectrophotometry. Data from the assay
component CCTE_Shafer_MEA_dev_LDH was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Shafer_MEA_dev_LDH, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of viability reporter, loss-of-signal activity can be used to
understand cell viability.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Changes in enzymatic activity (of lactate dehydrogenase) are indicative of compromised cell
health. Reductions in the total LDH (in cells) indicates cell loss or death.

The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures cell viability
by the amount of extracellular LDH (lactate dehydrogenase).

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:

Target (nominal) number of replicates:

-------
7

Standard minimum concentration tested:

0.104 nM
Key positive control:

NA

3

Standard maximum concentration tested:

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 8.068
Response cutoff threshold used to determine hit calls: 24.203
Detection technology used: enzyme activity (spectrophotometry)

2.6	Response: This endpoint measures the lactate dehydrogenase enzyme activity using spectrophotometry.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Absorbance was determined in a Molecular Devices VersaMax plate reader at 490 nm.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

-------
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 573

Number of chemicals tested: 460

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
244

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.906

Neutral control median absolute deviation, by plate: nmad	0.072

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	8.29%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	0.352

Positive control well median absolute deviation, by plate: pmad	0.166

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-2.528

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	0.975

Negative control well median absolute deviation value, by plate: mmad	0.065

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.072

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 60.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2530

CCTE_Shafer_MEA_dev_AB

1.	General Information

1.1	Assay Title: CCTE's Alamar Blue Assay for Cytotoxicity in the Neural Network Formation Assay, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_dev assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_dev assay uses primary cultures of
rat cortical neurons. Recordings are made on days post-plating 5, 7, 9 and 12. As these cultures mature over
time, neural networks develop functional activity and form cohesive networks in which electrical activity can be
highly coordinated. CCTE_Shafer_MEA_dev_AB is 1 of 19 assay components of the CCTE_Shafer_MEA_dev
assay. It measures the resazurin reduction using fluorescence. Data from the assay component
CCTE_Shafer_MEA_dev_AB was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Shafer_MEA_dev_AB, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of viability reporter, loss-of-signal activity can be used to
understand cell viability.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers. Considered part of the developmental
neurotoxicity in vitro battery. See the OECD Initial Recommendations on Evaluation of Data from the
Developmental Neurotoxicity (DNT) In Vitro Testing Battery document:
https://one.oecd. org/document/ENV/CBC/MONO(2023)13/en/pdf.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system. Recordings are made on days post-
plating 5, 7, 9 and 12, as these cultures mature over time, and neural networks develop functional activity while
forming cohesive networks in which electrical activity can be highly coordinated.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in enzymatic activity (decrease in mitochondrial enzyme, alamar blue reduction) are
indicative of compromised cellular metabolism, possibly indicating cell death.

-------
The CCTE_Shafer_MEA_dev assay is designed to investigate changes in neural network formation in response
to chemical exposure in developing rat cortical neurons using a microelectrode array (MEA) technology. Neural
network formation is one of several key processes of neurodevelopment. This endpoint measures cell viability
assessed by the Alamar blue assay (mitochondrial reductase activity).

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is formation of properly connected networks of neurons. Disruption of network formation can give
rise to developmental neurotoxicity. Cortical cells are plated at a high density in the center of the well where
the microelectrodes are located. At early days in vitro neurons in the culture spontaneously extend neurites
(neurite initiation), which become axons and dendrites (polarization) after a few days. Finally, synaptic
connections are made between days in vitro (DIV) 7 and 15. Neurons become electrically active over the same
time-frame. Electrical activity begins as unorganized, random events first observed on DIV 3-5, and increases
thereafter and becomes progressively more organized into bursts on individual electrodes and organized
(synchronous) bursts across all of the electrodes in a single well. Typically, the cultures show an ontogeny of
activity that occurs rapidly in the first 12-14 days in vitro (DIV) and then becomes more stable in terms of the
network activity thereafter (Cotterill et al., 2016).

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: On the day prior to culture, each well was coated thoroughly with 150 uL of 0.05%
polyethyleneimine (PEI, Sigma Cat P3143) in 50 mM HEPES (Sigma Cat H7523) at a pH of 8. The plate was
incubated at 37C for one hour. PEI was rinsed out with 500 uL of sterile water three times. Plates were stored
at 4C until the day of culture. This culture was plated at a seeding density of 1.5 x 105 cells/well on a 48-well
MEA plate, prepared as described above. Cells were administered via a 25 uL media/laminin (20 ng/ml; Sigma
Cat L2020-1MG) drop directly onto the microelectrode array, as adding the cells with the laminin results in
better attachment than pre-coating with laminin. After 2 hours, an additional 475 uL of cortical media was added
and the cells returned to the incubator. Exposure starts at day 0 of plating and is continued over twelve days of
network development until the experiment is terminated. Cells are fed with fresh medium on DIV 5 and 9,
following the recordings on those days. The entire volume of media in the well is exchanged and any vehicle or
compound treatment is renewed. Complete details are found in OP-NHEERL/ISTD/SBB/TJS/2015-03-r0
(available on request, email shafer.tim@epa.gov). Measurement of the electrical activity is non-invasive, and
recordings can be made from the same plate over multiple different days of the culture. In cortical neurons, this
allows for the measurement of extracellular action potentials over the period during which networks form in
the MEA plate.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

-------
0.104 nM
Key positive control:
NA

3000 nM
Neutral vehicle control:

DMSO, water, or ethanol

Baseline median absolute deviation for the assay (bmad): 6.4
Response cutoff threshold used to determine hit calls: 19.201

Detection technology used: resazurin reduction: alamar blue dye converted into fluorescent end product
(fluorescence)

2.6	Response: This endpoint measures the resazurin reduction using fluorescence.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Fluorescence was measured in a Fluorostar Optima fluorimeter using an excitation wavelength
of 544 nm and an emission wavelength of 590 nm.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

-------
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 579

Number of chemicals tested: 460

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
212

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	23503.667

Neutral control median absolute deviation, by plate: nmad	1454.431

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	6.52%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	11342.167

Positive control well median absolute deviation, by plate: pmad	2756.153

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-5.515

((pmed - nmed) / sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	32529.333

Negative control well median absolute deviation value, by plate: mmad	1888.091

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	0.473

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 69.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Strickland J D, Martin MT, Richard AM, Houck KA, ShaferTJ. Screening the ToxCast phase II libraries
for alterations in network function using cortical neurons grown on multi-well microelectrode array (mwMEA)
plates. Arch Toxicol. 2018 Jan;92(l):487-500. doi: 10.1007/s00204-017-2035-5. Epub 2017 Aug 2. PMID:
28766123; PMCID: PMC6438628., Kosnik MB, Strickland JD, Marvel SW, Wallis DJ, Wallace K, Richard AM, Reif
DM, ShaferTJ. Concentration-response evaluation of ToxCast compounds for multivariate activity patterns of
neural network function. Arch Toxicol. 2020 Feb;94(2):469-484. doi: 10.1007/s00204-019-02636-x. Epub 2019
Dec 10. PMID: 31822930; PMCID: PMC7371233., Valdivia P, Martin M, LeFew WR, Ross J, Houck KA, Shafer TJ.
Multi-well microelectrode array recordings detect neuroactivity of ToxCast compounds. Neurotoxicology. 2014
Sep;44:204-17. doi: 10.1016/j.neuro.2014.06.012. Epub 2014 Jul 2. PMID: 24997244., ShaferTJ, Brown JP, Lynch
B, Davila-Montero S, Wallace K, Friedman KP. Evaluation of Chemical Effects on Network Formation in Cortical
Neurons Grown on Microelectrode Arrays. Toxicol Sci. 2019 Jun l;169(2):436-455. doi: 10.1093/toxsci/kfz052.
PMID: 30816951.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2540

CCTE_Shafer_MEA_acute_LDH

1.	General Information

1.1	Assay Title: CCTE's Lactate Dehydrogenase Assay for Cytotoxicity in the Neural Acute Assay, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_acute assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_acute assay uses primary cultures
of rat cortical neurons. As these cultures mature over time, neural networks develop functional activity and
form cohesive networks in which electrical activity can be highly coordinated. Baseline 40-minute recordings
are made on days post-plating 13 or 15, followed by a secondary 40-minute recording after chemical exposure.
CCTE_Shafer_MEA_acute_LDH is a component of the CCTE_Shafer_MEA_acute assay. It measures the lactate
dehydrogenase enzyme activity using spectrophotometry. Data from the assay component
CCTE_Shafer_MEA_acute_LDH was analyzed into 1 assay endpoint. This assay endpoint,
CCTE_Shafer_MEA_acute_LDH, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of viability reporter, gain-of-signal activity can be used to
understand cell viability.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in enzymatic activity (increase in lactate dehydrogenase) are indicative of compromised
cell health. Increases in extracellular LDH indicate cell loss or death.

The CCTE_Shafer_MEA_acute assay is designed to investigate changes in neural network activity in response to
chemical exposure in rat cortical neurons using a microelectrode array (MEA) technology. This endpoint
measures cell viability by the amount of extracellular LDH (lactate dehydrogenase).

-------
2.2

Scientific Principles: Nervous system function is sensitive to chemical perturbation. The microelectrode arrays
(MEAs) offer an efficient approach to screen compounds for neuroactivity and distinguish between compound
effects on firing, bursting, and connectivity patterns.

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: Immediately following the 40 min recording in the presence of compounds, 50 uL of media
was removed from each well and transferred to a sterile 96-well plate. This was used to determine LDH released
from cells during compound exposure using a kit from Promega where absorbance was determined in a
Molecular Devices VersaMax plate reader at 490 nm.

Baseline median absolute deviation for the assay (bmad): 1.769
Response cutoff threshold used to determine hit calls: 5.306
Detection technology used: enzyme activity (spectrophotometry)

2.6	Response: This endpoint measures the lactate dehydrogenase enzyme activity using spectrophotometry.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:
1HM

Key positive control:

TritonxlOO

I arget (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

NA

3.

Additionally, this assay was annotated to the intended target family of cell cycle.
Data Interpretation

-------
The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Absorbance was determined in a Molecular Devices VersaMax plate reader at 490 nm.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 13:
pval.apid.pwlls.med (Calculate the positive control value (pval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) for single-concentration gain-of-signal positive control wells (wilt
= p).), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by
assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration
index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where

-------
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 525	Number of chemicals tested: 508

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: Oihitc 0.9	NA hit count: hitc^O

1	405	113

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	7

gain-loss (gnls) model:	17

power(pow) model:	24

linear-polynomial (polyl) model:	205

quadratic-polynomial(poly2) model:	35

exponential-2 (exp2) model:	3

exponential-3 (exp3) model:	1

exponential-4 (exp4) model:	197

exponential-5 (exp5) model:	30

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

-------
3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	8.198

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

0.087
0.031
34.12%

1.798
0.197

NA

Positive control signal-to-background: (pmed/nmed)

NA

-------
NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 30.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

-------
6. Bibliography: NA

7. Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2541

CCTE_Shafer_M EA_acute_AB

1.	General Information

1.1	Assay Title: CCTE's Alamar Blue Assay for Cytotoxicity in the Neural Acute Assay, Shafer Lab

1.2	Assay Summary: CCTE_Shafer_MEA_acute assay is conducted using Axion Biosystems 48 well microelectrode
array (MEA) plates and Maestro recording system. Each well of the MEA plate contains a grid of 16
microelectrodes embedded in the culture surface. Electrically active cells, such as neurons, can be cultured over
the electrodes, and electrical activity in these cells can be recorded extracellularly. The spontaneous firing of
neurons is captured from each electrode on a microsecond timescale providing both temporally and spatially
precise data. Because the recordings do not impact the health of the cells, multiple recordings can be made
from the same neural network over time in culture. The CCTE_Shafer_MEA_acute assay uses primary cultures
of rat cortical neurons. As these cultures mature over time, neural networks develop functional activity and
form cohesive networks in which electrical activity can be highly coordinated. Baseline 40-minute recordings
are made on days post-plating 13 or 15, followed by a secondary 40-minute recording after chemical exposure.
CCTE_Shafer_MEA_acute_AB is a component of the CCTE_Shafer_MEA_acute assay. It measures the resazurin
reduction using fluorescence. Data from the assay component CCTE_Shafer_MEA_acute_AB was analyzed into
1 assay endpoint. This assay endpoint, CCTE_Shafer_MEA_acute_AB, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of viability reporter,
loss-of-signal activity can be used to understand cell viability.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Shafer lab at the EPA Center for Computational Toxicology and Exposure focus on
developmental neurotoxicity (DNT) hazard identificaton using microelectrode array (MEA) assays.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Multi-well microelectrode array plates and assay readout detection technology and AxIS
software are commercially available from Axion Biosystems Inc. (Atlanta Ga). MEA systems are also available
from other manufacturers. Cell viability assays utilized are commercially available from Promega (Madison, Wl).
Cell viability assays are also available from other manufacturers.

1.9	Assay Throughput: 48-well plate. The assay uses primary cultures of rat cortical neurons, the Axion Biosystems
48 well microelectrode array (MEA) plates, and Maestro recording system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in enzymatic activity (decrease in mitochondrial enzyme, alamar blue reduction) are
indicative of compromised cellular metabolism, possibly indicating cell death.

The CCTE_Shafer_MEA_acute assay is designed to investigate changes in neural network activity in response to
chemical exposure in rat cortical neurons using a microelectrode array (MEA) technology. This endpoint
measures cell viability assessed by the Alamar blue assay (mitochondrial reductase activity).

-------
2.2

Scientific Principles: Nervous system function is sensitive to chemical perturbation. The microelectrode arrays
(MEAs) offer an efficient approach to screen compounds for neuroactivity and distinguish between compound
effects on firing, bursting, and connectivity patterns.

2.3	Experimental System: adherent primary cortical cell culture primary cell used. Primary rodent cortical cell
cultures are prepared on site from the neocortex dissected from the central nervous system of newborn rats
using a standardized protocol. In a typical culture, cells are isolated from the combined cortices of 3-5 pups,
plated onto multiwell microelectrode array plates and allowed 2 hrs to attach. The cells are maintained in a
humidified incubator at 37C and 5% C02. Sex of pups is not determined and cultures are presumed to consist
of a mixture of male and female pups since multiple pups are used for each culture. The primary culture model
consists of glutamatergic (excitatory) neurons, GABAergic (inhibitory) neuron, astrocytes and sparse microglia
(Harrill et al., 2011; Frank et al., 2017).

2.4	Metabolic Competence: The metabolic capacity of the primary cortical cultures has not been extensively
studied. mRNA expression of various Cyp enzymes is low on DIV 1, however, by DIV 14, mRNA for Cyp 211c »
4x1 > 2d4 > lsl > lal. Functional expression of these proteins has not been confirmed (Shafer et al., 2015).

2.5	Exposure Regime: Metabolic activity was determined using the CellTiter Blue (CTB) assay (Promega Cat. #8081),
following the manufacturer's instructions with the following modifications. Following the removal of the 50 uL
sample for the LDH assay, 450 uL of media was removed from each well and replaced with 200 uL of fresh media
containing a 1:6 dilution of CTB reagent for the determination of metabolic activity. Fluorescence was measured
in a Fluorostar Optima fluorimeter using an excitation wavelength of 544 nm and an emission wavelength of
590 nm.

Baseline median absolute deviation for the assay (bmad): 13.376
Response cutoff threshold used to determine hit calls: 40.127

Detection technology used: resazurin reduction: alamar blue dye converted into fluorescent end product
(fluorescence)

2.6	Response: This endpoint measures the resazurin reduction using fluorescence.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:
1HM

Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO or water

NA

-------
Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Fluorescence was measured in a Fluorostar Optima fluorimeter using an excitation wavelength
of 544 nm and an emission wavelength of 590 nm.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0; pval
= 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with

-------
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 519	Number of chemicals tested: 503

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
15

Inactive hit count: Oihitc 0.9
338

WINING MODEL SELECTION

NA hit count: hitc^O
166

Number of sample-assay endpoints with winning hill model:

1
8

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

27

254

quadratic-polynomial(poly2) model: 31

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

3

9

180

-------
For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

28413.667

Neutral control median absolute deviation, by plate: nmad

2868.831

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

9.87%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

940.667

235.486

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

NA

-------
Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-9.401

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 9.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

• Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

-------
•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 891

CEET0X_H295R_11DC0RT

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for 11-Deoxycortisol (11DCORT)

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate.
CEET0X_H295R_11DC0RT is one of 23 assay component(s) measured or calculated from the CEETOX_H295R
assay. It is designed to make measurements of hormone induction, a form of inducible reporter, as detected
with absorbance signals by HPLC-MS-MS technology. Data from the assay component
CEET0X_H295R_11DC0RT was analyzed into 1 assay endpoint. This assay endpoint, CEET0X_H295R_11DC0RT,
was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a
type of inducible reporter, gain or loss-of-signal activity using HPLC-MS-MS was used to understand synthesis of
11-Deoxycortisol in H295R cell line at 48hr of chemical exposure. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the steroid hormone intended target family, where the
subfamily is glucocorticoids.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in

-------
adverse reproductive and developmental effects. TheToxCast HTS program adapted the OECD-validated H295R
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four 10|aM forskolin replicates to
control for hormone stimulation, four 3|jM prochloraz replicates to control for hormone inhibition and 4

-------
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.099
Response cutoff threshold used to determine hit calls: 0.595
Detection technology used: HPLC-MS-MS (Spectrophotometry)

2.6	Response: Decreased or increased production of 11-deoxycortisol following interference with steroidogenesis
was quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data were log2-transformed and analyzed as a fold-change compared to DMSO control wells as
baseline response. All statistical analyses were conducted using R programming language, employing tcpl
package to generate model parameters and confidence intervals.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412666667 nM
Key positive control:

Prochloraz;Forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100.3 nM
Neutral vehicle control:

DMSO

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 588	Number of chemicals tested: 576

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	264.78

Neutral control median absolute deviation, by plate: nmad	8.781

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	3.7%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	628.142

Positive control well median absolute deviation, by plate: pmad	18.733

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	15.674

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	16.05

Negative control well median absolute deviation value, by plate: mmad	1.027

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-24.839

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 62.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 893

CEETOX_H295R_OHPREG

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for OH-Pregnenolone (OHPREG)

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate.
CEETOX_H295R_OHPREG is one of 23 assay component(s) measured or calculated from the CEETOX_H295R
assay. It is designed to make measurements of hormone induction, a form of inducible reporter, as detected
with absorbance signals by HPLC-MS-MS technology. Data from the assay component CEETOX_H295R_OHPREG
was analyzed into 1 assay endpoint. This assay endpoint, CEETOX_H295R_OHPREG, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, gain or loss-of-signal activity using HPLC-MS-MS was used to understand synthesis of 17alpha-
hydroxypregnenolone in H295Rcell line at48hr of chemical exposure. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the steroid hormone intended target family, where the
subfamily is progestagens.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in

-------
adverse reproductive and developmental effects. TheToxCast HTS program adapted the OECD-validated H295R
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four 10|aM forskolin replicates to
control for hormone stimulation, four 3|jM prochloraz replicates to control for hormone inhibition and 4

-------
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.129
Response cutoff threshold used to determine hit calls: 0.771
Detection technology used: HPLC-MS-MS (Spectrophotometry)

2.6	Response: Decreased or increased production of OH-pregnenolone following interference with steroidogenesis
was quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data were log2-transformed and analyzed as a fold-change compared to DMSO control wells as
baseline response. All statistical analyses were conducted using R programming language, employing tcpl
package to generate model parameters and confidence intervals.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412666667 nM
Key positive control:

Prochloraz;Forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100.3 nM
Neutral vehicle control:

DMSO

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 588	Number of chemicals tested: 576

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	7.468

Neutral control median absolute deviation, by plate: nmad	0.426

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.95%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	30.17

Positive control well median absolute deviation, by plate: pmad	1.023

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	17.568

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	3.536

Negative control well median absolute deviation value, by plate: mmad	0

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-8.63

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 84.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 895

CEETOX_H295R_OHPROG

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for OH-Progesterone (OHPROG)

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate.
CEETOX_H295R_OHPROG is one of 23 assay component(s) measured or calculated from the CEETOX_H295R
assay. It is designed to make measurements of hormone induction, a form of inducible reporter, as detected
with absorbance signals by HPLC-MS-MS technology. Data from the assay component CEETOX_H295R_OHPROG
was analyzed into 1 assay endpoint. This assay endpoint, CEETOX_H295R_OHPROG, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, gain or loss-of-signal activity using HPLC-MS-MS was used to understand synthesis of 17alpha-
hydroxyprogesterone in H295R cell line at 48hr of chemical exposure. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the steroid hormone intended target family, where the
subfamily is progestagens.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in

-------
adverse reproductive and developmental effects. TheToxCast HTS program adapted the OECD-validated H295R
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four 10|aM forskolin replicates to
control for hormone stimulation, four 3|jM prochloraz replicates to control for hormone inhibition and 4

-------
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.151
Response cutoff threshold used to determine hit calls: 0.905
Detection technology used: HPLC-MS-MS (Spectrophotometry)

2.6	Response: Decreased or increased production of OH-progesterone following interference with steroidogenesis
was quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data were log2-transformed and analyzed as a fold-change compared to DMSO control wells as
baseline response. All statistical analyses were conducted using R programming language, employing tcpl
package to generate model parameters and confidence intervals.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412666667 nM
Key positive control:

Prochloraz;Forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100.3 nM
Neutral vehicle control:

DMSO

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 588	Number of chemicals tested: 576

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	10.165

Neutral control median absolute deviation, by plate: nmad	0.482

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.36%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	24.418

Positive control well median absolute deviation, by plate: pmad	0.953

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	11.398

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	9.095

Negative control well median absolute deviation value, by plate: mmad	0.478

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-0.824

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 72.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 897

CEETOX_H295R_ANDR

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for Androstenedione (ANDR)

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate.
CEETOX_H295R_ANDR is one of 23 assay component(s) measured or calculated from the CEETOX_H295R assay.
It is designed to make measurements of hormone induction, a form of inducible reporter, as detected with
absorbance signals by HPLC-MS-MS technology. Data from the assay component CEETOX_H295R_ANDR was
analyzed into 1 assay endpoint. This assay endpoint, CEETOX_H295R_ANDR, was analyzed with bidirectional
fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain
or loss-of-signal activity using HPLC-MS-MS was used to understand synthesis of Androstenedione in H295R cell
line at 48hr of chemical exposure. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the steroid hormone intended target family, where the subfamily is androgens.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in
adverse reproductive and developmental effects. The ToxCast HTS program adapted the OECD-validated H295R

-------
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four 10|aM forskolin replicates to
control for hormone stimulation, four 3|jM prochloraz replicates to control for hormone inhibition and 4
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish

-------
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.126
Response cutoff threshold used to determine hit calls: 0.754
Detection technology used: HPLC-MS-MS (Spectrophotometry)

2.6	Response: Decreased or increased production of androstenedione following interference with steroidogenesis
was quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data were log2-transformed and analyzed as a fold-change compared to DMSO control wells as
baseline response. All statistical analyses were conducted using R programming language, employing tcpl
package to generate model parameters and confidence intervals.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412666667 nM
Key positive control:

Prochloraz;Forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100.3 nM
Neutral vehicle control:

DMSO

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 588

Number of chemicals tested: 576

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
172

Inactive hit count: 0
-------
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed

Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)

Positive control signal-to-background: (pmed/nmed)

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed

Negative control well median absolute deviation value, by plate: mmad

Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

89.015
4.529
5.72%

252.643
9.5

NA

11.618

NA
NA

1.95
0.1
NA

-17.164

NA

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 49.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 899

CEETOX_H295R_CORTIC

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for Corticosterone (CORTIC)

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate.
CEETOX_H295R_CORTIC is one of 23 assay component(s) measured or calculated from the CEETOX_H295R
assay. It is designed to make measurements of hormone induction, a form of inducible reporter, as detected
with absorbance signals by HPLC-MS-MS technology. Data from the assay component CEETOX_H295R_CORTIC
was analyzed into 1 assay endpoint. This assay endpoint, CEETOX_H295R_CORTIC, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, gain or loss-of-signal activity using HPLC-MS-MS was used to understand synthesis of Corticosterone
in H295R cell line at 48hr of chemical exposure. To generalize the intended target to other relatable targets, this
assay endpoint is annotated to the steroid hormone intended target family, where the subfamily is
glucocorticoids.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in

-------
adverse reproductive and developmental effects. TheToxCast HTS program adapted the OECD-validated H295R
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four 10|aM forskolin replicates to
control for hormone stimulation, four 3|jM prochloraz replicates to control for hormone inhibition and 4

-------
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.196
Response cutoff threshold used to determine hit calls: 1.175
Detection technology used: HPLC-MS-MS (Spectrophotometry)

2.6	Response: Decreased or increased production of corticosterone following interference with steroidogenesis
was quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data were log2-transformed and analyzed as a fold-change compared to DMSO control wells as
baseline response. All statistical analyses were conducted using R programming language, employing tcpl
package to generate model parameters and confidence intervals.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412666667 nM
Key positive control:

Prochloraz;Forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100.3 nM
Neutral vehicle control:

DMSO

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 588	Number of chemicals tested: 576

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.835

Neutral control median absolute deviation, by plate: nmad	0.096

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	8.27%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	6.498

Positive control well median absolute deviation, by plate: pmad	0.341

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	12.77

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	0.354

Negative control well median absolute deviation value, by plate: mmad	0

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-7.378

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 108.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 901

CEETOX_H295R_CORTISOL

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for Cortisol

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate.
CEETOX_H295R_CORTISOL is one of 23 assay component(s) measured or calculated from the CEETOX_H295R
assay. It is designed to make measurements of hormone induction, a form of inducible reporter, as detected
with absorbance signals by HPLC-MS-MS technology. Data from the assay component
CEETOX_H295R_CORTISOLwas analyzed into 1 assay endpoint. This assay endpoint, CEETOX_H295R_CORTISOL,
was analyzed with bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a
type of inducible reporter, gain or loss-of-signal activity using HPLC-MS-MS was used to understand synthesis of
Cortisol in H295R cell line at 48hr of chemical exposure. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the steroid hormone intended target family, where the subfamily is
glucocorticoids.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in

-------
adverse reproductive and developmental effects. TheToxCast HTS program adapted the OECD-validated H295R
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four 10|aM forskolin replicates to
control for hormone stimulation, four 3|jM prochloraz replicates to control for hormone inhibition and 4

-------
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.169
Response cutoff threshold used to determine hit calls: 1.015
Detection technology used: HPLC-MS-MS (Spectrophotometry)

2.6	Response: Decreased or increased production of Cortisol following interference with steroidogenesis was
quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data were log2-transformed and analyzed as a fold-change compared to DMSO control wells as
baseline response. All statistical analyses were conducted using R programming language, employing tcpl
package to generate model parameters and confidence intervals.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412666667 nM
Key positive control:

Prochloraz;Forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100.3 nM
Neutral vehicle control:

DMSO

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 588	Number of chemicals tested: 576

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	22.52

Neutral control median absolute deviation, by plate: nmad	1.438

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	7.35%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	69.308

Positive control well median absolute deviation, by plate: pmad	4.737

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	10.249

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	0.615

Negative control well median absolute deviation value, by plate: mmad	0.044

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-13.355

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 41.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 905

CEETOX_H295R_DOC

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for DOC

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate.
CEETOX_H295R_DOC is one of 23 assay component(s) measured or calculated from the CEETOX_H295R assay.
It is designed to make measurements of hormone induction, a form of inducible reporter, as detected with
absorbance signals by HPLC-MS-MS technology. Data from the assay component CEETOX_H295R_DOC was
analyzed into 1 assay endpoint. This assay endpoint, CEETOX_H295R_DOC, was analyzed with bidirectional
fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain
or loss-of-signal activity using HPLC-MS-MS was used to understand synthesis of 11-Deoxycorticosterone in
H295R cell line at 48hr of chemical exposure. To generalize the intended target to other relatable targets, this
assay endpoint is annotated to the steroid hormone intended target family, where the subfamily is
glucocorticoids.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in

-------
adverse reproductive and developmental effects. TheToxCast HTS program adapted the OECD-validated H295R
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four 10|aM forskolin replicates to
control for hormone stimulation, four 3|jM prochloraz replicates to control for hormone inhibition and 4

-------
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.145
Response cutoff threshold used to determine hit calls: 0.873
Detection technology used: HPLC-MS-MS (Spectrophotometry)

2.6	Response: Decreased or increased production of DOC following interference with steroidogenesis was
quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data were log2-transformed and analyzed as a fold-change compared to DMSO control wells as
baseline response. All statistical analyses were conducted using R programming language, employing tcpl
package to generate model parameters and confidence intervals.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412666667 nM
Key positive control:

Prochloraz;Forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100.3 nM
Neutral vehicle control:

DMSO

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 588	Number of chemicals tested: 576

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	4.447

Neutral control median absolute deviation, by plate: nmad	0.256

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.35%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	10.975

Positive control well median absolute deviation, by plate: pmad	0.486

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	8.338

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	10.062

Negative control well median absolute deviation value, by plate: mmad	0.626

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	5.571

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 58.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 907

CEETOX_H295R_ESTRADIOL

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for Estradiol

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate.
CEETOX_H295R_ESTRADIOL is one of 23 assay component(s) measured or calculated from the CEETOX_H295R
assay. It is designed to make measurements of hormone induction, a form of inducible reporter, as detected
with absorbance signals by HPLC-MS-MS technology. Data from the assay component
CEETOX_H295R_ESTRADIOL was analyzed into 1 assay endpoint. This assay endpoint,
CEETOX_H295R_ESTRADIOL, was analyzed with bidirectional fitting relative to DMSO as the negative control
and baseline of activity. Using a type of inducible reporter, gain or loss-of-signal activity using HPLC-MS-MS was
used to understand synthesis of Estradiol in H295R cell line at 48hr of chemical exposure. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the steroid hormone intended
target family, where the subfamily is estrogens.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in

-------
adverse reproductive and developmental effects. TheToxCast HTS program adapted the OECD-validated H295R
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four 10|aM forskolin replicates to
control for hormone stimulation, four 3|jM prochloraz replicates to control for hormone inhibition and 4

-------
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.186
Response cutoff threshold used to determine hit calls: 1.117
Detection technology used: HPLC-MS-MS (Spectrophotometry)

2.6	Response: Decreased or increased production of estradiol following interference with steroidogenesis was
quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data were log2-transformed and analyzed as a fold-change compared to DMSO control wells as
baseline response. All statistical analyses were conducted using R programming language, employing tcpl
package to generate model parameters and confidence intervals.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412666667 nM
Key positive control:

Prochloraz;Forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100.3 nM
Neutral vehicle control:

DMSO

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 588	Number of chemicals tested: 576

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.411

Neutral control median absolute deviation, by plate: nmad	0.024

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	8.65%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	2.258

Positive control well median absolute deviation, by plate: pmad	0.096

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	11.694

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	0.04

Negative control well median absolute deviation value, by plate: mmad	0

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-8.687

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 55.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 909

CEETOX_H295R_ESTRONE

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for Estrone

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate.
CEETOX_H295R_ESTRONE is one of 23 assay component(s) measured or calculated from the CEETOX_H295R
assay. It is designed to make measurements of hormone induction, a form of inducible reporter, as detected
with absorbance signals by HPLC-MS-MS technology. Data from the assay component CEETOX_H295R_ESTRONE
was analyzed into 1 assay endpoint. This assay endpoint, CEETOX_H295R_ESTRONE, was analyzed with
bidirectional fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, gain or loss-of-signal activity using HPLC-MS-MS was used to understand synthesis of Estrone in H295R
cell line at 48hr of chemical exposure. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the steroid hormone intended target family, where the subfamily is estrogens.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in
adverse reproductive and developmental effects. The ToxCast HTS program adapted the OECD-validated H295R

-------
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four 10|aM forskolin replicates to
control for hormone stimulation, four 3|jM prochloraz replicates to control for hormone inhibition and 4
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish

-------
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.152
Response cutoff threshold used to determine hit calls: 0.91
Detection technology used: HPLC-MS-MS (Spectrophotometry)

2.6	Response: Decreased or increased production of estrone following interference with steroidogenesis was
quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data were log2-transformed and analyzed as a fold-change compared to DMSO control wells as
baseline response. All statistical analyses were conducted using R programming language, employing tcpl
package to generate model parameters and confidence intervals.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412666667 nM
Key positive control:

Prochloraz;Forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100.3 nM
Neutral vehicle control:

DMSO

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 588

Number of chemicals tested: 576

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
96

Inactive hit count: 0
-------
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed

Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)

Positive control signal-to-background: (pmed/nmed)

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed

Negative control well median absolute deviation value, by plate: mmad

Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

3.417
0.193
8.1%

22.57
0.693

NA

17.487

NA
NA

0.105
0.007
NA

-11.444

NA

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 58.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 913

CEETOX_H295R_PROG

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for Progesterone (PROG)

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate.
CEETOX_H295R_PROG is one of 23 assay component(s) measured or calculated from the CEETOX_H295R assay.
It is designed to make measurements of hormone induction, a form of inducible reporter, as detected with
absorbance signals by HPLC-MS-MS technology. Data from the assay component CEETOX_H295R_PROG was
analyzed into 1 assay endpoint. This assay endpoint, CEETOX_H295R_PROG, was analyzed with bidirectional
fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain
or loss-of-signal activity using HPLC-MS-MS was used to understand synthesis of Progesterone in H295R cell
line at 48hr of chemical exposure. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the steroid hormone intended target family, where the subfamily is progestagens.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in
adverse reproductive and developmental effects. The ToxCast HTS program adapted the OECD-validated H295R

-------
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four 10|aM forskolin replicates to
control for hormone stimulation, four 3|jM prochloraz replicates to control for hormone inhibition and 4
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish

-------
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.152
Response cutoff threshold used to determine hit calls: 0.915
Detection technology used: HPLC-MS-MS (Spectrophotometry)

2.6	Response: Decreased or increased production of progesterone following interference with steroidogenesis was
quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data were log2-transformed and analyzed as a fold-change compared to DMSO control wells as
baseline response. All statistical analyses were conducted using R programming language, employing tcpl
package to generate model parameters and confidence intervals.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412666667 nM
Key positive control:

Prochloraz;Forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100.3 nM
Neutral vehicle control:

DMSO

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 588

Number of chemicals tested: 576

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
225

Inactive hit count: 0
-------
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed

Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)

Positive control signal-to-background: (pmed/nmed)

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed

Negative control well median absolute deviation value, by plate: mmad

Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

0.358
0.022
5.23%

0.445
0.022

NA

1.742

NA
NA

8.95
0.456
NA

21.002

NA

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 89.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 915

CEETOX_H295R_TESTO

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for Testosterone (TESTO)

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate.
CEETOX_H295R_TESTO is one of 23 assay component(s) measured or calculated from the CEETOX_H295R assay.
It is designed to make measurements of hormone induction, a form of inducible reporter, as detected with
absorbance signals by HPLC-MS-MS technology. Data from the assay component CEETOX_H295R_TESTO was
analyzed into 1 assay endpoint. This assay endpoint, CEETOX_H295R_TESTO, was analyzed with bidirectional
fitting relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain
or loss-of-signal activity using HPLC-MS-MS was used to understand synthesis of Testosterone in H295R cell line
at 48hr of chemical exposure. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the steroid hormone intended target family, where the subfamily is androgens.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in
adverse reproductive and developmental effects. The ToxCast HTS program adapted the OECD-validated H295R

-------
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four 10|aM forskolin replicates to
control for hormone stimulation, four 3|jM prochloraz replicates to control for hormone inhibition and 4
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish

-------
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.166
Response cutoff threshold used to determine hit calls: 0.998
Detection technology used: HPLC-MS-MS (Spectrophotometry)

2.6	Response: Decreased or increased production of testosterone following interference with steroidogenesis was
quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data were log2-transformed and analyzed as a fold-change compared to DMSO control wells as
baseline response. All statistical analyses were conducted using R programming language, employing tcpl
package to generate model parameters and confidence intervals.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412666667 nM
Key positive control:

Prochloraz;Forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100.3 nM
Neutral vehicle control:

DMSO

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 588

Number of chemicals tested: 576

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
137

Inactive hit count: 0
-------
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed

Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)

Positive control signal-to-background: (pmed/nmed)

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed

Negative control well median absolute deviation value, by plate: mmad

Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

1.995
0.126
7.73%

5.325
0.241

NA

8.77

NA
NA

0.12
0.007
NA

-10.303

NA

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 52.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1664

CEETOX_H295R_MTT_cell_viability

1.	General Information

1.1	Assay Title: Viability Assessment in the CeeTox HT-H295R MTT Assay

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate.
CEETOX_H295R_MTT_cell_viability is one of 23 assay component(s) measured or calculated from the
CEETOX_H295R assay. It is designed to make measurements of cell number, a form of viability reporter, as
detected with fluorescence intensity signals by MTT cytotoxicity assay technology. The assay component
endpoint CEETOX_H295R_MTT_cell_viability was analyzed with bidirectional fitting relation to DMSO as the
negative control and baseline of activity. Using a type of viability reporter in the MTT cytotoxicity assay, gain or
loss-of-signal activity can be used to monitor cellular processes to understand changes in proliferation. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to cell cycle, where
the subfamily is proliferation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in
adverse reproductive and developmental effects. The ToxCast HTS program adapted the OECD-validated H295R

-------
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four 10|aM forskolin replicates to
control for hormone stimulation, four 3|jM prochloraz replicates to control for hormone inhibition and 4
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish

-------
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 5.966
Response cutoff threshold used to determine hit calls: 20
Detection technology used: MTT cytotoxicity assay (Fluorescence)

2.6	Response: A cell viability/cytotoxicity assay can be used to determine the potential impact of the test chemical
on cell viability. This is an important feature as it allows for the discrimination between effects that are due to
cytotoxicity from those due to the direct interaction of chemicals with steroidogenic pathway.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cytotoxicity was used to establish a maximum tolerated concentration (MTC) per chemical
sample whereby the ToxCast chemicals were evaluated at a maximum nominal concentration of 100 nM, where
possible. A target cell viability > 70% was sought. Chemicals resulting in H295R cell viability of 20%-70% were
diluted 10-fold, whereas those with < 20% viability were diluted 100-fold and re-evaluated. Dilutions were made
until > 70% viability was achieved for all chemicals establishing the MTC. The 70% cutoff criteria was established
based on 5 times the baseline median absolute deviation (5 x BMAD), which uses the estimated baseline noise

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

1.23456790123457 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100.3 nM
Neutral vehicle control:

DMSO

-------
level of the assay to inform on significant changes in cell viability. For the MTT data analysis, BMAD was defined
as the median absolute deviation (MAD) of all chemical samples evaluated during MTC determination, resulting
in a BMAD of 6%; hence with a 5 x BMAD cutoff the allowable viability loss was set as 30%. MTT cytotoxicity
evaluation was also conducted for the duplicates of all concentrations for chemicals tested in the CR studies
(CR; 6-point CR established by 3-fold serial dilutions from the MTC).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 11:
bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive
control value (pval) to 0; pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates

-------
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 677	Number of chemicals tested: 655

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.465

Neutral control median absolute deviation, by plate: nmad	0.013

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	2.86%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	0.011

Positive control well median absolute deviation, by plate: pmad	0.002

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-32.263

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

-------
((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 49.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis

-------
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2143

CEETOX_H295R_llDCORT_noMTC

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for 11-Deoxycortisol (11DCORT), maximum
tolerated concentration (MTC) filtered results only

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate, one of 12 assay
component(s) measured or calculated from the CEETOX_H295R assay (11 hormones and 1 viability assay). It is
designed to make measurements of hormone induction, a form of inducible reporter, as detected with
absorbance signals by HPLC-MS-MS technology. The concentrations for these components have not been pre-
filtered for the maximum tolerated concentration (MTC) based on the MTT assay (_noMTC). Data from the assay
component CEETOX_H295R_llDCORT_noMTC was analyzed into 1 assay endpoint. This assay endpoint,
CEETOX_H295R_llDCORT_noMTC, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain or loss-of-signal activity using HPLC-MS-
MS was used to understand synthesis of 11-Deoxycortisol in H295R cell line at 48hr of chemical exposure,
without filtering concentrations for maximum tolerated concentration (MTC). To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the steroid hormone intended target family,
where the subfamily is glucocorticoids.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: steroid hormone biosynthesis

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

-------
2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in
adverse reproductive and developmental effects. TheToxCast HTS program adapted the OECD-validated H295R
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard

-------
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four lO^M forskolin replicates to
control for hormone stimulation, four 3nM prochloraz replicates to control for hormone inhibition and 4
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.068
Response cutoff threshold used to determine hit calls: 0.409
Detection technology used: HPLC-MS-MS (HPLC-MS-MS)

2.6	Response: Decreased or increased production of 11-deoxycortisol following interference with steroidogenesis
was quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The concentrations for these components were pre-filtered for the maximum tolerated
concentration (MTC) based on the MTT assay (_noMTC). Data were log2-transformed and analyzed as a fold-

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412 nM
Key positive control:

prochloraz; forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
change compared to DMSO control wells as baseline response. All statistical analyses were conducted using R
programming language, employing tcpl package to generate model parameters and confidence intervals.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested

-------
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 85	Number of chemicals tested: 84

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed

Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)

Positive control signal-to-background: (pmed/nmed)

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed

Negative control well median absolute deviation value, by plate: mmad

Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

375.998
8.199
2.19%

862.42
28.307

NA

17.304

NA
NA

26.77
1.205
NA

-35.091

NA

NA

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 10.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2145

CEETOX_H295R_OHPREG_noMTC

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for OH-Pregnenolone (OHPREG), maximum
tolerated concentration (MTC) filtered results only

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate, one of 12 assay
component(s) measured or calculated from the CEETOX_H295R assay (11 hormones and 1 viability assay). It is
designed to make measurements of hormone induction, a form of inducible reporter, as detected with
absorbance signals by HPLC-MS-MS technology. The concentrations for these components have not been pre-
filtered for the maximum tolerated concentration (MTC) based on the MTT assay (_noMTC). Data from the assay
component CEETOX_H295R_OHPREG_noMTC was analyzed into 1 assay endpoint. This assay endpoint,
CEETOX_H295R_OHPREG_noMTC, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain or loss-of-signal activity using HPLC-MS-
MS was used to understand synthesis of 17alpha-hydroxypregnenolone in H295R cell line at 48hr of chemical
exposure, without filtering concentrations for maximum tolerated concentration (MTC). To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the steroid hormone intended
target family, where the subfamily is progestagens.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: steroid hormone biosynthesis

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

-------
2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in
adverse reproductive and developmental effects. TheToxCast HTS program adapted the OECD-validated H295R
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard

-------
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four lO^M forskolin replicates to
control for hormone stimulation, four 3nM prochloraz replicates to control for hormone inhibition and 4
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.102
Response cutoff threshold used to determine hit calls: 0.613
Detection technology used: HPLC-MS-MS (HPLC-MS-MS)

2.6	Response: Decreased or increased production of OH-pregnenolone following interference with steroidogenesis
was quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The concentrations for these components were pre-filtered for the maximum tolerated
concentration (MTC) based on the MTT assay (_noMTC). Data were log2-transformed and analyzed as a fold-

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412 nM
Key positive control:

prochloraz; forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
change compared to DMSO control wells as baseline response. All statistical analyses were conducted using R
programming language, employing tcpl package to generate model parameters and confidence intervals.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested

-------
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 85	Number of chemicals tested: 84

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	22.233

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	3.536

Negative control well median absolute deviation value, by plate: mmad	0

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-12.268

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

10.648

0.356

3.46%

41.55
1.06

NA

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 9.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2147

CEETOX_H295R_OHPROG_noMTC

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for OH-Progesterone (OHPROG), maximum
tolerated concentration (MTC) filtered results only

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate, one of 12 assay
component(s) measured or calculated from the CEETOX_H295R assay (11 hormones and 1 viability assay). It is
designed to make measurements of hormone induction, a form of inducible reporter, as detected with
absorbance signals by HPLC-MS-MS technology. The concentrations for these components have not been pre-
filtered for the maximum tolerated concentration (MTC) based on the MTT assay (_noMTC). Data from the assay
component CEETOX_H295R_OHPROG_noMTC was analyzed into 1 assay endpoint. This assay endpoint,
CEETOX_H295R_OHPROG_noMTC, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain or loss-of-signal activity using HPLC-MS-
MS was used to understand synthesis of 17alpha-hydroxyprogesterone in H295R cell line at 48hr of chemical
exposure, without filtering concentrations for maximum tolerated concentration (MTC). To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the steroid hormone intended
target family, where the subfamily is progestagens.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: steroid hormone biosynthesis

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

-------
2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in
adverse reproductive and developmental effects. TheToxCast HTS program adapted the OECD-validated H295R
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard

-------
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four lO^M forskolin replicates to
control for hormone stimulation, four 3nM prochloraz replicates to control for hormone inhibition and 4
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.11
Response cutoff threshold used to determine hit calls: 0.66
Detection technology used: HPLC-MS-MS (HPLC-MS-MS)

2.6	Response: Decreased or increased production of OH-progesterone following interference with steroidogenesis
was quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The concentrations for these components were pre-filtered for the maximum tolerated
concentration (MTC) based on the MTT assay (_noMTC). Data were log2-transformed and analyzed as a fold-

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412 nM
Key positive control:

prochloraz; forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
change compared to DMSO control wells as baseline response. All statistical analyses were conducted using R
programming language, employing tcpl package to generate model parameters and confidence intervals.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested

-------
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 85	Number of chemicals tested: 84

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed

Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)

Positive control signal-to-background: (pmed/nmed)

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed

Negative control well median absolute deviation value, by plate: mmad

Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

16.352
0.645
4.17%

38.702
1.605

NA

10.817

NA
NA

10.852
0.189
NA

-6.506

NA

NA

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 12.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2149

CEETOX_H295R_ANDR_noMTC

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for Androstenedione (ANDR), maximum
tolerated concentration (MTC) filtered results only

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate, one of 12 assay
component(s) measured or calculated from the CEETOX_H295R assay (11 hormones and 1 viability assay). It is
designed to make measurements of hormone induction, a form of inducible reporter, as detected with
absorbance signals by HPLC-MS-MS technology. The concentrations for these components have not been pre-
filtered for the maximum tolerated concentration (MTC) based on the MTT assay (_noMTC). Data from the assay
component CEETOX_H295R_ANDR_noMTC was analyzed into 1 assay endpoint. This assay endpoint,
CEETOX_H295R_ANDR_noMTC, was analyzed with bidirectional fitting relative to DMSO as the negative control
and baseline of activity. Using a type of inducible reporter, gain or loss-of-signal activity using HPLC-MS-MS was
used to understand synthesis of Androstenedione in H295R cell line at 48hr of chemical exposure, without
filtering concentrations for maximum tolerated concentration (MTC). To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the steroid hormone intended target family, where the
subfamily is androgens.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: steroid hormone biosynthesis

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

-------
2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in
adverse reproductive and developmental effects. TheToxCast HTS program adapted the OECD-validated H295R
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard

-------
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four lO^M forskolin replicates to
control for hormone stimulation, four 3nM prochloraz replicates to control for hormone inhibition and 4
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.092
Response cutoff threshold used to determine hit calls: 0.551
Detection technology used: HPLC-MS-MS (HPLC-MS-MS)

2.6	Response: Decreased or increased production of androstenedione following interference with steroidogenesis
was quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The concentrations for these components were pre-filtered for the maximum tolerated
concentration (MTC) based on the MTT assay (_noMTC). Data were log2-transformed and analyzed as a fold-

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412 nM
Key positive control:

prochloraz; forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
change compared to DMSO control wells as baseline response. All statistical analyses were conducted using R
programming language, employing tcpl package to generate model parameters and confidence intervals.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested

-------
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 85	Number of chemicals tested: 84

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	17.715

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	2.39

Negative control well median absolute deviation value, by plate: mmad	0.096

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-22.202

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

99.025
4.088
4.42%

268.722
8.506

NA

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 10.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2151

CEETOX_H295R_CORTIC_noMTC

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for Corticosterone (CORTIC), maximum
tolerated concentration (MTC) filtered results only

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate, one of 12 assay
component(s) measured or calculated from the CEETOX_H295R assay (11 hormones and 1 viability assay). It is
designed to make measurements of hormone induction, a form of inducible reporter, as detected with
absorbance signals by HPLC-MS-MS technology. The concentrations for these components have not been pre-
filtered for the maximum tolerated concentration (MTC) based on the MTT assay (_noMTC). Data from the assay
component CEETOX_H295R_CORTIC_noMTC was analyzed into 1 assay endpoint. This assay endpoint,
CEETOX_H295R_CORTIC_noMTC, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain or loss-of-signal activity using HPLC-MS-
MS was used to understand synthesis of Corticosterone in H295R cell line at 48hr of chemical exposure, without
filtering concentrations for maximum tolerated concentration (MTC). To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the steroid hormone intended target family, where the
subfamily is glucocorticoids.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: steroid hormone biosynthesis

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

-------
2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in
adverse reproductive and developmental effects. TheToxCast HTS program adapted the OECD-validated H295R
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard

-------
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four lO^M forskolin replicates to
control for hormone stimulation, four 3nM prochloraz replicates to control for hormone inhibition and 4
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.168
Response cutoff threshold used to determine hit calls: 1.008
Detection technology used: HPLC-MS-MS (HPLC-MS-MS)

2.6	Response: Decreased or increased production of corticosterone following interference with steroidogenesis
was quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The concentrations for these components were pre-filtered for the maximum tolerated
concentration (MTC) based on the MTT assay (_noMTC). Data were log2-transformed and analyzed as a fold-

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412 nM
Key positive control:

prochloraz; forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
change compared to DMSO control wells as baseline response. All statistical analyses were conducted using R
programming language, employing tcpl package to generate model parameters and confidence intervals.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested

-------
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 85	Number of chemicals tested: 84

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	18.796

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	0.354

Negative control well median absolute deviation value, by plate: mmad	0

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-16.464

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

2.58
0.126
4.73%

13.67
0.508

NA

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 11.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2153

CEETOX_H295R_CORTISOL_noMTC

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for Cortisol, maximum tolerated
concentration (MTC) filtered results only

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate, one of 12 assay
component(s) measured or calculated from the CEETOX_H295R assay (11 hormones and 1 viability assay). It is
designed to make measurements of hormone induction, a form of inducible reporter, as detected with
absorbance signals by HPLC-MS-MS technology. The concentrations for these components have not been pre-
filtered for the maximum tolerated concentration (MTC) based on the MTT assay (_noMTC). Data from the assay
component CEETOX_H295R_CORTISOL_noMTC was analyzed into 1 assay endpoint. This assay endpoint,
CEETOX_H295R_CORTISOL_noMTC, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain or loss-of-signal activity using HPLC-MS-
MS was used to understand synthesis of Cortisol in H295Rcell line at 48hr of chemical exposure, without filtering
concentrations for maximum tolerated concentration (MTC). To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the steroid hormone intended target family, where the
subfamily is glucocorticoids.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: steroid hormone biosynthesis

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

-------
2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in
adverse reproductive and developmental effects. TheToxCast HTS program adapted the OECD-validated H295R
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard

-------
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four lO^M forskolin replicates to
control for hormone stimulation, four 3nM prochloraz replicates to control for hormone inhibition and 4
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.113
Response cutoff threshold used to determine hit calls: 0.679
Detection technology used: HPLC-MS-MS (HPLC-MS-MS)

2.6	Response: Decreased or increased production of Cortisol following interference with steroidogenesis was
quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The concentrations for these components were pre-filtered for the maximum tolerated
concentration (MTC) based on the MTT assay (_noMTC). Data were log2-transformed and analyzed as a fold-

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412 nM
Key positive control:

prochloraz; forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
change compared to DMSO control wells as baseline response. All statistical analyses were conducted using R
programming language, employing tcpl package to generate model parameters and confidence intervals.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested

-------
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 85	Number of chemicals tested: 84

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed

Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)

Positive control signal-to-background: (pmed/nmed)

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed

Negative control well median absolute deviation value, by plate: mmad

Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

28.197
1.546
5.37%

101.325
2.828

NA

18.725

NA
NA

0.887
0.082
NA

-17.841

NA

NA

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 14.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2157

CEETOX_H295R_DOC_noMTC

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for DOC, maximum tolerated concentration
(MTC) filtered results only

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate, one of 12 assay
component(s) measured or calculated from the CEETOX_H295R assay (11 hormones and 1 viability assay). It is
designed to make measurements of hormone induction, a form of inducible reporter, as detected with
absorbance signals by HPLC-MS-MS technology. The concentrations for these components have not been pre-
filtered for the maximum tolerated concentration (MTC) based on the MTT assay (_noMTC). Data from the assay
component CEETOX_H295R_DOC_noMTC was analyzed into 1 assay endpoint. This assay endpoint,
CEETOX_H295R_DOC_noMTC, was analyzed with bidirectional fitting relative to DMSO as the negative control
and baseline of activity. Using a type of inducible reporter, gain or loss-of-signal activity using HPLC-MS-MS was
used to understand synthesis of 11-Deoxycorticosterone in H295R cell line at 48hr of chemical exposure,
without filtering concentrations for maximum tolerated concentration (MTC). To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the steroid hormone intended target family,
where the subfamily is glucocorticoids.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: steroid hormone biosynthesis

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

-------
2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in
adverse reproductive and developmental effects. TheToxCast HTS program adapted the OECD-validated H295R
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard

-------
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four lO^M forskolin replicates to
control for hormone stimulation, four 3nM prochloraz replicates to control for hormone inhibition and 4
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.105
Response cutoff threshold used to determine hit calls: 0.633
Detection technology used: HPLC-MS-MS (HPLC-MS-MS)

2.6	Response: Decreased or increased production of DOC following interference with steroidogenesis was
quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The concentrations for these components were pre-filtered for the maximum tolerated
concentration (MTC) based on the MTT assay (_noMTC). Data were log2-transformed and analyzed as a fold-

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412 nM
Key positive control:

prochloraz; forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
change compared to DMSO control wells as baseline response. All statistical analyses were conducted using R
programming language, employing tcpl package to generate model parameters and confidence intervals.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested

-------
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 85	Number of chemicals tested: 84

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	14.83

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	28.075

Negative control well median absolute deviation value, by plate: mmad	1.101

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	10.386

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

14.955
0.419
2.95%

31.713
1.071

NA

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 14.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2159

CEETOX_H295R_ESTRADIOL_noMTC

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for Estradiol, maximum tolerated
concentration (MTC) filtered results only

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate, one of 12 assay
component(s) measured or calculated from the CEETOX_H295R assay (11 hormones and 1 viability assay). It is
designed to make measurements of hormone induction, a form of inducible reporter, as detected with
absorbance signals by HPLC-MS-MS technology. The concentrations for these components have not been pre-
filtered for the maximum tolerated concentration (MTC) based on the MTT assay (_noMTC). Data from the assay
component CEETOX_H295R_ESTRADIOL_noMTC was analyzed into 1 assay endpoint. This assay endpoint,
CEETOX_H295R_ESTRADIOL_noMTC, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain or loss-of-signal activity using HPLC-MS-
MS was used to understand synthesis of Estradiol in H295R cell line at 48hr of chemical exposure, without
filtering concentrations for maximum tolerated concentration (MTC). To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the steroid hormone intended target family, where the
subfamily is estrogens.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: steroid hormone biosynthesis

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

-------
2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in
adverse reproductive and developmental effects. TheToxCast HTS program adapted the OECD-validated H295R
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard

-------
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four lO^M forskolin replicates to
control for hormone stimulation, four 3nM prochloraz replicates to control for hormone inhibition and 4
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.178
Response cutoff threshold used to determine hit calls: 1.07
Detection technology used: HPLC-MS-MS (HPLC-MS-MS)

2.6	Response: Decreased or increased production of estradiol following interference with steroidogenesis was
quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The concentrations for these components were pre-filtered for the maximum tolerated
concentration (MTC) based on the MTT assay (_noMTC). Data were log2-transformed and analyzed as a fold-

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412 nM
Key positive control:

prochloraz; forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
change compared to DMSO control wells as baseline response. All statistical analyses were conducted using R
programming language, employing tcpl package to generate model parameters and confidence intervals.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested

-------
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 85	Number of chemicals tested: 84

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	13.81

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	0.021

Negative control well median absolute deviation value, by plate: mmad	0

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-11.626

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

0.28
0.022
7.48%

1.938
0.115

NA

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 8.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2161

CEETOX_H295R_ESTRON E_noMTC

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for Estrone, maximum tolerated
concentration (MTC) filtered results only

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate, one of 12 assay
component(s) measured or calculated from the CEETOX_H295R assay (11 hormones and 1 viability assay). It is
designed to make measurements of hormone induction, a form of inducible reporter, as detected with
absorbance signals by HPLC-MS-MS technology. The concentrations for these components have not been pre-
filtered for the maximum tolerated concentration (MTC) based on the MTT assay (_noMTC). Data from the assay
component CEETOX_H295R_ESTRONE_noMTC was analyzed into 1 assay endpoint. This assay endpoint,
CEETOX_H295R_ESTRONE_noMTC, was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain or loss-of-signal activity using HPLC-MS-
MS was used to understand synthesis of Estrone in H295R cell line at 48hr of chemical exposure, without
filtering concentrations for maximum tolerated concentration (MTC). To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the steroid hormone intended target family, where the
subfamily is estrogens.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: steroid hormone biosynthesis

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

-------
2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in
adverse reproductive and developmental effects. TheToxCast HTS program adapted the OECD-validated H295R
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard

-------
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four lO^M forskolin replicates to
control for hormone stimulation, four 3nM prochloraz replicates to control for hormone inhibition and 4
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.212
Response cutoff threshold used to determine hit calls: 1.271
Detection technology used: HPLC-MS-MS (HPLC-MS-MS)

2.6	Response: Decreased or increased production of estrone following interference with steroidogenesis was
quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The concentrations for these components were pre-filtered for the maximum tolerated
concentration (MTC) based on the MTT assay (_noMTC). Data were log2-transformed and analyzed as a fold-

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412 nM
Key positive control:

prochloraz; forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
change compared to DMSO control wells as baseline response. All statistical analyses were conducted using R
programming language, employing tcpl package to generate model parameters and confidence intervals.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested

-------
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 85	Number of chemicals tested: 84

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed

Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)

Positive control signal-to-background: (pmed/nmed)

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed

Negative control well median absolute deviation value, by plate: mmad

Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

2.107
0.137
7.32%

18.082
0.815

NA

19.122

NA
NA

0.048
0.007
NA

-11.813

NA

NA

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 13.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2165

CEETOX_H295R_PROG_noMTC

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for Progesterone (PROG), maximum
tolerated concentration (MTC) filtered results only

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate, one of 12 assay
component(s) measured or calculated from the CEETOX_H295R assay (11 hormones and 1 viability assay). It is
designed to make measurements of hormone induction, a form of inducible reporter, as detected with
absorbance signals by HPLC-MS-MS technology. The concentrations for these components have not been pre-
filtered for the maximum tolerated concentration (MTC) based on the MTT assay (_noMTC). Data from the assay
component CEETOX_H295R_PROG_noMTC was analyzed into 1 assay endpoint. This assay endpoint,
CEETOX_H295R_PROG_noMTC, was analyzed with bidirectional fitting relative to DMSO as the negative control
and baseline of activity. Using a type of inducible reporter, gain or loss-of-signal activity using HPLC-MS-MS was
used to understand synthesis of Progesterone in H295R cell line at 48hr of chemical exposure, without filtering
concentrations for maximum tolerated concentration (MTC). To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the steroid hormone intended target family, where the
subfamily is progestagens.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: steroid hormone biosynthesis

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

-------
2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in
adverse reproductive and developmental effects. TheToxCast HTS program adapted the OECD-validated H295R
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard

-------
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four lO^M forskolin replicates to
control for hormone stimulation, four 3nM prochloraz replicates to control for hormone inhibition and 4
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.126
Response cutoff threshold used to determine hit calls: 0.758
Detection technology used: HPLC-MS-MS (HPLC-MS-MS)

2.6	Response: Decreased or increased production of progesterone following interference with steroidogenesis was
quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The concentrations for these components were pre-filtered for the maximum tolerated
concentration (MTC) based on the MTT assay (_noMTC). Data were log2-transformed and analyzed as a fold-

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412 nM
Key positive control:

prochloraz; forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
change compared to DMSO control wells as baseline response. All statistical analyses were conducted using R
programming language, employing tcpl package to generate model parameters and confidence intervals.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested

-------
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 85	Number of chemicals tested: 84

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	6.513

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	13.867

Negative control well median absolute deviation value, by plate: mmad	0.467

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	28.366

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

0.802
0.03
3.68%

1.17
0.03

NA

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 13.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2167

CEETOX_H295R_TESTO_noMTC

1.	General Information

1.1	Assay Title: CeeTox H295R High-throughput Steroidogenesis Assay for Testosterone (TESTO), maximum
tolerated concentration (MTC) filtered results only

1.2	Assay Summary: CEETOX_H295R is a cell-based, multiplexed-readout assay that uses H295R, a human adrenal
gland cell line, with measurements taken at 48 hours after chemical dosing in a 96-well plate, one of 12 assay
component(s) measured or calculated from the CEETOX_H295R assay (11 hormones and 1 viability assay). It is
designed to make measurements of hormone induction, a form of inducible reporter, as detected with
absorbance signals by HPLC-MS-MS technology. The concentrations for these components have not been pre-
filtered for the maximum tolerated concentration (MTC) based on the MTT assay (_noMTC). Data from the assay
component CEETOX_H295R_TESTO_noMTC was analyzed into 1 assay endpoint. This assay endpoint,
CEETOX_H295R_TESTO_noMTC, was analyzed with bidirectional fitting relative to DMSO as the negative control
and baseline of activity. Using a type of inducible reporter, gain or loss-of-signal activity using HPLC-MS-MS was
used to understand synthesis of Testosterone in H295R cell line at 48hr of chemical exposure, without filtering
concentrations for maximum tolerated concentration (MTC). To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the steroid hormone intended target family, where the
subfamily is androgens.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Ceetox, a part of Cyprotex, is a Contract Research Organization (CRO) that in coordination with
OpAns, an analytical laboratory, provide ADME-tox services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary; it is a modification of the existing OECD H295R in vitro
steroidogenesis assay validated in 2011 (Test Guideline No. 456, [1]). NCI-H295R cells (ATCC CRL-2128) are
commercially available from American Type Culture Collection with signed Material Transfer Agreement

1.9	Assay Throughput: 96-well plate. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times (maximum passage of 10). Cells were seeded into 96-well plates at 50-60% confluency and allowed
to adhere overnight. Prior to test chemical exposure, cells were pre-stimulated with forskolin for 48 hours, and
chemical exposures were subsequently conducted for 48 hours in forskolin-free media. Following
concentration-response and MTC assays, MTT cytotoxicity assays were conducted. The media samples were
stored at -80 C prior to HPLC-MS/MS quantification of steroid hormones.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: steroid hormone biosynthesis

The CeeTox High-throughput Steroidogenesis assay was used to screen a large chemical library for changes in
steroid hormone levels resulting from interference with steroidogenesis in H295R human adrenocortical
carcinoma cells.

-------
2.2	Scientific Principles: The steroidogenic pathway is a series of hydroxylation and dehydrogenation steps carried
out by CYP450 and hydroxysteroid dehydrogenase enzymes. Environmentally-relevant chemicals can elicit
endocrine disruption by altering steroid hormone biosynthesis and metabolism (steroidogenesis), resulting in
adverse reproductive and developmental effects. TheToxCast HTS program adapted the OECD-validated H295R
steroidogenesis assay using human adrenocortical carcinoma cells (TG No. 456; which established performance-
based testing guidelines for the quantification of two steroid hormones, testosterone and estradiol) to
quantitatively assess the concentration-dependent effects of chemicals on 13 steroid hormones in 4 hormone
classes including progestogens, androgens, estrogens and glucocorticoids. To identify xenobiotics with the
capacity to disrupt the steroid hormone biosynthetic pathway, this assay quantitatively assesses changes in
steroid hormone levels. The CeeTox Steroidogenesis assay also demonstrates a novel implementation of H295R
cells as a high-throughput and multiplexed screening platform for screening a diverse chemical library for
xenobiotic interference with steroidogenesis

2.3	Experimental System: adherent H295R cell line used. H295R is an immortalized cell line derived from an
adrenocortical carcinoma isolated in 1980 from a 48-year-old African-American female patient. H295R cells
express genes that encode for all the key enzymes involved in steroidogenesis [1, 2], H295R cells have the
physiological characteristics of zonally undifferentiated human fetal adrenal cells. H295R cells represent a
unique in vitro system, as the cells retain the ability to produce many of the steroid hormones found in the adult
adrenal cortex and the gonads (with the exception of dihydrotestosterone, DHT), which allows for analysis of
xenobiotic effects on both corticosteroid synthesis and the production of sex steroid hormones, including
androgens and estrogens

2.4	Metabolic Competence: H295R cell line expresses CYPs 1A1, 11A, 17, 19, 21, 1B1 and 11B1, which are
differentially induced by endocrine-disrupting chemicals

2.5	Exposure Regime: Cell culture and media preparation procedures were conducted in accordance with OECD Test
No. 456 guidelines, with minor modifications. H295R cells (ATCC CRL-2128) were expanded for 5 passages and
frozen in batches in liquid nitrogen. Prior to the steroidogenesis assay, H295R cells were thawed and passed at
least 4 times. The maximum passage was 10. Cells were maintained in a 1:1 mixture of DMEM/F12
supplemented with 5ml/L ITS+ Premix (BD Bioscience) and 12.5 ml/L Nu-Serum (BD Bioscience). After seeding
cells into 96-well plates at 50-60% confluency, cells were allowed to adhere overnight. Prior to chemical testing,
culture medium was replaced with 175 uL medium containing 10 nM forskolin to stimulate steroidogenesis for
48 hours. Following prestimulation, forskolin medium was replaced with medium containing test chemical
solubilized in DMSO (ensuring a final concentration of 0.1% DMSO). Test chemicals were incubated for 48 hours,
then the medium was removed and split into 2 vials containing approximately 75uL each, and stored at -80 C
prior to hormone analysis. For steroid quantification, media samples were shipped to OpAns, LLC (Durham,
NC) on dry ice, and samples were thawed to room temperature prior to liquid-liquid extraction. Steroid
hormones were extracted from media samples using methyl tertbutyl ether (MTBE). An extra derivatization with
dansyl chloride was included for estrogen (estrone and estradiol) detection only. Steroid hormones were
separated by HPLC, eluted using a reverse phase C18 gradient with electrospray positive ionization, followed by
quantification by tandem mass spectrometry (MS/MS). Data were acquired on a MassHunter Workstation
Acquisition version B03.01 (Agilent Technologies, Inc.) and processed using MassHunter Quantitative Analysis
for QQQ. Accuracy was determined for each hormone analyte from 3 standards to determine upper and lower
limits of quantification (ULOQand LLOQ, respectively) using a 7-point standard curve. Precision and accuracy of
the extraction and quantification methods were calculated as the percent relative standard deviation (%RSD) of
the spiked standards and percent spiked standard recovered, respectively. The goal was to achieve 100%
accuracy (i.e., recover all spiked-in standard at quantification with minimal loss during run time) and good
precision (i.e., have %RSD <15% to ensure reproducibility). Test medium was removed following chemical
exposure, and cell viability was evaluated in the same wells by MTT. MTT procedures were as follows: after
removal of test medium, 500 uL of 0.5 mg/ml of 3-[4,5-dimethylthiazol-2-y]2,5-diphenyltetrazoliumbromide
solution was added to the cells. Following a 4-hour incubation at 37 C and 5% C02 to allow formazan-crystal
formation, the MTT solution was removed and blue formazan salt crystals were solubilized using 500uL
anhydrous isopropanol with shaking for 20 minutes. Absorbance was read at 570 and 650nm using a Packard

-------
Fusion microplate reader. Background correction of absorbance units was used to determine percent change
relative to controls. All assay plates contained multiple control wells including four lO^M forskolin replicates to
control for hormone stimulation, four 3nM prochloraz replicates to control for hormone inhibition and 4
digitonin replicates to control for cell death. Cell viability, as indicated by the MTT assay, was used to establish
maximum tolerated concentration (MTC) initially using a nominal concentration of lOO^M (where feasible
within solubility limits of the specific test chemical) and targeting cell viability >70%. When a test chemical
reduced cell viability to 20% - 70%, the test chemical was diluted 10-fold and re-evaluated, but if cell viability
was <20%, then the test chemical was diluted 100-fold and re-evaluated. Test chemical dilutions were made
until cell viability was >70%. MTT assays were also conducted for duplicates of all concentrations of chemicals
tested in concentration-response studies.

Baseline median absolute deviation for the assay (bmad): 0.114
Response cutoff threshold used to determine hit calls: 0.684
Detection technology used: HPLC-MS-MS (HPLC-MS-MS)

2.6	Response: Decreased or increased production of testosterone following interference with steroidogenesis was
quantified by HPLC-MS/MS.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Steroidogenesis Bioactivity: Assays related to steroidogenesis

Additionally, this assay was annotated to the intended target family of steroid hormone.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The concentrations for these components were pre-filtered for the maximum tolerated
concentration (MTC) based on the MTT assay (_noMTC). Data were log2-transformed and analyzed as a fold-

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.412 nM
Key positive control:

prochloraz; forskolin

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
change compared to DMSO control wells as baseline response. All statistical analyses were conducted using R
programming language, employing tcpl package to generate model parameters and confidence intervals.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 17: bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median,
by assay plate ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a
concentration index (cndx) of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

6: bmad6 (Add a cutoff value of 6 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested

-------
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 85	Number of chemicals tested: 84

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	8.518

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	0.192

Negative control well median absolute deviation value, by plate: mmad	0.022

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	-18.589

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

2.697
0.133
4.86%

6.432
0.378

NA

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 7.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Karmaus AL, Toole CM, Filer DL, Lewis KC, Martin MT. High-Throughput Screening of Chemical
Effects on Steroidogenesis Using H295R Human Adrenocortical Carcinoma Cells. Toxicol Sci. 2016
Apr;150(2):323-32. doi: 10.1093/toxsci/kfw002. Epub 2016 Jan 18. PMID: 26781511; PMCID: PMC4809454.,
Haggard DE, Karmaus AL, Martin MT, Judson RS, Setzer RW, Paul Friedman K. High-Throughput H295R
Steroidogenesis Assay: Utility as an Alternative and a Statistical Approach to Characterize Effects on
Steroidogenesis. Toxicol Sci. 2018 Apr l;162(2):509-534. doi: 10.1093/toxsci/kfx274. Erratum in: Toxicol Sci.
2018 Aug 1;164(2):646. PMID: 29216406., Haggard DE, Setzer RW, Judson RS, Paul Friedman K. Development of
a prioritization method for chemical-mediated effects on steroidogenesis using an integrated statistical analysis
of high-throughput H295R data. Regul Toxicol Pharmacol. 2019 Dec; 109:104510. doi:
10.1016/j.yrtph.2019.104510. Epub 2019 Oct 29. PMID: 31676319.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1611

CLD_ABCBl_6hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for ABC Transporter (ABCB1) at 6 hours

1.2	Assay Summary: CLD_6hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver primary
cell, with measurements taken at 6 hours after chemical dosing in a 96-well plate. CLD_ABCBl_6hr is one of 16
assay component(s) measured or calculated from the CLD_6hr assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals by Quantitative
Nuclease Protection Assay (qNPA) technology. Data from the assay component CLD_ABCBl_6hr was analyzed
at the endpoint, CLD_ABCBl_6hr, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used to
understand changes in the reporter gene as they relate to the gene ABCB1. Furthermore, this assay endpoint
can be referred to as a primary readout. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the transporter intended target family, where subfamily is ABC transporter.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling

-------
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research

-------
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.133
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transporter.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

26

16

150

quadratic-polynomialfpoly2) model: 54

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

5

2

61

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

-------
Neutral control well median response value, by plate: nmed

1789.25

Neutral control median absolute deviation, by plate: nmad	141.588

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	1.1%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 5.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1612

CLD_ABCBll_6hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for ABC Transporter (ABCB11) at 6 hours

1.2	Assay Summary: CLD_6hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver primary
cell, with measurements taken at 6 hours after chemical dosing in a 96-well plate. CLD_ABCBll_6hr is one of
16 assay component(s) measured or calculated from the CLD_6hr assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals by Quantitative
Nuclease Protection Assay (qNPA) technology. Data from the assay component CLD_ABCBll_6hr was analyzed
at the endpoint, CLD_ABCBll_6hr, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used to
understand changes in the reporter gene as they relate to the gene ABCB11. Furthermore, this assay endpoint
can be referred to as a primary readout. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the transporter intended target family, where subfamily is ABC transporter.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling

-------
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research

-------
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.241
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transporter.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
5

Inactive hit count: 0
-------
gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

31

11

134

quadratic-polynomialfpoly2) model: 48

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

81

5

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

-------
Neutral control well median response value, by plate: nmed

84.75

Neutral control median absolute deviation, by plate: nmad	10.378

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	12.51%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 5.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1613

CLD_ABCG2_6hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for ABC Transporter (ABCG2) at 6 hours

1.2	Assay Summary: CLD_6hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver primary
cell, with measurements taken at 6 hours after chemical dosing in a 96-well plate. CLD_ABCG2_6hr is one of 16
assay component(s) measured or calculated from the CLD_6hr assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals by Quantitative
Nuclease Protection Assay (qNPA) technology. Data from the assay component CLD_ABCG2_6hr was analyzed
at the endpoint, CLD_ABCG2_6hr, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used to
understand changes in the reporter gene as they relate to the gene ABCG2. Furthermore, this assay endpoint
can be referred to as a primary readout. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the transporter intended target family, where subfamily is ABC transporter.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling

-------
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research

-------
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.205
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transporter.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
2

Inactive hit count: 0
-------
gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

54

14

122

quadratic-polynomialfpoly2) model: 42

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

4

0

79

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

-------
Neutral control well median response value, by plate: nmed

299.5

Neutral control median absolute deviation, by plate: nmad	38.918

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.14%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 4.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1614

CLD_ACTIN_6hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for Baseline Control (Actin) at 6 hours

1.2	Assay Summary: CLD_6hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver primary
cell, with measurements taken at 6 hours after chemical dosing in a 96-well plate. CLD_ACTIN_6hr is one of 16
assay component(s) measured or calculated from the CLD_6hr assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals by Quantitative
Nuclease Protection Assay (qNPA) technology. Data from the assay component CLD_ACTIN_6hr was analyzed
at the endpoint, CLD_ACTIN_6hr, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used to
understand changes in the reporter gene as they relate to the gene ACTA1. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves a background control function. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the background measurement intended target family, where subfamily is baseline
control.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative

-------
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of

-------
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.172
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
0

211

108

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

8

48

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

15

121

quadratic-polynomialfpoly2) model: 36

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

83

8

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	3533.25

Neutral control median absolute deviation, by plate: nmad	401.043

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.36%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 8.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1615

CLD_CYPlAl_6hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for xenobiotic metabolism (CYP1A1) at 6 hours

1.2	Assay Summary: CLD_6hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver primary
cell, with measurements taken at 6 hours after chemical dosing in a 96-well plate. CLD_CYPlAl_6hr is one of 16
assay component(s) measured or calculated from the CLD_6hr assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals by Quantitative
Nuclease Protection Assay (qNPA) technology. Data from the assay component CLD_CYPlAl_6hr was analyzed
at the endpoint, CLD_CYPlAl_6hr, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used to
understand changes in the reporter gene as they relate to the gene CYP1A1. Furthermore, this assay endpoint
can be referred to as a primary readout. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the cyp intended target family, where subfamily is xenobiotic metabolism.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling

-------
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research

-------
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.592
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cyp.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
133

Inactive hit count: 0
-------
gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

34

26

84

quadratic-polynomialfpoly2) model: 58

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

21

78

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

-------
Neutral control well median response value, by plate: nmed

80

Neutral control median absolute deviation, by plate: nmad	28.911

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	31.43%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 21.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1616

CLD_CYPlA2_6hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for xenobiotic metabolism (CYP1A2) at 6 hours

1.2	Assay Summary: CLD_6hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver primary
cell, with measurements taken at 6 hours after chemical dosing in a 96-well plate. CLD_CYPlA2_6hr is one of 16
assay component(s) measured or calculated from the CLD_6hr assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals by Quantitative
Nuclease Protection Assay (qNPA) technology. Data from the assay component CLD_CYPlA2_6hr was analyzed
at the endpoint, CLD_CYPlA2_6hr, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used to
understand changes in the reporter gene as they relate to the gene CYP1A2. Furthermore, this assay endpoint
can be referred to as a primary readout. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the cyp intended target family, where subfamily is xenobiotic metabolism.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling

-------
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research

-------
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.441
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cyp.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
88

Inactive hit count: 0
-------
gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

56

30

69

quadratic-polynomialfpoly2) model: 58

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

70

1

19

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

-------
Neutral control well median response value, by plate: nmed

110.75

Neutral control median absolute deviation, by plate: nmad	25.575

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	23.26%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 19.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1617

CLD_CYP2B6_6hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for xenobiotic metabolism (CYP1A2) at 6 hours

1.2	Assay Summary: CLD_6hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver primary
cell, with measurements taken at 6 hours after chemical dosing in a 96-well plate. CLD_CYP2B6_6hr is one of 16
assay component(s) measured or calculated from the CLD_6hr assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals by Quantitative
Nuclease Protection Assay (qNPA) technology. Data from the assay component CLD_CYP2B6_6hr was analyzed
at the endpoint, CLD_CYP2B6_6hr, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used to
understand changes in the reporter gene as they relate to the gene CYP2B6. Furthermore, this assay endpoint
can be referred to as a primary readout. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the cyp intended target family, where subfamily is xenobiotic metabolism.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling

-------
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research

-------
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.397
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cyp.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
180

Inactive hit count: 0
-------
gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

24

24

66

quadratic-polynomialfpoly2) model: 66

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

2

74

31

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

-------
Neutral control well median response value, by plate: nmed

52.25

Neutral control median absolute deviation, by plate: nmad	10.378

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	20.34%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 31.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1618

CLD_CYP2C19_6h r

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for xenobiotic metabolism (CYP1A2) at 6 hours

1.2	Assay Summary: CLD_6hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver primary
cell, with measurements taken at 6 hours after chemical dosing in a 96-well plate. CLD_CYP2C19_6hr is one of
16 assay component(s) measured or calculated from the CLD_6hr assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals by Quantitative
Nuclease Protection Assay (qNPA) technology. Data from the assay component CLD_CYP2C19_6hr was analyzed
at the endpoint, CLD_CYP2C19_6hr, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used to
understand changes in the reporter gene as they relate to the gene CYP2C19. Furthermore, this assay endpoint
can be referred to as a primary readout. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the cyp intended target family, where subfamily is xenobiotic metabolism.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling

-------
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research

-------
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.195
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cyp.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1

Inactive hit count: 0
-------
gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

29

28

128

quadratic-polynomialfpoly2) model: 42

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

4

81

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

-------
Neutral control well median response value, by plate: nmed

1282.75

Neutral control median absolute deviation, by plate: nmad	159.009

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.39%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 4.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1619

CLD_CYP2C9_6hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for xenobiotic metabolism (CYP2C9) at 6 hours

1.2	Assay Summary: CLD_6hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver primary
cell, with measurements taken at 6 hours after chemical dosing in a 96-well plate. CLD_CYP2C9_6hr is one of 16
assay component(s) measured or calculated from the CLD_6hr assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals by Quantitative
Nuclease Protection Assay (qNPA) technology. Data from the assay component CLD_CYP2C9_6hr was analyzed
at the endpoint, CLD_CYP2C9_6hr, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used to
understand changes in the reporter gene as they relate to the gene CYP2C9. Furthermore, this assay endpoint
can be referred to as a primary readout. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the cyp intended target family, where subfamily is xenobiotic metabolism.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling

-------
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research

-------
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.318
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cyp.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1

Inactive hit count: 0
-------
gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

48

18

122

quadratic-polynomialfpoly2) model:	17

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

9

101

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

-------
Neutral control well median response value, by plate: nmed

912.25

Neutral control median absolute deviation, by plate: nmad	270.204

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	45.98%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 9.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1620

CLD_CYP3A4_6hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for xenobiotic metabolism (CYP3A4) at 6 hours

1.2	Assay Summary: CLD_6hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver primary
cell, with measurements taken at 6 hours after chemical dosing in a 96-well plate. CLD_CYP3A4_6hr is one of 16
assay component(s) measured or calculated from the CLD_6hr assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals by Quantitative
Nuclease Protection Assay (qNPA) technology. Data from the assay component CLD_CYP3A4_6hr was analyzed
at the endpoint, CLD_CYP3A4_6hr, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used to
understand changes in the reporter gene as they relate to the gene CYP3A4. Furthermore, this assay endpoint
can be referred to as a primary readout. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the cyp intended target family, where subfamily is xenobiotic metabolism.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling

-------
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research

-------
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.561
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

4nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cyp.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
66

Inactive hit count: 0
-------
gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

20

15

134

quadratic-polynomialfpoly2) model: 70

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

67

3

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

-------
Neutral control well median response value, by plate: nmed

20

Neutral control median absolute deviation, by plate: nmad	6.672

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	63.54%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 3.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1621

CLD_GAPDH_6hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for Baseline Control (Glyceraldehyde 3-phosphate
dehydrogenase) at 6 hours

1.2	Assay Summary: CLD_6hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver primary
cell, with measurements taken at 6 hours after chemical dosing in a 96-well plate. CLD_GAPDH_6hr is one of 16
assay component(s) measured or calculated from the CLD_6hr assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals by Quantitative
Nuclease Protection Assay (qNPA) technology. Data from the assay component CLD_GAPDH_6hr was analyzed
at the endpoint, CLD_GAPDH_6hr, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used to
understand changes in the reporter gene as they relate to the gene GAPDH. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves a background control function. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the background measurement intended target family, where subfamily is baseline
control.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).

-------
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological

-------
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.137
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
0

206

113

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

6

27

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

23

133

quadratic-polynomialfpoly2) model: 47

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

6

76

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	5253.75

Neutral control median absolute deviation, by plate: nmad	615.279

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.74%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 6.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1622

CLD_GSTA2_6hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for alkyl and aryl transferase (GSTA2) at 6 hours

1.2	Assay Summary: CLD_6hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver primary
cell, with measurements taken at 6 hours after chemical dosing in a 96-well plate. CLD_GSTA2_6hr is one of 16
assay component(s) measured or calculated from the CLD_6hr assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals by Quantitative
Nuclease Protection Assay (qNPA) technology. Data from the assay component CLD_GSTA2_6hr was analyzed
at the endpoint, CLD_GSTA2_6hr, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used to
understand changes in the reporter gene as they relate to the gene GSTA2. Furthermore, this assay endpoint
can be referred to as a primary readout. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the transferase intended target family, where subfamily is alkyl and aryl transferase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling

-------
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research

-------
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.182
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transferase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

33

19

118

quadratic-polynomialfpoly2) model: 47

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

93

7

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

-------
Neutral control well median response value, by plate: nmed

1459

Neutral control median absolute deviation, by plate: nmad	162.345

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.23%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 7.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1623

CLD_HMGCS2_6hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for HMG-CoA synthase (HMGCS2) at 6 hours

1.2	Assay Summary: CLD_6hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver primary
cell, with measurements taken at 6 hours after chemical dosing in a 96-well plate. CLD_HMGCS2_6hr is one of
16 assay component(s) measured or calculated from the CLD_6hr assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals by Quantitative
Nuclease Protection Assay (qNPA) technology. Data from the assay component CLD_HMGCS2_6hr was analyzed
at the endpoint, CLD_HMGCS2_6hr, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used to
understand changes in the reporter gene as they relate to the gene HMGCS2. Furthermore, this assay endpoint
can be referred to as a primary readout. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the lyase intended target family, where subfamily is HMG-CoA synthase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling

-------
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed. This endpoint measures mRNA
induction as it relates to the HMGCS2 gene (USEPA, 2020), which is involved in cholesterol biosynthesis and
mitochondrial ketogenesis (Vila-Brau et al., 2011; Wang et al., 2019). Ketogenesis provides lipid-derived energy
for various organs during times of carbohydrate deprivation, such as fasting (Geisler et al., 2019; Hegardt, 1999).
The HMGCS2 gene was mapped to 17 AOPs through 10 key events mostly relating to mitochondrial dysfunction
(AOP Wiki, 2016). Interestingly, four of these AOPs (#77-80) relate to abnormal behavior in social insects (bees),
associated with colony loss or death (LaLone et al., 2017). Although these AOPs are still under development
(LaLone et al., 2017), their descriptions align with other studies highlighting the potential risk of 2,4-D on insects
(Ejomah et al., 2020; Papaefthimiou et al., 2002).

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte

-------
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.524
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of lyase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 319

Number of chemicals tested: 309

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
11

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	75.5

Neutral control median absolute deviation, by plate: nmad	32.247

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	41.73%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 6.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1624

CLD_SLC01Bl_6hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for organic anion transporter (SLC01B1) at 6 hours

1.2	Assay Summary: CLD_6hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver primary
cell, with measurements taken at 6 hours after chemical dosing in a 96-well plate. CLD_SLC01Bl_6hr is one of
16 assay component(s) measured or calculated from the CLD_6hr assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals by Quantitative
Nuclease Protection Assay (qNPA) technology. Data from the assay component CLD_SLC01Bl_6hr was analyzed
at the endpoint, CLD_SLC01Bl_6hr, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used to
understand changes in the reporter gene as they relate to the gene SLC01B1. Furthermore, this assay endpoint
can be referred to as a primary readout. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the transporter intended target family, where subfamily is organic anion transporter.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling

-------
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research

-------
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.254
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transporter.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1

Inactive hit count: 0
-------
gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

29

12

140

quadratic-polynomialfpoly2) model: 32

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

4

95

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

-------
Neutral control well median response value, by plate: nmed

444.25

Neutral control median absolute deviation, by plate: nmad	54.486

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	12.76%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 4.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1625

CLD_SULT2A_6hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for sulfotransferase (SULT2A) at 6 hours

1.2	Assay Summary: CLD_6hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver primary
cell, with measurements taken at 6 hours after chemical dosing in a 96-well plate. CLD_SULT2A_6hr is one of 16
assay component(s) measured or calculated from the CLD_6hr assay. It is designed to make measurements of
mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals by Quantitative
Nuclease Protection Assay (qNPA) technology. Data from the assay component CLD_SULT2A_6hr was analyzed
at the endpoint, CLD_SULT2A_6hr, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used to
understand changes in the reporter gene as they relate to the gene SULT2A1. Furthermore, this assay endpoint
can be referred to as a primary readout. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the transferase intended target family, where subfamily is sulfotransferase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling

-------
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research

-------
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.444
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

40 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transferase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 317	Number of chemicals tested: 307

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
11

Inactive hit count: 0
-------
gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

22

24

132

quadratic-polynomialfpoly2) model: 38

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

0

84

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

-------
Neutral control well median response value, by plate: nmed

21.25

Neutral control median absolute deviation, by plate: nmad	5.93

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	30.55%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1626

CLD_UGTlAl_6hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for glucuronosyltransferase (UGT1A1) at 6 hours

1.2	Assay Summary: CLD_6hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver primary
cell, with measurements taken at 6 hours after chemical dosing in a 96-well plate. CLD_UGTlAl_6hr is one of
16 assay component(s) measured or calculated from the CLD_6hr assay. It is designed to make measurements
of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals by Quantitative
Nuclease Protection Assay (qNPA) technology. Data from the assay component CLD_UGTlAl_6hr was analyzed
at the endpoint, CLD_UGTlAl_6hr, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used to
understand changes in the reporter gene as they relate to the gene UGT1A1. Furthermore, this assay endpoint
can be referred to as a primary readout. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the transferase intended target family, where subfamily is glucuronosyltransferase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling

-------
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research

-------
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.203
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transferase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
2

Inactive hit count: 0
-------
gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

29

35

97

quadratic-polynomialfpoly2) model: 60

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

80

12

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

-------
Neutral control well median response value, by plate: nmed

360.5

Neutral control median absolute deviation, by plate: nmad	38.177

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	10.15%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 12.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1627

CLD_ABCBl_24hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for ABC Transporter (ABCB1) at 24 hours

1.2	Assay Summary: CLD_24hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. CLD_ABCBl_24hr
is one of 16 assay component(s) measured or calculated from the CLD_24hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_ABCBl_24hr was analyzed at the endpoint, CLD_ABCBl_24hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene ABCB1.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the transporter intended target family, where
subfamily is ABC transporter.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.151
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.04 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transporter.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
15

249

55

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

2

22

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

15

133

quadratic-polynomialfpoly2) model: 66

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

73

6

1

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	885

Neutral control median absolute deviation, by plate: nmad	89.327

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	10.51%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 6.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1628

CLD_ABCBll_24hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for ABC Transporter (ABCB11) at 48 hours

1.2	Assay Summary: CLD_24hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. CLD_ABCBll_24hr
is one of 16 assay component(s) measured or calculated from the CLD_24hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_ABCBll_24hr was analyzed at the endpoint, CLD_ABCBll_24hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene ABCB11.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the transporter intended target family, where
subfamily is ABC transporter.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.28
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.04 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transporter.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
7

111

201

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

5

26

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

24

142

quadratic-polynomialfpoly2) model: 58

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

6

57

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	101

Neutral control median absolute deviation, by plate: nmad	18.903

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	21.59%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 6.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1629

CLD_ABCG2_24hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for ABC Transporter (ABCG2) at 24 hours

1.2	Assay Summary: CLD_24hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. CLD_ABCG2_24hr
is one of 16 assay component(s) measured or calculated from the CLD_24hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_ABCG2_24hr was analyzed at the endpoint, CLD_ABCG2_24hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene ABCG2.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the transporter intended target family, where
subfamily is ABC transporter.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.246
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.04 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transporter.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
5

203

111

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

9

37

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

11

120

quadratic-polynomialfpoly2) model: 59

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

78

5

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	249

Neutral control median absolute deviation, by plate: nmad	32.617

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	12.17%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 5.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1630

CLD_ACTIN_24hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for Baseline Control (Actin) at 24 hours

1.2	Assay Summary: CLD_24hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. CLD_ACTIN_24hr is
one of 16 assay component(s) measured or calculated from the CLD_24hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_ACTIN_24hr was analyzed at the endpoint, CLD_ACTIN_24hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene ACTA1.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a background control function. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the background measurement intended target
family, where subfamily is baseline control.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative

-------
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of

-------
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.183
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.04 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
0

144

175

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

8

38

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

30

123

quadratic-polynomialfpoly2) model: 42

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

9

68

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	3301

Neutral control median absolute deviation, by plate: nmad	258.714

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	8.11%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 9.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1631

CLD_CYPlAl_24hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for xenobiotic metabolism (CYP1A1) at 24 hours

1.2	Assay Summary: CLD_24hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. CLD_CYPlAl_24hr
is one of 16 assay component(s) measured or calculated from the CLD_24hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_CYPlAl_24hr was analyzed at the endpoint, CLD_CYPlAl_24hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene CYP1A1.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the cyp intended target family, where subfamily is
xenobiotic metabolism.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.653
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

4nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cyp.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
145

119

55

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

7

12

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

36

120

quadratic-polynomialfpoly2) model: 59

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

2

62

1

14

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	11.25

Neutral control median absolute deviation, by plate: nmad	6.672

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	66.27%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 14.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1632

CLD_CYPlA2_24hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for xenobiotic metabolism (CYP1A2) at 24 hours

1.2	Assay Summary: CLD_24hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. CLD_CYPlA2_24hr
is one of 16 assay component(s) measured or calculated from the CLD_24hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_CYPlA2_24hr was analyzed at the endpoint, CLD_CYPlA2_24hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene CYP1A2.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the cyp intended target family, where subfamily is
xenobiotic metabolism.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.473
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.04 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cyp.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
138

136

45

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

20
29

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

38

97

quadratic-polynomial(poly2) model: 51

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

26

57

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	54.75

Neutral control median absolute deviation, by plate: nmad	8.896

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	16.42%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 26.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1633

CLD_CYP2 B6_24h r

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for xenobiotic metabolism (CYP1A2) at 24 hours

1.2	Assay Summary: CLD_24hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. CLD_CYP2B6_24hr
is one of 16 assay component(s) measured or calculated from the CLD_24hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_CYP2B6_24hr was analyzed at the endpoint, CLD_CYP2B6_24hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene CYP2B6.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the cyp intended target family, where subfamily is
xenobiotic metabolism.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.415
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

4nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cyp.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
165

120

34

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

23
21

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

33

86

quadratic-polynomialfpoly2) model: 59

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

72

22

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	32

Neutral control median absolute deviation, by plate: nmad	6.672

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	20.79%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 22.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1634

CLD_CYP2C19_24hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for xenobiotic metabolism (CYP1A2) at 24 hours

1.2	Assay Summary: CLD_24hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. CLD_CYP2C19_24hr
is one of 16 assay component(s) measured or calculated from the CLD_24hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_CYP2C19_24hr was analyzed at the endpoint, CLD_CYP2C19_24hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene CYP2C19.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the cyp intended target family, where subfamily is
xenobiotic metabolism.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.192
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.04 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cyp.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
0

177

142

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

3

32

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

28

111

quadratic-polynomialfpoly2) model: 52

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

87

6

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1448.5

Neutral control median absolute deviation, by plate: nmad	222.761

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	14.85%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 6.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1635

CLD_CYP2C9_24h r

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for xenobiotic metabolism (CYP1A2) at 24 hours

1.2	Assay Summary: CLD_24hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. CLD_CYP2C9_24hr
is one of 16 assay component(s) measured or calculated from the CLD_24hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_CYP2C9_24hr was analyzed at the endpoint, CLD_CYP2C9_24hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene CYP2C9.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the cyp intended target family, where subfamily is
xenobiotic metabolism.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.364
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.04 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cyp.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
1

169

149

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

9

51

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

93

17

quadratic-polynomialfpoly2) model: 37

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

2

6

104

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	2430

Neutral control median absolute deviation, by plate: nmad	522.246

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	24.73%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 6.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1636

CLD_CYP3A4_24hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for xenobiotic metabolism (CYP3A4) at 24 hours

1.2	Assay Summary: CLD_24hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. CLD_CYP3A4_24hr
is one of 16 assay component(s) measured or calculated from the CLD_24hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_CYP3A4_24hr was analyzed at the endpoint, CLD_CYP3A4_24hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene CYP3A4.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the cyp intended target family, where subfamily is
xenobiotic metabolism.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.72
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

4nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cyp.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
146

126

47

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

6

13

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

28

115

quadratic-polynomialfpoly2) model: 63

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

67

2

1

12

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	12

Neutral control median absolute deviation, by plate: nmad	8.896

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	74.52%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 12.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1637

CLD_GAPDH_24hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for Baseline Control (Glyceraldehyde 3-phosphate
dehydrogenase) at 24 hours

1.2	Assay Summary: CLD_24hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. CLD_GAPDH_24hr
is one of 16 assay component(s) measured or calculated from the CLD_24hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_GAPDH_24hr was analyzed at the endpoint, CLD_GAPDH_24hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene GAPDH.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a background control function. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the background measurement intended target
family, where subfamily is baseline control.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).

-------
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological

-------
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.16
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.04 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
0

148

171

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

1

15

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

34

140

quadratic-polynomialfpoly2) model: 42

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

7

79

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	5726.25

Neutral control median absolute deviation, by plate: nmad	670.506

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.9%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 7.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1638

CLD_GSTA2_24hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for alkyl and aryl transferase (GSTA2) at 24 hours

1.2	Assay Summary: CLD_24hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. CLD_GSTA2_24hr is
one of 16 assay component(s) measured or calculated from the CLD_24hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_GSTA2_24hr was analyzed at the endpoint, CLD_GSTA2_24hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene GSTA2.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the transferase intended target family, where
subfamily is alkyl and aryl transferase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.185
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.04 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transferase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
0

203

116

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

5

32

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

28

131

quadratic-polynomialfpoly2) model: 44

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

70

9

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	688.75

Neutral control median absolute deviation, by plate: nmad	90.439

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.92%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 9.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1639

CLD_HMGCS2_24hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for HMG-CoA synthase (HMGCS2) at 24 hours

1.2	Assay Summary: CLD_24hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. CLD_HMGCS2_24hr
is one of 16 assay component(s) measured or calculated from the CLD_24hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_HMGCS2_24hr was analyzed at the endpoint, CLD_HMGCS2_24hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene HMGCS2.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the lyase intended target family, where subfamily is
HMG-CoA synthase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed. This endpoint measures mRNA
induction as it relates to the HMGCS2 gene (USEPA, 2020), which is involved in cholesterol biosynthesis and
mitochondrial ketogenesis (Vila-Brau et al., 2011; Wang et al., 2019). Ketogenesis provides lipid-derived energy
for various organs during times of carbohydrate deprivation, such as fasting (Geisler et al., 2019; Hegardt, 1999).
The HMGCS2 gene was mapped to 17 AOPs through 10 key events mostly relating to mitochondrial dysfunction
(AOP Wiki, 2016). Interestingly, four of these AOPs (#77-80) relate to abnormal behavior in social insects (bees),
associated with colony loss or death (LaLone et al., 2017). Although these AOPs are still under development
(LaLone et al., 2017), their descriptions align with other studies highlighting the potential risk of 2,4-D on insects
(Ejomah et al., 2020; Papaefthimiou et al., 2002).

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast

-------
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.422
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.4 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
genes were measured: ABCB1, ABCB11, ABCG2, SLC01B1, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of lyase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 319

Number of chemicals tested: 309

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
5

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	164

Neutral control median absolute deviation, by plate: nmad	37.065

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	26.16%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 6.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1640

CLD_SLC01Bl_24hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for organic anion transporter (SLC01B1) at 24 hours

1.2	Assay Summary: CLD_24hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. CLD_SLC01Bl_24hr
is one of 16 assay component(s) measured or calculated from the CLD_24hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_SLC01Bl_24hr was analyzed at the endpoint, CLD_SLC01Bl_24hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene SLC01B1.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the transporter intended target family, where
subfamily is organic anion transporter.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.264
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.04 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transporter.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
0

147

172

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

7

40

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

21

121

quadratic-polynomialfpoly2) model: 41

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

7

81

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	401.75

Neutral control median absolute deviation, by plate: nmad	59.304

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	21.35%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 7.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1641

CLD_SULT2A_24hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for sulfotransferase (SULT2A) at 24 hours

1.2	Assay Summary: CLD_24hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. CLD_SULT2A_24hr
is one of 16 assay component(s) measured or calculated from the CLD_24hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_SULT2A_24hr was analyzed at the endpoint, CLD_SULT2A_24hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene SULT2A1.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the transferase intended target family, where
subfamily is sulfotransferase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.429
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.4 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transferase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
20

205

94

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

2

20

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

27

135

quadratic-polynomialfpoly2) model: 52

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

8

71

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	39.5

Neutral control median absolute deviation, by plate: nmad	7.413

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	18.79%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 8.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1642

CLD_UGTlAl_24hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for glucuronosyltransferase (UGT1A1) at 24 hours

1.2	Assay Summary: CLD_24hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 24 hours after chemical dosing in a 96-well plate. CLD_UGTlAl_24hr
is one of 16 assay component(s) measured or calculated from the CLD_24hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_UGTlAl_24hr was analyzed at the endpoint, CLD_UGTlAl_24hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene UGT1A1.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the transferase intended target family, where
subfamily is glucuronosyltransferase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.209
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.04 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transferase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
32

244

43

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

10
25

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

41

119

quadratic-polynomialfpoly2) model: 62

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

54

8

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	453.25

Neutral control median absolute deviation, by plate: nmad	30.764

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	6.77%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 8.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1643

CLD_ABCBl_48hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for ABC Transporter (ABCB1) at 48 hours

1.2	Assay Summary: CLD_48hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 48 hours after chemical dosing in a 96-well plate. CLD_ABCBl_48hr
is one of 16 assay component(s) measured or calculated from the CLD_48hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_ABCBl_48hr was analyzed at the endpoint, CLD_ABCBl_48hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene ABCB1.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the transporter intended target family, where
subfamily is ABC transporter.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.171
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transporter.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
15

216

88

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

6

29

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

29

106

quadratic-polynomialfpoly2) model: 75

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

62

8

1

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	965

Neutral control median absolute deviation, by plate: nmad	90.439

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.39%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 8.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1644

CLD_ABCBll_48hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for ABC Transporter (ABCB11) at 24 hours

1.2	Assay Summary: CLD_48hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 48 hours after chemical dosing in a 96-well plate. CLD_ABCBll_48hr
is one of 16 assay component(s) measured or calculated from the CLD_48hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_ABCBll_48hr was analyzed at the endpoint, CLD_ABCBll_48hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene ABCB11.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the transporter intended target family, where
subfamily is ABC transporter.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.314
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transporter.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
6

105

208

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

6

27

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

19

145

quadratic-polynomialfpoly2) model: 44

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

2

66

8

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	115.5

Neutral control median absolute deviation, by plate: nmad	29.281

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	25.73%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 8.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1645

CLD_ABCG2_48hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for ABC Transporter (ABCG2) at 48 hours

1.2	Assay Summary: CLD_48hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 48 hours after chemical dosing in a 96-well plate. CLD_ABCG2_48hr
is one of 16 assay component(s) measured or calculated from the CLD_48hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_ABCG2_48hr was analyzed at the endpoint, CLD_ABCG2_48hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene ABCG2.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the transporter intended target family, where
subfamily is ABC transporter.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.202
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transporter.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
18

181

120

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

1

24

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

14

135

quadratic-polynomialfpoly2) model: 67

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

69

5

1

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	111.15

Neutral control median absolute deviation, by plate: nmad	26.687

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.46%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 5.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1646

CLD_ACTIN_48hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for Baseline Control (Actin) at 48 hours

1.2	Assay Summary: CLD_48hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 48 hours after chemical dosing in a 96-well plate. CLD_ACTIN_48hr is
one of 16 assay component(s) measured or calculated from the CLD_48hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_ACTIN_48hr was analyzed at the endpoint, CLD_ACTIN_48hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene ACTA1.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a background control function. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the background measurement intended target
family, where subfamily is baseline control.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative

-------
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of

-------
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.199
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
0

151

168

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

11
29

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

33

106

quadratic-polynomialfpoly2) model: 49

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

2

79

8

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	4664.5

Neutral control median absolute deviation, by plate: nmad	400.302

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	8.36%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 8.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1647

CLD_CYPlAl_48hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for xenobiotic metabolism (CYP1A1) at 48 hours

1.2	Assay Summary: CLD_48hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 48 hours after chemical dosing in a 96-well plate. CLD_CYPlAl_48hr
is one of 16 assay component(s) measured or calculated from the CLD_48hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_CYPlAl_48hr was analyzed at the endpoint, CLD_CYPlAl_48hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene CYP1A1.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the cyp intended target family, where subfamily is
xenobiotic metabolism.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.615
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.4 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cyp.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
125

132

62

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

5

14

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

32

136

quadratic-polynomialfpoly2) model: 45

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

3

66

1

10

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	11.75

Neutral control median absolute deviation, by plate: nmad	7.784

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	67.93%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 10.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1648

CLD_CYPlA2_48hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for xenobiotic metabolism (CYP1A2) at 48 hours

1.2	Assay Summary: CLD_48hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 48 hours after chemical dosing in a 96-well plate. CLD_CYPlA2_48hr
is one of 16 assay component(s) measured or calculated from the CLD_48hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_CYPlA2_48hr was analyzed at the endpoint, CLD_CYPlA2_48hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene CYP1A2.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the cyp intended target family, where subfamily is
xenobiotic metabolism.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.353
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cyp.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
139

140

40

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

23
14

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

50

85

quadratic-polynomialfpoly2) model: 65

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

4

51

25

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	70

Neutral control median absolute deviation, by plate: nmad	21.498

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	47.62%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 25.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1649

CLD_CYP2 B6_48h r

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for xenobiotic metabolism (CYP1A2) at 48 hours

1.2	Assay Summary: CLD_48hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 48 hours after chemical dosing in a 96-well plate. CLD_CYP2B6_48hr
is one of 16 assay component(s) measured or calculated from the CLD_48hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_CYP2B6_48hr was analyzed at the endpoint, CLD_CYP2B6_48hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene CYP2B6.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the cyp intended target family, where subfamily is
xenobiotic metabolism.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.45
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cyp.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
167

99

53

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

19
39

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

24

74

quadratic-polynomialfpoly2) model: 58

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

24

78

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	46.5

Neutral control median absolute deviation, by plate: nmad	12.973

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	25.99%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 24.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1650

CLD_CYP2C19_48hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for xenobiotic metabolism (CYP1A2) at 48 hours

1.2	Assay Summary: CLD_48hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 48 hours after chemical dosing in a 96-well plate. CLD_CYP2C19_48hr
is one of 16 assay component(s) measured or calculated from the CLD_48hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_CYP2C19_48hr was analyzed at the endpoint, CLD_CYP2C19_48hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene CYP2C19.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the cyp intended target family, where subfamily is
xenobiotic metabolism.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.178
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cyp.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
0

156

163

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

7

25

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

37

121

quadratic-polynomialfpoly2) model: 43

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

4

0

2

78

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1328.5

Neutral control median absolute deviation, by plate: nmad	107.859

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	8.24%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 4.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1651

CLD_CYP2C9_48hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for xenobiotic metabolism (CYP1A2) at 48 hours

1.2	Assay Summary: CLD_48hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 48 hours after chemical dosing in a 96-well plate. CLD_CYP2C9_48hr
is one of 16 assay component(s) measured or calculated from the CLD_48hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_CYP2C9_48hr was analyzed at the endpoint, CLD_CYP2C9_48hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene CYP2C9.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the cyp intended target family, where subfamily is
xenobiotic metabolism.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.349
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cyp.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
6

157

156

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

10
44

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

20

102

quadratic-polynomialfpoly2) model: 47

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

86

8

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	836.75

Neutral control median absolute deviation, by plate: nmad	277.987

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	30.73%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 8.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1652

CLD_CYP3A4_48hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for xenobiotic metabolism (CYP3A4) at 48 hours

1.2	Assay Summary: CLD_48hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 48 hours after chemical dosing in a 96-well plate. CLD_CYP3A4_48hr
is one of 16 assay component(s) measured or calculated from the CLD_48hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_CYP3A4_48hr was analyzed at the endpoint, CLD_CYP3A4_48hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene CYP3A4.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the cyp intended target family, where subfamily is
xenobiotic metabolism.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.784
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.4 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cyp.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
152

106

61

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

8
15

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

34

115

quadratic-polynomialfpoly2) model: 69

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

56

14

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	3.5

Neutral control median absolute deviation, by plate: nmad	1.483

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	18.53%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 14.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1653

CLD_GAPDH_48hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for Baseline Control (Glyceraldehyde 3-phosphate
dehydrogenase) at 48 hours

1.2	Assay Summary: CLD_48hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 48 hours after chemical dosing in a 96-well plate. CLD_GAPDH_48hr
is one of 16 assay component(s) measured or calculated from the CLD_48hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_GAPDH_48hr was analyzed at the endpoint, CLD_GAPDH_48hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene GAPDH.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a background control function. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the background measurement intended target
family, where subfamily is baseline control.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).

-------
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological

-------
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.156
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of background measurement.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
0

155

164

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

11
21

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

29

125

quadratic-polynomialfpoly2) model: 43

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

7

80

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	5719

Neutral control median absolute deviation, by plate: nmad	442.556

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	7.62%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 7.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1654

CLD_GSTA2_48hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for alkyl and aryl transferase (GSTA2) at 48 hours

1.2	Assay Summary: CLD_48hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 48 hours after chemical dosing in a 96-well plate. CLD_GSTA2_48hr is
one of 16 assay component(s) measured or calculated from the CLD_48hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_GSTA2_48hr was analyzed at the endpoint, CLD_GSTA2_48hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene GSTA2.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the transferase intended target family, where
subfamily is alkyl and aryl transferase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.2
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transferase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
23

205

91

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

4
22

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

36

135

quadratic-polynomialfpoly2) model: 53

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

6

60

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	218

Neutral control median absolute deviation, by plate: nmad	37.436

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	22.56%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 6.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1655

CLD_HMGCS2_48hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for HMG-CoA synthase (HMGCS2) at 48 hours

1.2	Assay Summary: CLD_48hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 48 hours after chemical dosing in a 96-well plate. CLD_HMGCS2_48hr
is one of 16 assay component(s) measured or calculated from the CLD_48hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_HMGCS2_48hr was analyzed at the endpoint, CLD_HMGCS2_48hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene HMGCS2.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the lyase intended target family, where subfamily is
HMG-CoA synthase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed. This endpoint measures mRNA
induction as it relates to the HMGCS2 gene (USEPA, 2020), which is involved in cholesterol biosynthesis and
mitochondrial ketogenesis (Vila-Brau et al., 2011; Wang et al., 2019). Ketogenesis provides lipid-derived energy
for various organs during times of carbohydrate deprivation, such as fasting (Geisler et al., 2019; Hegardt, 1999).
The HMGCS2 gene was mapped to 17 AOPs through 10 key events mostly relating to mitochondrial dysfunction
(AOP Wiki, 2016). Interestingly, four of these AOPs (#77-80) relate to abnormal behavior in social insects (bees),
associated with colony loss or death (LaLone et al., 2017). Although these AOPs are still under development
(LaLone et al., 2017), their descriptions align with other studies highlighting the potential risk of 2,4-D on insects
(Ejomah et al., 2020; Papaefthimiou et al., 2002).

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast

-------
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.484
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
genes were measured: ABCB1, ABCB11, ABCG2, SLC01B1, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of lyase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq

-------
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 319

Number of chemicals tested: 309

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
13

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	370.5

Neutral control median absolute deviation, by plate: nmad	69.312

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	18.74%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 8.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1656

CLD_SLC01Bl_48hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for organic anion transporter (SLC01B1) at 48 hours

1.2	Assay Summary: CLD_48hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 48 hours after chemical dosing in a 96-well plate. CLD_SLC01Bl_48hr
is one of 16 assay component(s) measured or calculated from the CLD_48hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_SLC01Bl_48hr was analyzed at the endpoint, CLD_SLC01Bl_48hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene SLC01B1.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the transporter intended target family, where
subfamily is organic anion transporter.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.249
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transporter.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
0

130

189

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

5

33

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

28

128

quadratic-polynomial(poly2) model: 31

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

2

84

6

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	493.75

Neutral control median absolute deviation, by plate: nmad	111.936

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	22.49%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 6.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1657

CLD_SULT2A_48hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for sulfotransferase (SULT2A) at 48 hours

1.2	Assay Summary: CLD_48hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 48 hours after chemical dosing in a 96-well plate. CLD_SULT2A_48hr
is one of 16 assay component(s) measured or calculated from the CLD_48hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_SULT2A_48hr was analyzed at the endpoint, CLD_SULT2A_48hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene SULT2A1.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the transferase intended target family, where
subfamily is sulfotransferase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.384
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transferase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
29

212

78

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

3

31

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

28

122

quadratic-polynomialfpoly2) model: 43

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

82

8

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	280.25

Neutral control median absolute deviation, by plate: nmad	40.401

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	14.98%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 8.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1658

CLD_UGTlAl_48hr

1.	General Information

1.1	Assay Title: CellzDirect Gene Expression Assay for glucuronosyltransferase (UGT1A1) at 48 hours

1.2	Assay Summary: CLD_48hr is a cell-based, multiplexed-readout assay that uses hepatocyte, a human liver
primary cell, with measurements taken at 48 hours after chemical dosing in a 96-well plate. CLD_UGTlAl_48hr
is one of 16 assay component(s) measured or calculated from the CLD_48hr assay. It is designed to make
measurements of mRNA induction, a form of inducible reporter, as detected with chemiluminescence signals
by Quantitative Nuclease Protection Assay (qNPA) technology. Data from the assay component
CLD_UGTlAl_48hr was analyzed at the endpoint, CLD_UGTlAl_48hr, in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene UGT1A1.
Furthermore, this assay endpoint can be referred to as a primary readout. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the transferase intended target family, where
subfamily is glucuronosyltransferase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Formerly CellzDirect, this Contract Research Organization (CRO) is now part of the Invitrogen
brand of Thermo Fisher providing cell-based in vitro assay screening services using primary hepatocytes.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Human hepatocyte culture and qNPA assays are developed and performed for
commercial purposes by CellzDirect/lnvitrogen corporation (a part of Life Technologies).

1.9	Assay Throughput: 96-well plate. Cultures of primary human hepatocytes from 20,928 wells (96-well plate
format) were prepared, cultured, and harvested across 4 time points (0, 6, 24, or 48 h) for a minimum of 5
concentrations of each chemical or positive control.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose:

The CellzDirect assays used metabolically competent primary cultures of human hepatocytes to monitor
induction or suppression of xenobiotic metabolizing enzyme and transporter gene expression multiple liver-
relevant pathways.

2.2	Scientific Principles: This model system was used to characterize the concentration- and time-response of
chemicals for changes in expression of genes regulated by nuclear receptors. Nuclear receptor-mediated
regulation of gene expression represents an important hepatic response to exposure to both endogenous and
exogenous substrates (Nakata et al., 2006). These receptors regulate multiple gene targets involved in
absorption, metabolism, disposition, and excretion of endogenous and foreign chemicals (and metabolites).
Fourteen gene targets were monitored in quantitative nuclease protection assays: six representative
cytochromes P-450, four hepatic transporters, three Phase II conjugating enzymes, and one endogenous

-------
metabolism gene involved in cholesterol synthesis. These gene targets are sentinels of five major signaling
pathways: AhR, CAR, PXR, FXR, and PPARa. Besides gene expression, the relative potency and efficacy for these
chemicals to modulate cellular health and enzymatic activity were assessed.

2.3	Experimental System: adherent hepatocyte primary cell used. Primary cultures of human hepatocytes were
prepared from human liver tissue derived from two separate male donors (Hu776 and Hu778) and had initial
viabilities of 91 and 95%, respectively, at the time of plating. Donor Hu776 was a male Caucasian, 41 yr of age,
weighing 180 lb, standing 5'10" tall, who consumed 4-6 alcoholic beverages per week and occasionally chewed
tobacco. Donor Hu778 was a male Caucasian, 55 yr of age, weighing 162 lb, standing 5'6" tall, with no history of
alcohol or tobacco consumption. Tissue specimens used for these studies were derived from the normal margins
of resected liver tissue that was resected due to the presence of metastatic colon tumors. The research was
carried out in accordance with the principles of the current version of the Helsinki Declaration. Each patient
whose tissue was used in this study was fully consented under an institutional review board (IRB) application
approved by the individual institutions from patients undergoing liver resection surgery. Eligible patients were
between the ages of 18 and 75 yr and were not restricted to any gender or ethnic grouping. All samples were
collected and preserved at the participating institution and shipped to CellzDirect's facility in Durham, NC, for
processing under protocols approved during the IRB application process. Hepatocytes were isolated by a
modification of the two-step collagenase perfusion method described previously (LeCluyse et al., 2005). Final
cell viability, prior to plating, was determined by the trypan blue exclusion test and was >90% in both
preparations. Following isolation, hepatocytes were resuspended in Dulbecco's modified Eagle's medium
(DMEM) containing 5% fetal calf serum, insulin (4 ug/ml), and DEX (1 uM) and added to 96-well plates (BioCoat,
BD Biosciences, San Jose, CA) coated with a simple collagen, type I, substratum. Hepatocytes were allowed to
attach for 4-6 h at 37C in a humidified culture chamber with 95% relative humidity/5% air/C02. After
attachment, culture vessels were swirled and medium containing debris and unattached cells was aspirated.
Fresh ice-cold serum-free DMEM/Ham's F12 containing 50 nM DEX, 6.25 ug/ml insulin, 6.25 ug/ml transferrin,
6.25 ng/ml selenium (ITS+), and 0.25 mg/ml ECM was added to the culture vessels and immediately returned to
the culture chamber. Medium was changed on a daily basis thereafter. Cultures of hepatocytes were maintained
for 24-48 h prior to initiating experiments with the chemicals.

2.4	Metabolic Competence: This assay utilizes a metabolically competent, in vitro hepatocyte culture system
consisting of primary cultures of human hepatocytes prepared from human liver tissue derived from two
separate male donors. Primary human hepatocyte cultures are useful in vitro model systems of human liver
because when cultured under appropriate conditions the hepatocytes retain liver-like functionality such as
metabolism, transport, and cell signaling.

2.5	Exposure Regime: Human hepatocyte cultures were treated daily for two consecutive days with fresh dosing
solutions containing appropriate concentrations of the 320 ToxCast chemicals (chemical identities were blinded
to CellzDirect and other ToxCast collaborators), vehicle control (0.2% DMSO), and positive control inducers as
summarized in Table 1. Due to the large number of treatment groups and chemicals examined, the ToxCast
chemicals were divided into two groups (spanning two independent cultures of hepatocytes from separate
donor preparations (Hu778: ToxCast plates 1 and 2; Hu776: ToxCast plates 3 and 4). Each preparation of human
hepatocyte cultures was treated with respective vehicle control, medium only control, set of six positive
controls/reference chemicals for the five hepatic receptor pathways, and a subset of eight of the ToxCast
chemicals (indoxacarb, pyrithiobac-sodium, norflurazon, cyhalofop-butyl, methomyl, thiram, acetochlor, and
propiconazole). These replicate data, coupled with internal replicates designed within the blinded ToxCast
chemical library, provided additional data to evaluate interindividual differences between hepatocyte
preparations and their potential impact on chemical profiles. Cell morphology and integrity were evaluated
using phase-contrast microscopy as an indicator of hepatocyte cell health (Tyson & Green, 1987). Cultures for
each treatment group (i.e., media, vehicle [0.2% DMSO], positive control inducers [multiple concentrations],
and the ToxCast chemicals [multiple concentrations]) were observed and cell morphology was assessed relative
to vehicle control cultures at each harvest time point (0, 6, 24, or 48 h). Any discernable morphological
alterations such as changes in cell shape, nucleus size/shape, cytoplasmic alterations, and accumulation of
vacuoles suggestive of dilated organelles and lipid droplets (Guillouzo et al., 1997) that were observed as a

-------
consequence of chemical exposure were recorded in images captured using a Zeiss Axiovert inverted research
microscope equipped with phase-contrast optics, a 3 CCD camera, and a computer with image capture and
analysis software. Images were evaluated at the conclusion of the study to assess changes in cell integrity and
account for the effects of chemical exposure on uncharacteristic changes in concentration-dependent gene
expression profiles. The results from these determinations of apparent cytotoxicity were annotated (yes/no)
alongside the corresponding qNPA data for reference. At the conclusion of each treatment period, hepatocyte
cultures (96-well) were washed with 1 volume of HBSS, lysed by addition of 25 ul ArrayPlate lysis buffer (HTG,
Tucson, AZ), and 70 ul/well of Denaturation Oil, denatured by incubation at 95C for 10 min, and frozen at
approximately -70C until analysis by nuclease protection assay (qNPA) (Roberts et al., 2007). For qNPA analysis,
cell lysates were thawed at 50C for approximately 30 min, qNPA probes were added, and samples were
incubated at 95C for 10 min to begin the detection process by denaturing the target RNA, dissociating the
duplexes and secondary structure hybridization. At the conclusion of the hybridization period, SI nuclease
reagent was added to each sample to digest all nonprotected nucleotides at 50C for 60-90 min. At the
conclusion of the SI nuclease digestions, all reactions were stopped by transfer of all the samples to fresh plates
containing stop solution and incubated at 95C for 15 min to deactivate the enzyme, dissociate the mRNA/ DNA
probe heterodimers, and hydrolyze the resulting single stranded mRNA, leaving a stoichiometric amount of
single-stranded DNA nuclease protection probe, unmodified in sequence, as the only intact oligonucleotide left
in the sample. Neutralization solution was subsequently applied to cooled (room temperature) plates, and
samples were transferred to ArrayPlates for overnight incubations at 50C to allow probes to be captured onto
programmed locations on the ArrayPlates. Half of the nucleotides comprising each nuclease protection probe
are utilized for capture hybridization to the array. At the completion of the array capture of probes, plates were
washed, detection linkers were hybridized to the other half of each nuclease protection probe, plates were
washed again, and detection enzymes were applied. The final step in the process was the imaging of the plates
with the OMIX Imaging System (HTG, Tucson AZ). The quantity of protected nuclease protection probe, and
hence target mRNA in each well, was proportional to the luminescence intensity of the labeled detection
oligonucleotides that bind each of the 16 spots within each well of a 96-well plate. Luminescence data were
generated using the OMIX Imaging System software to generate endogenous control normalized data. These
data were exported for bioinformatic analyses.

Baseline median absolute deviation for the assay (bmad): 0.188
Response cutoff threshold used to determine hit calls: 1

Detection technology used: Quantitative Nuclease Protection Assay (qNPA) (Luminescence)

2.6 Response: This platform used quantitative Nuclease Protection Assays (qNPA) to simultaneously monitor an
array of liver-relevant gene targets, primarily those regulated by nuclear receptors which serve as sentinels for
key toxicant response mediated pathways; including gene products involved in absorption, metabolism,
disposition and excretion of endogenous and foreign chemicals. The mRNA levels of 14 target and 2 control
genes were measured: ABCB1, ABCB11, ABCG2, SLCOIBI, CYP1A1, CYP1A2, CYP2B6, CYP2C9, CYP2C19, CYP3A4,
UGT1A1, GSTA2, SULT2A1, HMGCS2, and control genes ACTB, GAPDH. These genes represent 5 nuclear receptor
signaling pathways: AHR, CAR, PXR, PPARa and FXR. In addition to measuring gene expression levels, visual
observation of changes to cellular morphology (relative to vehicle control) are assessed at multiple time points
(0, 6, 24, and 48 h) to quantify morphological alterations such as changes in cell shape, nucleus size/shape,
cytoplasmic alterations, accumulation of vacuoles etc. resulting from xenobiotic exposures.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:

0.004 nM
Key positive control:

Target (nominal) number of replicates:

4

Standard maximum concentration tested:

40 nM
Neutral vehicle control:
DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of transferase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Data from an OMIX imager was received in comma-separated values (.csv) format with plate and
well identifiers. These data were annotated with matching chemical and dosage information and compiled in a
database. Foldover-control values for each respective time point were calculated for each treatment group.
mRNA induction data is transformed to log2 fold induction over DMSO (vehicle control) signal and 5-point
concentration response curves were generated following 48 hours of continuous exposure to test chemicals.
Performance was compared to known inducers of xenobiotic response (e.g., phenobarbital and rifampicin) and
response data were generated at 4 different time points during the test duration (0, 6, 24 and 48 hours).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).), 3: rmneg (Exclude wells
with negative corrected response values (cval) and downgrading their well quality (wllq); if cval < 0, wllq
= 0.), 4: rmzero (Exclude wells with corrected response values (cval) equal to zero and downgrading their
well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

35: resp.logfc (Calculate the normalized response (resp) as the fold change of logged, i.e. the difference
between corrected (cval) and baseline (bval) log-scale values.), 38: bval.apid.nwllstcwllslowconc.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected

-------
values (cval) for neutral control wells (wilt = n) or wells with a concentration index (cndx) of 1 or 2 and
well type of test compound (wilt = t) or gain-of-signal control in multiple concentrations (wilt = c).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

11: log2_2 (Add a cutoff value log2(2). Typically for fold change data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 319	Number of chemicals tested: 309

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
49

238

32

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

12
22

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

44

94

quadratic-polynomialfpoly2) model: 80

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

5

59

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	704.75

Neutral control median absolute deviation, by plate: nmad	87.844

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.48%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 5.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Rotroff DM, Beam AL, Dix DJ, Farmer A, Freeman KM, Houck KA, Judson RS, LeCluyse EL, Martin
MT, Reif DM, Ferguson SS. Xenobiotic-metabolizing enzyme and transporter gene expression in primary cultures
of human hepatocytes modulated by ToxCast chemicals. J Toxicol Environ Health B Crit Rev. 2010 Feb;13(2-
4):329-46. doi: 10.1080/10937404.2010.483949. PMID: 20574906.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2771

IU F_N PClb_proliferation_BrdU_72hr

1.	General Information

1.1	Assay Title: Leibniz Research Institute for Environmental Medicine (IUF) 72-hour Neural Progenitor Cell
Bromodeoxyuridine (Brdll) Proliferation Assay (NPClb)

1.2	Assay Summary: IUF_NPC1 is a cell-based, multiplexed assay that uses hNPC, a human primary neural progenitor
cells, with measurements taken at 72 hours after chemical dosing in a 96-well plate. Human neural progenitor
cells (hNPC) are generated from human fetal brain cortex at gestational week (GW) 16-19. Grown in suspension
culture and under proliferative conditions (proliferation media and growth factors), these cells represent neural
progenitor cell proliferation. In the NPC1 assay, hNPC's are exposed to the test compound for 72h. Cell
proliferation is assessed as an increase in sphere area using automated phase contrast imaging and as
incorporation of Bromodeoxyuridine (Brdll) during DNA synthesis using a luminescence-based cell proliferation
ELISA. In parallel, the cell viability is assessed using an alamar blue viability assay and the cytotoxicity using a
lactate dehydrogenase dependent membrane integrity assay. IUF_NPClb_proliferation_Brdll_72hr is one of 4
assay components measured from the IUF_NPC1 assay. It is designed to measure cell proliferation as assessed
by the incorporation of Bromodeoxyuridine (Brdll) in the last 16 h of a 72 h compound exposure using a
luminescence-based cell proliferation ELISA. Data from the IUF_NPClb_proliferation_Brdll component was
analyzed at the endpoint IUF_NPClb_proliferation_Brdll in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of growth reporter, gain or loss-of-signal
activity can be used to understand cell proliferation effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Leibniz Research Institute for Environmental Medicine (IUF) is a German research institution in
Diisseldorf. The IUF research mission is the molecular prevention of environmentally induced health disorders.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: For the source cells, Lonza holds donor consent and legal authorization that provides
permission for all research use.

1.9	Assay Throughput: 96-well plate. The methods described are set up in a 96 well plate format with automated
image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates, compound
dilutions, feeding, cell viability and cytotoxicity assay can be automated using a liquid handling system. In the
fully automated set up 10 plates with 8 conditions and 5 replicates per condition can be run in one week. This
results in the generation of 400 data points for each endpoint within one week (excluding all controls). The
throughput is therefore estimated as medium to high.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to chemiluminescence signals based on BrdU incorporation during DNA synthesis in
proliferating cells are correlated to proliferation of the system.

-------
The human developing central nervous system is more vulnerable to the adverse effects of chemical agents than
the adult brain. At present, due to the knowledge gap concerning hazard identification for human
neurodevelopmental toxicity (DNT), there is an urgent need fortestingand subsequently regulation of chemicals
for their potential to interfere with the developing nervous system. Primary human neural progenitor cells
(hNPC) cultivated as three-dimensional floating spheres are able to represent several key processes during brain
development. In the NPC1 assay, hNPC are exposed to the test compound for 72 hours. Cell proliferation is
assessed as an increase in sphere area using automated phase contrast imaging and as incorporation of
Bromodeoxyuridine (Brdll) using a luminescence-based cell proliferation ELISA. In parallel, the cell viability is
assessed using an alamar blue viability assay. In the neural progenitor cell migration and differentiation assay
(NPC2-5), hNPC are plated on an extracellular matrix, and migrate and differentiate out of the sphere core.
Thereby the processes radial glia migration, migration of neurons and oligodendrocytes as well as differentiation
into neurons and oligodendrocytes can be studied in combination with general cell viability and cytotoxicity.
Cell migration and differentiation are critical process during brain development that, if disturbed lead to
alterations in brain development and causes cognitive dysfunction. Currently these processes are assessed in
the OECD TG426 by neuropathological evaluation of certain brain regions as well as neurobehavioral tests.

2.2	Scientific Principles: Primary hNPCs are isolated from the fetal brain cortices and can be used to measure
proliferation, a process of brain growth during the fetal phase of prenatal development. The test system
therefore measures adverse events in the young (fetal) developing brain. Different types of NPC exist in the
developing brain. Besides ventricular zone NPC, radial glia cells serve as cortical progenitor cells responsible for
cortical expansion and folding. As whole cortices were used for cell preparation, this is not a specific NPC type
but rather a mix of NPCs found in fetal human cortex during development. The toxicological events that are
modeled concern events that influence proliferation of NPCs found in human cortex during the fetal phase.

2.3	Experimental System: suspension hNPC primary cell used. Primary human neural progenitor cells (hNPC) are
provided as cryopreserved 3D neurospheres from Lonza, Verviers, Belgium. Material originates from the human
brain cortex of different gestational ages (GW16-19). Sex is either specified or determined before the cells are
used. Grown in suspension culture and under proliferative conditions (proliferation media and growth factors)
these cells represent neural progenitor cell proliferation. 1x106 hNPC are obtained from Lonza (#PT-2599) and
expanded. Lonza provides the cells with a viability of at least 20% FACS analysis indicates proliferating
neurospheres positive for nestin and Ki67. Following differentiation on a poly D-lysin/laminin matrix in the
absence of growth factors, the cells test positive for TUBB3, GFAP, Nestin and 04 (Baumann et al., 2015;
Schmuck et al., 2017). Within the first three days after thawing, 100 uL of spheres (at least 20) are plated on
poly Dlysin/laminin matrix in an 8-chamber slide with 500 uL N2 Media. On day one after plating greater than
80% of spheres need to be differentiated for the cell to be used in experiments. Differentiation towards the
final test system: Cells are frozen in liquid nitrogen and after thawing have to be cultivated in proliferation
medium at 37C and 5 % C02. The medium contains Dulbecco's modified Eagle medium and Hams F12 (3:1)
supplemented with 2% B27, 20 ng/mL EGF, 20 ng/mL recombinant human FGF, 1% penicillin and streptomycin.
The thawing is performed by repeated addition and removal of proliferation medium until all cells are
transferred into suspension in a tissue culture flask. The cells are carefully resuspended and distributed to 10
cm petri dishes filled with fresh, prewarmed proliferation medium. The cells are fed by replacing half the
medium with new medium every two to three days. At each feeding day the culture is checked for impurities
which are removed into a new petri dish. From this dish mistakenly, sorted spheres can be rescued and placed
back in the original culture dish. After 3-4 weeks neurospheres reach the acceptable size of 0.2 - 0.5 mm for
passaging by mechanical dissociation. Thereby, neurospheres are cut into small pieces (0.15 - 0.25 mm;
depending on the desired sphere size), which round up again to uniform sized neurospheres within 1 day in
proliferation medium. By using this method neurospheres are expanded every week. Starting at week 2 poly
hema coated dishes are used for the cultivation procedure. Critical consumable: the cultivation medium does
not contain serum or serum replacement. The use of epidermal growth factor (EGF) and recombinant human
fibroblast growth factor (FGF) is critical for sphere growth. FGF contains 1% bovine serum albumin and is thus
prone to batch effects. Critical handling: The thawing media contains DMSO in a concentration that affects cell
health which is why thawed cells should quickly be diluted in proliferation media (30 mL of media for one vial
of cells). It is recommended to add FGF into medium directly before thawing. At the end of week two of the

-------
expansion period (see below), the spheres should be transferred to petri dishes coated with poly hema to
prevent cell attachment, attached cells that are not differentiated can be detached using a 1000 uL pipet. To
avoid repeated attachment, all cells should be transferred to a new poly hema coated petri dish. Medium that
contains FGF should not be stored longer than 1 week at 4C. During the first two weeks, medium should be
removed using a 1000 uL pipet to keep the accidental removal of small spheres to a minimum. The neurospheres
should be well distributed in the petri dish to circumvent aggregation, which is especially important after
mechanical dissociations.

2.4	Metabolic Competence: Primary hNPC under proliferating and differentiating conditions do not express CYP1A1
and CYP1B1 (Gassmann et al., 2010). Primary hNPC during differentiation, have the capacity to up-regulate
glutathione-dependent protective strategies upon reactive oxygen species (ROS) exposure Masjosthusmann et
al, 2019). Gene expression levels of genes involved in the antioxidative defense (glutathione peroxidase 1
(GPX1), superoxide dismutase 1 (SOD1), catalase (CAT)) were comparable between the in vitro system and
developing human brains in vivo and show similar expression levels (Masjosthusmann et al., 2019). Other
metabolic pathways are not characterized.

2.5	Exposure Regime: After the cell expansion period, the cells are cultured for up to four weeks in which they are
passaged every week. Between one to three days after passaging, depending on the size chosen for passaging,
spheres at a size of 0.3 mm are used in the assay. For the assessment of neural progenitor cell proliferation, the
spheres are plated in poly-Hema coated 96-well U-bottom plates filled with proliferation medium containing
growth factors (EGF and FGF). One 0.25 - 0.35 mm big sphere is plated in the middle of each well. Within 3 days
NPCs proliferate and grow in size. Cultivation during the test method is performed at 37C and 5% C02 at a pH
of 7.2-7.6. As a positive control, spheres are cultivated in the absence of growth factors (EGF and FGF), which
dramatically reduces proliferation. Exposure starts on the plating day (day 0) and is continued over three days,
without chemical renewal, until the experiment is terminated. The assay is terminated by the assessment of cell
viability, cytotoxicity, and proliferation by Brdll. All endpoints are generated from the same experimental run
and from each well/sphere in the 96-well plate.

Baseline median absolute deviation for the assay (bmad): 7.171
Response cutoff threshold used to determine hit calls: 30

Detection technology used: Luminescence-based cell proliferation ELISA (Chemiluminescence)

2.6 Response: This assay uses primary human neural progenitor cells (hNPCs) from human cortex (gestation week
16 - 19) to measure changes in hNPC proliferation. Biological responses that are measured include fetal hNPC
proliferation, viability and cytotoxicity as quantified by sphere size, DNA synthesis as chemiluminescence
measurement, viability and cytotoxicity as fluorescence intensity. Primary endpoints: 1) Proliferation by area
(72hours; NPCla) is assessed as the slope of the increase in sphere size (amount of pixels in the bright-field
image, sphere area) over 72 hours measured by brightfield microscopy using high content imaging at 0 hours,
24 hours, 48 hours, and 72 hours. 2) Proliferation by Brdll (72hours; NPClb) is assessed as Brdll incorporation
(as an indirect measure of DNA synthesis) over the last 16 hours of compound exposure. It is measured as a
luminescence signal (relative luminescence unit) in a multi-plate reader after 72 hours. Secondary endpoints: 1)
Cytotoxicity at 72 hours is assessed as membrane integrity by measuring the amount of LDH leaked from cells
with damaged plasma membranes. LDH-dependent reduction of resazurin to resorufin is measured in the
supernatant of each well as fluorescence of the reaction product resorufin (relative fluorescence unit) in a multi-
plate reader after 72 hours of compound exposure. 2) Viability at 72 hours is assessed as mitochondrial activity

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

15.4321 nM
Key positive control:

media without growth factors

Target (nominal) number of replicates:

13

Standard maximum concentration tested:

11250 nM
Neutral vehicle control:

DMSO

-------
by measuring the amount of resazurin reduced to fluorescent resorufin (relative fluorescence unit) in a multi-
plate reader in the last two hours of the 72 hours proliferation and compound exposure period.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The raw data format is different depending on the endpoints. For all endpoints assessed in a
multi-plate reader (viability, cytotoxicity, Brdll incorporation) the raw data formats are excel files containing
values (one for each endpoint, timepoint and well) measured as relative fluorescence/luminescence units. These
values are transferred from the original excel file into the AXES sheet. The original excel output file is saved for
traceability of the data. The sphere size is automatically measured in the Cellomics scan software (Version 6.6.0;
Thermo Scientific) and copied into the AXES sheet. Original brightfield images are archived for 10 years. If not
otherwise stated, all data processing steps are performed in an R based evaluation tool that was designed for
data processing, curve fitting and point of departure evaluation of in vitro concentration response toxicity data.
Data processing describes all processing steps of raw data that are necessary to obtain the final response values
including the normalization, curve fitting and benchmark concentration calculation. Processing (or pre-
processing) steps for the Proliferation by Brdll endpoint involve subtraction of mean Brdll background from
each raw response value by plate. Background corrected response (RLU)=raw response
(RLU)-Background BrdU (RLU). The data is normalized to the solvent control. For the normalization to the
solvent control, each replicate data point is normalized to the median of the solvent control in the respective
experiment. Mathematical procedures to define outliers are not applied. Data points from wells where
technical problems are known or obvious are excluded from the analysis. Possible technical problems include:
pipetting errors, spillover from lysis, or problems in fixation of singularized cells. All wells with technical
problems are marked in the AXES sheet.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0; pval
= 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 30: osd_coff_bmr
(Overwrite the osd value so that bmr == coff)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 238	Number of chemicals tested: 223

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	175099

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	9908

Positive control well median absolute deviation, by plate: pmad	0

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-Inf

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 32.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Baumann J, Gassmann K, Masjosthusmann S, DeBoer D, Bendt F, Giersiefer S, Fritsche E.
Comparative human and rat neurospheres reveal species differences in chemical effects on
neurodevelopmental key events. Arch Toxicol. 2016 Jun;90(6):1415-27. doi: 10.1007/s00204-015-1568-8. Epub
2015 Jul 28. PMID: 26216354.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2773

IU F_N PCla_proliferation_area_72hr

1.	General Information

1.1	Assay Title: Leibniz Research Institute for Environmental Medicine (IUF) 72-hour Neural Progenitor Cell
Proliferation Area Assay (NPCla)

1.2	Assay Summary: IUF_NPC1 is a cell-based, multiplexed assay that uses hNPC, a human primary neural progenitor
cells, with measurements taken at 72 hours after chemical dosing in a 96-well plate. Human neural progenitor
cells (hNPC) are generated from human fetal brain cortex at gestational week (GW) 16-19. Grown in suspension
culture and under proliferative conditions (proliferation media and growth factors), these cells represent neural
progenitor cell proliferation. In the NPC1 assay, hNPC's are exposed to the test compound for 72h. Cell
proliferation is assessed as an increase in sphere area using automated phase contrast imaging and as
incorporation of Bromodeoxyuridine (Brdll) during DNA synthesis using a luminescence-based cell proliferation
ELISA. In parallel, the cell viability is assessed using an alamar blue viability assay and the cytotoxicity using a
lactate dehydrogenase dependent membrane integrity assay. IUF_NPCla_proliferation_area_72hr is one of 4
assay components measured from the IUF_NPC1 assay. It is designed to measure cell proliferation as assessed
by an increase in sphere area over 72 h using automated phase contrast imaging. Data from the
IUF_NPCla_proliferation_area component was analyzed at the endpoint IUF_NPCla_proliferation_area in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of growth reporter, gain or loss-of-signal activity can be used to understand cell proliferation effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Leibniz Research Institute for Environmental Medicine (IUF) is a German research institution in
Diisseldorf. The IUF research mission is the molecular prevention of environmentally induced health disorders.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: For the source cells, Lonza holds donor consent and legal authorization that provides
permission for all research use.

1.9	Assay Throughput: 96-well plate. The methods described are set up in a 96 well plate format with automated
image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates, compound
dilutions, feeding, cell viability and cytotoxicity assay can be automated using a liquid handling system. In the
fully automated set up 10 plates with 8 conditions and 5 replicates per condition can be run in one week. This
results in the generation of 400 data points for each endpoint within one week (excluding all controls). The
throughput is therefore estimated as medium to high.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in sphere size based on microscopic measurements are correlated to the proliferation of
the system.

The human developing central nervous system is more vulnerable to the adverse effects of chemical agents than
the adult brain. At present, due to the knowledge gap concerning hazard identification for human

-------
neurodevelopmental toxicity (DNT), there is an urgent need fortestingand subsequently regulation of chemicals
for their potential to interfere with the developing nervous system. Primary human neural progenitor cells
(hNPC) cultivated as three-dimensional floating spheres are able to represent several key processes during brain
development. In the NPC1 assay, hNPC are exposed to the test compound for 72 hours. Cell proliferation is
assessed as an increase in sphere area using automated phase contrast imaging and as incorporation of
Bromodeoxyuridine (Brdll) using a luminescence-based cell proliferation ELISA. In parallel, the cell viability is
assessed using an alamar blue viability assay. In the neural progenitor cell migration and differentiation assay
(NPC2-5), hNPC are plated on an extracellular matrix, and migrate and differentiate out of the sphere core.
Thereby the processes radial glia migration, migration of neurons and oligodendrocytes as well as differentiation
into neurons and oligodendrocytes can be studied in combination with general cell viability and cytotoxicity.
Cell migration and differentiation are critical process during brain development that, if disturbed lead to
alterations in brain development and causes cognitive dysfunction. Currently these processes are assessed in
the OECD TG426 by neuropathological evaluation of certain brain regions as well as neurobehavioral tests.

2.2	Scientific Principles: Primary hNPCs are isolated from the fetal brain cortices and can be used to measure
proliferation, a process of brain growth during the fetal phase of prenatal development. The test system
therefore measures adverse events in the young (fetal) developing brain. Different types of NPC exist in the
developing brain. Besides ventricular zone NPC, radial glia cells serve as cortical progenitor cells responsible for
cortical expansion and folding. As whole cortices were used for cell preparation, this is not a specific NPC type
but rather a mix of NPCs found in fetal human cortex during development. The toxicological events that are
modeled concern events that influence proliferation of NPCs found in human cortex during the fetal phase.

2.3	Experimental System: suspension hNPC primary cell used. Primary human neural progenitor cells (hNPC) are
provided as cryopreserved 3D neurospheres from Lonza, Verviers, Belgium. Material originates from the human
brain cortex of different gestational ages (GW16-19). Sex is either specified or determined before the cells are
used. Grown in suspension culture and under proliferative conditions (proliferation media and growth factors)
these cells represent neural progenitor cell proliferation. 1x106 hNPC are obtained from Lonza (#PT-2599) and
expanded. Lonza provides the cells with a viability of at least 20% FACS analysis indicates proliferating
neurospheres positive for nestin and Ki67. Following differentiation on a poly D-lysin/laminin matrix in the
absence of growth factors, the cells test positive for TUBB3, GFAP, Nestin and 04 (Baumann et al., 2015;
Schmuck et al., 2017). Within the first three days after thawing, 100 uL of spheres (at least 20) are plated on
poly Dlysin/laminin matrix in an 8-chamber slide with 500 uL N2 Media. On day one after plating greater than
80% of spheres need to be differentiated for the cell to be used in experiments. Differentiation towards the
final test system: Cells are frozen in liquid nitrogen and after thawing have to be cultivated in proliferation
medium at 37C and 5 % C02. The medium contains Dulbecco's modified Eagle medium and Hams F12 (3:1)
supplemented with 2% B27, 20 ng/mL EGF, 20 ng/mL recombinant human FGF, 1% penicillin and streptomycin.
The thawing is performed by repeated addition and removal of proliferation medium until all cells are
transferred into suspension in a tissue culture flask. The cells are carefully resuspended and distributed to 10
cm petri dishes filled with fresh, prewarmed proliferation medium. The cells are fed by replacing half the
medium with new medium every two to three days. At each feeding day the culture is checked for impurities
which are removed into a new petri dish. From this dish mistakenly, sorted spheres can be rescued and placed
back in the original culture dish. After 3-4 weeks neurospheres reach the acceptable size of 0.2 - 0.5 mm for
passaging by mechanical dissociation. Thereby, neurospheres are cut into small pieces (0.15 - 0.25 mm;
depending on the desired sphere size), which round up again to uniform sized neurospheres within 1 day in
proliferation medium. By using this method neurospheres are expanded every week. Starting at week 2 poly
hema coated dishes are used for the cultivation procedure. Critical consumable: the cultivation medium does
not contain serum or serum replacement. The use of epidermal growth factor (EGF) and recombinant human
fibroblast growth factor (FGF) is critical for sphere growth. FGF contains 1% bovine serum albumin and is thus
prone to batch effects. Critical handling: The thawing media contains DMSO in a concentration that affects cell
health which is why thawed cells should quickly be diluted in proliferation media (30 mL of media for one vial
of cells). It is recommended to add FGF into medium directly before thawing. At the end of week two of the
expansion period (see below), the spheres should be transferred to petri dishes coated with poly hema to
prevent cell attachment, attached cells that are not differentiated can be detached using a 1000 uL pipet. To

-------
avoid repeated attachment, all cells should be transferred to a new poly hema coated petri dish. Medium that
contains FGF should not be stored longer than 1 week at 4C. During the first two weeks, medium should be
removed using a 1000 uLpipet to keep the accidental removal of small spheres to a minimum. The neurospheres
should be well distributed in the petri dish to circumvent aggregation, which is especially important after
mechanical dissociations.

2.4	Metabolic Competence: Primary hNPC under proliferating and differentiating conditions do not express CYP1A1
and CYP1B1 (Gassmann et al., 2010). Primary hNPC during differentiation, have the capacity to up-regulate
glutathione-dependent protective strategies upon reactive oxygen species (ROS) exposure Masjosthusmann et
al, 2019). Gene expression levels of genes involved in the antioxidative defense (glutathione peroxidase 1
(GPX1), superoxide dismutase 1 (SOD1), catalase (CAT)) were comparable between the in vitro system and
developing human brains in vivo and show similar expression levels (Masjosthusmann et al., 2019). Other
metabolic pathways are not characterized.

2.5	Exposure Regime: After the cell expansion period, the cells are cultured for up to four weeks in which they are
passaged every week. Between one to three days after passaging, depending on the size chosen for passaging,
spheres at a size of 0.3 mm are used in the assay. For the assessment of neural progenitor cell proliferation, the
spheres are plated in poly-Hema coated 96-well U-bottom plates filled with proliferation medium containing
growth factors (EGF and FGF). One 0.25 - 0.35 mm big sphere is plated in the middle of each well. Within 3 days
NPCs proliferate and grow in size. Cultivation during the test method is performed at 37C and 5% C02 at a pH
of 7.2-7.6. As a positive control, spheres are cultivated in the absence of growth factors (EGF and FGF), which
dramatically reduces proliferation. Exposure starts on the plating day (day 0) and is continued over three days,
without chemical renewal, until the experiment is terminated. The assay is terminated by the assessment of cell
viability, cytotoxicity, and proliferation by Brdll. All endpoints are generated from the same experimental run
and from each well/sphere in the 96-well plate.

Baseline median absolute deviation for the assay (bmad): 3.536

Response cutoff threshold used to determine hit calls: 30

Detection technology used: Automated phase contrast imaging (Microscopy)

2.6 Response: This assay uses primary human neural progenitor cells (hNPCs) from human cortex (gestation week
16 - 19) to measure changes in hNPC proliferation. Biological responses that are measured include fetal hNPC
proliferation, viability and cytotoxicity as quantified by sphere size, DNA synthesis as chemiluminescence
measurement, viability and cytotoxicity as fluorescence intensity. Primary endpoints: 1) Proliferation by area
(72hours; NPCla) is assessed as the slope of the increase in sphere size (amount of pixels in the bright-field
image, sphere area) over 72 hours measured by brightfield microscopy using high content imaging at 0 hours,
24 hours, 48 hours, and 72 hours. 2) Proliferation by Brdll (72hours; NPClb) is assessed as Brdll incorporation
(as an indirect measure of DNA synthesis) over the last 16 hours of compound exposure. It is measured as a
luminescence signal (relative luminescence unit) in a multi-plate reader after 72 hours. Secondary endpoints: 1)
Cytotoxicity at 72 hours is assessed as membrane integrity by measuring the amount of LDH leaked from cells
with damaged plasma membranes. LDH-dependent reduction of resazurin to resorufin is measured in the
supernatant of each well as fluorescence of the reaction product resorufin (relative fluorescence unit) in a multi-
plate reader after 72 hours of compound exposure. 2) Viability at 72 hours is assessed as mitochondrial activity
by measuring the amount of resazurin reduced to fluorescent resorufin (relative fluorescence unit) in a multi-
plate reader in the last two hours of the 72 hours proliferation and compound exposure period.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.445832647462276 nM
Key positive control:

media without growth factors

Target (nominal) number of replicates:

13

Standard maximum concentration tested:

325.011999999998 nM
Neutral vehicle control:

DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The raw data format is different depending on the endpoints. For all endpoints assessed in a
multi-plate reader (viability, cytotoxicity, Brdll incorporation) the raw data formats are excel files containing
values (one for each endpoint, timepoint and well) measured as relative fluorescence/luminescence units. These
values are transferred from the original excel file into the AXES sheet. The original excel output file is saved for
traceability of the data. The sphere size is automatically measured in the Cellomics scan software (Version 6.6.0;
Thermo Scientific) and copied into the AXES sheet. Original brightfield images are archived for 10 years. If not
otherwise stated, all data processing steps are performed in an R based evaluation tool that was designed for
data processing, curve fitting and point of departure evaluation of in vitro concentration response toxicity data.
Data processing describes all processing steps of raw data that are necessary to obtain the final response values
including the normalization, curve fitting and benchmark concentration calculation. Processing (or pre-
processing) steps for Proliferation by area endpoint involve taking the slope of the sphere size over 3 days of
proliferation (dO, dl, d2, d3) by plate. The calculated slope is used as raw data input for the data base (DB) and
is thus not calculated in the R based evaluation tool. The data is normalized to the solvent control. For the
normalization to the solvent control, each replicate data point is normalized to the median of the solvent control
in the respective experiment. Mathematical procedures to define outliers are not applied. Data points from
wells where technical problems are known or obvious are excluded from the analysis. Possible technical
problems include: pipetting errors, spillover from lysis, or problems in fixation of singularized cells. All wells with
technical problems are marked in the AXES sheet.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must

-------
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0; pval
= 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 232

Number of chemicals tested: 218

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
38

Inactive hit count: 0
-------
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	3265.9

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	57.8

Positive control well median absolute deviation, by plate: pmad	0

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-Inf

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 45.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Baumann J, Gassmann K, Masjosthusmann S, DeBoer D, Bendt F, Giersiefer S, Fritsche E.
Comparative human and rat neurospheres reveal species differences in chemical effects on
neurodevelopmental key events. Arch Toxicol. 2016 Jun;90(6):1415-27. doi: 10.1007/s00204-015-1568-8. Epub
2015 Jul 28. PMID: 26216354.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2775

IUF_NPCl_viability_72hr

1.	General Information

1.1	Assay Title: Viability Assessment at 72 hours in the Leibniz Research Institute for Environmental Medicine (IUF)
Neural Progenitor Cell Proliferation Assay (NPC1)

1.2	Assay Summary: IUF_NPC1 is a cell-based, multiplexed assay that uses hNPC, a human primary neural progenitor
cells, with measurements taken at 72 hours after chemical dosing in a 96-well plate. Human neural progenitor
cells (hNPC) are generated from human fetal brain cortex at gestational week (GW) 16-19. Grown in suspension
culture and under proliferative conditions (proliferation media and growth factors), these cells represent neural
progenitor cell proliferation. In the NPC1 assay, hNPC's are exposed to the test compound for 72h. Cell
proliferation is assessed as an increase in sphere area using automated phase contrast imaging and as
incorporation of Bromodeoxyuridine (Brdll) during DNA synthesis using a luminescence-based cell proliferation
ELISA. In parallel, the cell viability is assessed using an alamar blue viability assay and the cytotoxicity using a
lactate dehydrogenase dependent membrane integrity assay. IllF_NPCl_viability_72hr is one of 4 assay
components measured from the IUF_NPC1 assay. It is designed to measure cell viability assessed as
mitochondria-dependent reduction of resazurin to resorufin using the alamar blue cell viability assay. Viability
is measured as a fluorescence signal (relative fluorescence unit) in a multiplate reader. Data from the
IIIF_NPCl_viability component was analyzed at the endpoint IIIF_NPCl_viability in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of viability reporter,
gain or loss-of-signal activity can be used to understand viability effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Leibniz Research Institute for Environmental Medicine (IUF) is a German research institution in
Diisseldorf. The IUF research mission is the molecular prevention of environmentally induced health disorders.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: For the source cells, Lonza holds donor consent and legal authorization that provides
permission for all research use.

1.9	Assay Throughput: 96-well plate. The methods described are set up in a 96 well plate format with automated
image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates, compound
dilutions, feeding, cell viability and cytotoxicity assay can be automated using a liquid handling system. In the
fully automated set up 10 plates with 8 conditions and 5 replicates per condition can be run in one week. This
results in the generation of 400 data points for each endpoint within one week (excluding all controls). The
throughput is therefore estimated as medium to high.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in mitochondrial activity are correlated to cellular viability.

The human developing central nervous system is more vulnerable to the adverse effects of chemical agents than
the adult brain. At present, due to the knowledge gap concerning hazard identification for human

-------
neurodevelopmental toxicity (DNT), there is an urgent need fortestingand subsequently regulation of chemicals
for their potential to interfere with the developing nervous system. Primary human neural progenitor cells
(hNPC) cultivated as three-dimensional floating spheres are able to represent several key processes during brain
development. In the NPC1 assay, hNPC are exposed to the test compound for 72 hours. Cell proliferation is
assessed as an increase in sphere area using automated phase contrast imaging and as incorporation of
Bromodeoxyuridine (Brdll) using a luminescence-based cell proliferation ELISA. In parallel, the cell viability is
assessed using an alamar blue viability assay. In the neural progenitor cell migration and differentiation assay
(NPC2-5), hNPC are plated on an extracellular matrix, and migrate and differentiate out of the sphere core.
Thereby the processes radial glia migration, migration of neurons and oligodendrocytes as well as differentiation
into neurons and oligodendrocytes can be studied in combination with general cell viability and cytotoxicity.
Cell migration and differentiation are critical process during brain development that, if disturbed lead to
alterations in brain development and causes cognitive dysfunction. Currently these processes are assessed in
the OECD TG426 by neuropathological evaluation of certain brain regions as well as neurobehavioral tests.

2.2	Scientific Principles: Primary hNPCs are isolated from the fetal brain cortices and can be used to measure
proliferation, a process of brain growth during the fetal phase of prenatal development. The test system
therefore measures adverse events in the young (fetal) developing brain. Different types of NPC exist in the
developing brain. Besides ventricular zone NPC, radial glia cells serve as cortical progenitor cells responsible for
cortical expansion and folding. As whole cortices were used for cell preparation, this is not a specific NPC type
but rather a mix of NPCs found in fetal human cortex during development. The toxicological events that are
modeled concern events that influence proliferation of NPCs found in human cortex during the fetal phase.

2.3	Experimental System: suspension hNPC primary cell used. Primary human neural progenitor cells (hNPC) are
provided as cryopreserved 3D neurospheres from Lonza, Verviers, Belgium. Material originates from the human
brain cortex of different gestational ages (GW16-19). Sex is either specified or determined before the cells are
used. Grown in suspension culture and under proliferative conditions (proliferation media and growth factors)
these cells represent neural progenitor cell proliferation. 1x106 hNPC are obtained from Lonza (#PT-2599) and
expanded. Lonza provides the cells with a viability of at least 20% FACS analysis indicates proliferating
neurospheres positive for nestin and Ki67. Following differentiation on a poly D-lysin/laminin matrix in the
absence of growth factors, the cells test positive for TUBB3, GFAP, Nestin and 04 (Baumann et al., 2015;
Schmuck et al., 2017). Within the first three days after thawing, 100 uL of spheres (at least 20) are plated on
poly Dlysin/laminin matrix in an 8-chamber slide with 500 uL N2 Media. On day one after plating greater than
80% of spheres need to be differentiated for the cell to be used in experiments. Differentiation towards the
final test system: Cells are frozen in liquid nitrogen and after thawing have to be cultivated in proliferation
medium at 37C and 5 % C02. The medium contains Dulbecco's modified Eagle medium and Hams F12 (3:1)
supplemented with 2% B27, 20 ng/mL EGF, 20 ng/mL recombinant human FGF, 1% penicillin and streptomycin.
The thawing is performed by repeated addition and removal of proliferation medium until all cells are
transferred into suspension in a tissue culture flask. The cells are carefully resuspended and distributed to 10
cm petri dishes filled with fresh, prewarmed proliferation medium. The cells are fed by replacing half the
medium with new medium every two to three days. At each feeding day the culture is checked for impurities
which are removed into a new petri dish. From this dish mistakenly, sorted spheres can be rescued and placed
back in the original culture dish. After 3-4 weeks neurospheres reach the acceptable size of 0.2 - 0.5 mm for
passaging by mechanical dissociation. Thereby, neurospheres are cut into small pieces (0.15 - 0.25 mm;
depending on the desired sphere size), which round up again to uniform sized neurospheres within 1 day in
proliferation medium. By using this method neurospheres are expanded every week. Starting at week 2 poly
hema coated dishes are used for the cultivation procedure. Critical consumable: the cultivation medium does
not contain serum or serum replacement. The use of epidermal growth factor (EGF) and recombinant human
fibroblast growth factor (FGF) is critical for sphere growth. FGF contains 1% bovine serum albumin and is thus
prone to batch effects. Critical handling: The thawing media contains DMSO in a concentration that affects cell
health which is why thawed cells should quickly be diluted in proliferation media (30 mL of media for one vial
of cells). It is recommended to add FGF into medium directly before thawing. At the end of week two of the
expansion period (see below), the spheres should be transferred to petri dishes coated with poly hema to
prevent cell attachment, attached cells that are not differentiated can be detached using a 1000 uL pipet. To

-------
avoid repeated attachment, all cells should be transferred to a new poly hema coated petri dish. Medium that
contains FGF should not be stored longer than 1 week at 4C. During the first two weeks, medium should be
removed using a 1000 uLpipet to keep the accidental removal of small spheres to a minimum. The neurospheres
should be well distributed in the petri dish to circumvent aggregation, which is especially important after
mechanical dissociations.

2.4	Metabolic Competence: Primary hNPC under proliferating and differentiating conditions do not express CYP1A1
and CYP1B1 (Gassmann et al., 2010). Primary hNPC during differentiation, have the capacity to up-regulate
glutathione-dependent protective strategies upon reactive oxygen species (ROS) exposure Masjosthusmann et
al, 2019). Gene expression levels of genes involved in the antioxidative defense (glutathione peroxidase 1
(GPX1), superoxide dismutase 1 (SOD1), catalase (CAT)) were comparable between the in vitro system and
developing human brains in vivo and show similar expression levels (Masjosthusmann et al., 2019). Other
metabolic pathways are not characterized.

2.5	Exposure Regime: After the cell expansion period, the cells are cultured for up to four weeks in which they are
passaged every week. Between one to three days after passaging, depending on the size chosen for passaging,
spheres at a size of 0.3 mm are used in the assay. For the assessment of neural progenitor cell proliferation, the
spheres are plated in poly-Hema coated 96-well U-bottom plates filled with proliferation medium containing
growth factors (EGF and FGF). One 0.25 - 0.35 mm big sphere is plated in the middle of each well. Within 3 days
NPCs proliferate and grow in size. Cultivation during the test method is performed at 37C and 5% C02 at a pH
of 7.2-7.6. As a positive control, spheres are cultivated in the absence of growth factors (EGF and FGF), which
dramatically reduces proliferation. Exposure starts on the plating day (day 0) and is continued over three days,
without chemical renewal, until the experiment is terminated. The assay is terminated by the assessment of cell
viability, cytotoxicity, and proliferation by Brdll. All endpoints are generated from the same experimental run
and from each well/sphere in the 96-well plate.

Baseline median absolute deviation for the assay (bmad): 3.706
Response cutoff threshold used to determine hit calls: 30
Detection technology used: Alamar blue viability assay (Fluorescence)

2.6 Response: This assay uses primary human neural progenitor cells (hNPCs) from human cortex (gestation week
16 - 19) to measure changes in hNPC proliferation. Biological responses that are measured include fetal hNPC
proliferation, viability and cytotoxicity as quantified by sphere size, DNA synthesis as chemiluminescence
measurement, viability and cytotoxicity as fluorescence intensity. Primary endpoints: 1) Proliferation by area
(72hours; NPCla) is assessed as the slope of the increase in sphere size (amount of pixels in the bright-field
image, sphere area) over 72 hours measured by brightfield microscopy using high content imaging at 0 hours,
24 hours, 48 hours, and 72 hours. 2) Proliferation by Brdll (72hours; NPClb) is assessed as Brdll incorporation
(as an indirect measure of DNA synthesis) over the last 16 hours of compound exposure. It is measured as a
luminescence signal (relative luminescence unit) in a multi-plate reader after 72 hours. Secondary endpoints: 1)
Cytotoxicity at 72 hours is assessed as membrane integrity by measuring the amount of LDH leaked from cells
with damaged plasma membranes. LDH-dependent reduction of resazurin to resorufin is measured in the
supernatant of each well as fluorescence of the reaction product resorufin (relative fluorescence unit) in a multi-
plate reader after 72 hours of compound exposure. 2) Viability at 72 hours is assessed as mitochondrial activity
by measuring the amount of resazurin reduced to fluorescent resorufin (relative fluorescence unit) in a multi-
plate reader in the last two hours of the 72 hours proliferation and compound exposure period.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

Target (nominal) number of replicates:

15.4321 nM
Key positive control:
lysed cells

14

Standard maximum concentration tested:

11250 nM
Neutral vehicle control:

DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The raw data format is different depending on the endpoints. For all endpoints assessed in a
multi-plate reader (viability, cytotoxicity, Brdll incorporation) the raw data formats are excel files containing
values (one for each endpoint, timepoint and well) measured as relative fluorescence/luminescence units. These
values are transferred from the original excel file into the AXES sheet. The original excel output file is saved for
traceability of the data. The sphere size is automatically measured in the Cellomics scan software (Version 6.6.0;
Thermo Scientific) and copied into the AXES sheet. Original brightfield images are archived for 10 years. If not
otherwise stated, all data processing steps are performed in an R based evaluation tool that was designed for
data processing, curve fitting and point of departure evaluation of in vitro concentration response toxicity data.
Data processing describes all processing steps of raw data that are necessary to obtain the final response values
including the normalization, curve fitting and benchmark concentration calculation. Processing (or pre-
processing) steps for the Viability endpoint involves subtraction of mean background from each response value
by plate. Background corrected response (RFU)=raw response (RFU)-Background (RFU). The data is
normalized to the solvent control. For the normalization to the solvent control, each replicate data point is
normalized to the median of the solvent control in the respective experiment. Mathematical procedures to
define outliers are not applied. Data points from wells where technical problems are known or obvious are
excluded from the analysis. Possible technical problems include: pipetting errors, spillover from lysis, or
problems in fixation of singularized cells. All wells with technical problems are marked in the AXES sheet.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must

-------
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0; pval
= 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 19: pc30 (Add a cutoff value of 30. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 238

Number of chemicals tested: 223

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
30

Inactive hit count: 0
-------
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1182.5

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	515

Positive control well median absolute deviation, by plate: pmad	0

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-Inf

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 27.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Baumann J, Gassmann K, Masjosthusmann S, DeBoer D, Bendt F, Giersiefer S, Fritsche E.
Comparative human and rat neurospheres reveal species differences in chemical effects on
neurodevelopmental key events. Arch Toxicol. 2016 Jun;90(6):1415-27. doi: 10.1007/s00204-015-1568-8. Epub
2015 Jul 28. PMID: 26216354.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2938

IU F_N PC2a_ra dia l^gli a_m igratio n_72 h r

1. General Information

1.1	Assay Title: Leibniz Research Institute for Environmental Medicine (IUF) 72-hour Neural Progenitor Cell
Migration Assay for Radial Glia Migration (NPC2a)

1.2	Assay Summary: IUF_NPC2-5 is a cell-based, multiplexed assay that uses hNPC, a human primary neural
progenitor cells, with measurements taken at 72 or 120 hours after chemical dosing in a 96-well plate. Human
neural progenitor cells (hNPC) are generated from human fetal brain cortex at gestational week (GW) 16-19. In
the neural progenitor cell migration and differentiation assay (NPC2-5), hNPCa€™s grown as spheres are plated
on an extracellular matrix and migrate and differentiate out of the sphere core. The processes radial glia
migration, neuronal and oligodendrocyte migration as well as neuronal differentiation, neuronal morphology
and oligodendrocyte differentiation are studied using automated phase contrast imaging and automated
fluorescence imaging in combination with high content image analysis. In parallel, the cell viability is assessed
using an alamar blue viability assay and the cytotoxicity using a lactate dehydrogenase dependent membrane
integrity assay. IUF_NPC2a_radial glia_migration_72hr is one of 12 assay components measured from the
IUF_NPC2-5 assay. The migration distance of radial glia at 72 hr is assessed in A,A|am from the edge of the sphere
core to the edge of the migration area based on brightfield images of each well. Therefore, each plate is scanned
using a high content imaging device. Images are exported and the migration distance is measured manually in
four directions using ImageJ. The mean of four measures per well is used as raw data input. Data from the
IUF_NPC2a_radial_glia_migration_72hr component was analyzed at the endpoint
IUF_NPC2a_radial_glia_migration_72hr in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of migration reporter, gain or loss-of-signal activity can be used to
understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Leibniz Research Institute for Environmental Medicine (IUF) is a German research institution in
Diisseldorf. The IUF research mission is the molecular prevention of environmentally induced health disorders.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: For the source cells, Lonza holds donor consent and legal authorization that provides
permission for all research use.

1.9	Assay Throughput: 96-well plate. The methods described are set up in a 96 well plate format with automated
image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates, compound
dilutions, feeding, cell viability and cytotoxicity assay can be automated using a liquid handling system. In the
fully automated set up 10 plates with 8 conditions and 5 replicates per condition can be run in one week. This
results in the generation of 400 data points for each endpoint within one week (excluding all controls). The
throughput is therefore estimated as medium to high.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Changes in the migration distance of radial glia are indicative of radial glia migration during
neurodevelopment.

The human developing central nervous system is more vulnerable to the adverse effects of chemical agents than
the adult brain. At present, due to the knowledge gap concerning hazard identification for human
neurodevelopmental toxicity (DNT), there is an urgent need fortestingand subsequently regulation of chemicals
for their potential to interfere with the developing nervous system. Primary human neural progenitor cells
(hNPC) cultivated as three-dimensional floating spheres are able to represent several key processes during brain
development. In the NPC1 assay, hNPC are exposed to the test compound for 72 hours. Cell proliferation is
assessed as an increase in sphere area using automated phase contrast imaging and as incorporation of
Bromodeoxyuridine (Brdll) using a luminescence-based cell proliferation ELISA. In parallel, the cell viability is
assessed using an alamar blue viability assay. In the neural progenitor cell migration and differentiation assay
(NPC2-5), hNPC are plated on an extracellular matrix, and migrate and differentiate out of the sphere core.
Thereby the processes radial glia migration, migration of neurons and oligodendrocytes as well as differentiation
into neurons and oligodendrocytes can be studied in combination with general cell viability and cytotoxicity.
Cell migration and differentiation are critical process during brain development that, if disturbed lead to
alterations in brain development and causes cognitive dysfunction. Currently these processes are assessed in
the OECD TG426 by neuropathological evaluation of certain brain regions as well as neurobehavioral tests.

2.2	Scientific Principles: Primary hNPC are isolated from the fetal brain and represent the process cell migration
and differentiation into neurons, oligodendrocytes within this test method. The test method predicts the hazard
to induce developmental neurotoxicity in the form of neurophysiological and functional changes in the
developing nervous system. The NPC2a radial glia cell migration correspond to radial glia cell migration during
corticogenesis in vivo.

2.3	Experimental System: suspension hNPC primary cell used. Primary human neural progenitor cells (hNPC) are
provided as cryopreserved 3D neurospheres from Lonza, Verviers, Belgium. Material originates from the human
brain cortex of different gestational ages (GW16-19). Sex is either specified or determined before the cells are
used. Grown in suspension culture and under proliferative conditions (proliferation media and growth factors)
these cells represent neural progenitor cell proliferation. 1x106 hNPC are obtained from Lonza (#PT-2599) and
expanded. Lonza provides the cells with a viability of at least 20% FACS analysis indicates proliferating
neurospheres positive for nestin and Ki67. Following differentiation on a poly D-lysin/laminin matrix in the
absence of growth factors, the cells test positive for TUBB3, GFAP, Nestin and 04 (Baumann et al., 2015;
Schmuck et al., 2017). Within the first three days after thawing, 100 uL of spheres (at least 20) are plated on
poly Dlysin/laminin matrix in an 8-chamber slide with 500 uL N2 Media. On day one after plating greater than
80% of spheres need to be differentiated for the cell to be used in experiments. Differentiation towards the
final test system: Cells are frozen in liquid nitrogen and after thawing have to be cultivated in proliferation
medium at 37C and 5 % C02. The medium contains Dulbecco's modified Eagle medium and Hams F12 (3:1)
supplemented with 2% B27, 20 ng/mL EGF, 20 ng/mL recombinant human FGF, 1% penicillin and streptomycin.
The thawing is performed by repeated addition and removal of proliferation medium until all cells are
transferred into suspension in a tissue culture flask. The cells are carefully resuspended and distributed to 10
cm petri dishes filled with fresh, prewarmed proliferation medium. The cells are fed by replacing half the
medium with new medium every two to three days. At each feeding day the culture is checked for impurities
which are removed into a new petri dish. From this dish mistakenly, sorted spheres can be rescued and placed
back in the original culture dish. After 3-4 weeks neurospheres reach the acceptable size of 0.2 - 0.5 mm for
passaging by mechanical dissociation. Thereby, neurospheres are cut into small pieces (0.15 - 0.25 mm;
depending on the desired sphere size), which round up again to uniform sized neurospheres within 1 day in
proliferation medium. By using this method neurospheres are expanded every week. Starting at week 2 poly
hema coated dishes are used for the cultivation procedure. Critical consumable: the cultivation medium does
not contain serum or serum replacement. The use of epidermal growth factor (EGF) and recombinant human
fibroblast growth factor (FGF) is critical for sphere growth. FGF contains 1% bovine serum albumin and is thus
prone to batch effects. Critical handling: The thawing media contains DMSO in a concentration that affects cell
health which is why thawed cells should quickly be diluted in proliferation media (30 mL of media for one vial

-------
of cells). It is recommended to add FGF into medium directly before thawing. At the end of week two of the
expansion period (see below), the spheres should be transferred to petri dishes coated with poly hema to
prevent cell attachment, attached cells that are not differentiated can be detached using a 1000 uL pipet. To
avoid repeated attachment, all cells should be transferred to a new poly hema coated petri dish. Medium that
contains FGF should not be stored longer than 1 week at 4C. During the first two weeks, medium should be
removed using a 1000 uL pipet to keep the accidental removal of small spheres to a minimum. The neurospheres
should be well distributed in the petri dish to circumvent aggregation, which is especially important after
mechanical dissociations.

2.4	Metabolic Competence: Primary hNPC under proliferating and differentiating conditions do not express CYP1A1
and CYP1B1 (Gassmann et al., 2010). Primary hNPC during differentiation, have the capacity to up-regulate
glutathione-dependent protective strategies upon reactive oxygen species (ROS) exposure Masjosthusmann et
al, 2019). Gene expression levels of genes involved in the antioxidative defense (glutathione peroxidase 1
(GPX1), superoxide dismutase 1 (SOD1), catalase (CAT)) were comparable between the in vitro system and
developing human brains in vivo and show similar expression levels (Masjosthusmann et al., 2019). Other
metabolic pathways are not characterized.

2.5	Exposure Regime: Culture protocol: After the cell expansion period the cells are cultured for up to four weeks
in which they are passaged every week as described in 3.6. Between one to three days after passaging spheres
at a size of 0.3 mm are picked for the test method. For the assessment of neural progenitor cell migration and
differentiation, the spheres are plated on poly-D-lysine/laminin coated 96-well flat bottom plates in
differentiation medium (N2) to initiate migration and differentiation. Thereby one 0.3 mm big sphere is plated
in the middle of a well. The differentiation medium consists of DMEM and Ham's F12 at a ratio of 3 to 1
supplemented with 1% N2 and 1% penicillin and streptomycin. Within 5 days NPCs radially migrate out of the
sphere core and differentiate into radial glia cells (nestin positive), neurons (TUBB3 positive), oligodendrocytes
(04 positive) and astrocytes (GFAP positive). Cultivation during the test method is performed at 37C and 5%
C02 at a ph of 7.2-7.6. Exposure: 0.3 mm big hNPC are plated in the already prepared test conditions. Exposure
starts at day 0 of differentiation and is continued over five days of differentiation until the experiment is
terminated. Cells are fed with fresh medium at day 3 of differentiation. Therefore, half of the test condition
solution (e.g. solvent control or compound dilution) is replaced by freshly prepared test condition solution. Test
Method: Migration distance and cytotoxicity is determined after 72 hours based on brightfield images of each
well. The assay is terminated by the assessment of cell viability and cytotoxicity as well as cell fixation after 120
h. Immunocytochemistry is performed for Hoechst positive nuclei, TUBB3 positive neurons and 04 positive
oligodendrocytes. On the ICC images, migration after 120 hours, neuronal and oligodendrocyte number,
neuronal morphology and neuron/oligodendrocyte specific migration is assessed, immunocytochemical staining
(ICC) generates an image for each sphere. The plates are scanned using an automated high content imaging
device and all nuclei and their positions are determined automatically based on their intensity and size. Images
are imported to the Omnisphero software to run the image analysis that measures the following endpoints.
The method is set up for 8 test conditions including 7 compound concentrations and one SC. The test conditions
are prepared in a serial dilution from the stock solution. Stock solutions are prepared by diluting the compound
in the solvent (e.g. DMSO) in a concentration that allows the preparation of the highest test concentration
without exceeding the highest acceptable solvent concentration. For DMSO the highest acceptable solvent
concentration is 0.1% which means that the stock concentration needs to be at least lOOOx higher than the
highest test concentration. Stock solutions in non-sterile solvents (e.g. water or PBS) have to be sterile filtrated
using a sterile syringe filter (0.2 um). Stock solutions are aliquoted and stored at -20C. A stock solution is not
thawed more than three times. For the preparation of the test condition the stock solution is diluted to the
highest test concentration (default 1:1000) in differentiation media. All following dilutions are prepared by serial
dilution of the highest concentration in differentiation media with solvent (in the concentration of the highest
test concentration). The default serial dilution is 1:3 which covers a concentration range from e.g. 20 uM to 27
nM (729 fold). Depending on the desired concentration range the dilution can be changed to 1:2, 1:5, 1:10 or
other. The SC is prepared by adding the solvent to differentiation media in the same concentration as the highest
test concentration. 100 uLof the compound dilutions and the SC are added to a 96 well plate. The serial dilution

-------
can also be prepared directly in the 96 well plates. hNPC are added to each well after a 15 to 30 min equilibration
period at 37C and 5 % C02.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.143263374485596 nM
Key positive control:

SRC kinase inhibitor PP2

Target (nominal) number of replicates:

17

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 5.637

Response cutoff threshold used to determine hit calls: 16.911

Detection technology used: Automated phase contrast imaging (Microscopy)

2.6	Response: All endpoints are generated from the same experimental run and from each well/sphere in the 96
well plate. Primary DNT specific endpoints of the test method are: migration distance radial glia at 72 hours
(NPC2a), migration distance radial glia at 120 hours (NPC2a), migration distance neurons at 120 hours (NPC2b),
migration distance oligodendrocytes at 120 hours (NPC2c), neuronal differentiation at 120 hours (NPC3),
neurite length at 120 hours (NPC4), neurite area at 120 hours (NPC4), and oligodendrocyte differentiation 120
hours (NPC5). Secondary endpoints are: cell number at 120 hours (which is used for normalization of neuronal
and oligodendrocyte differentiation), cytotoxicity at 72 hours, cytotoxicity at 120 hours, and viability at 120
hours. Migration distance radial glia 72 hours is assessed as migration distance in um from the edge of the
sphere core and the edge of the migration area based on brightfield images of each well. Each plate is scanned
after 72 hours in culture in an automated high content imaging device and the sphere size in four directions is
measured manually using ImageJ. The endpoint-specific control for radial glia migration after 72 hours is the
SRC kinase inhibitor PP2. SRC family kinases represent one pathway that regulates radial gilia migration of
differentiating hNPC (Moors et al., 2007). Inhibition of this pathway with PP2 (10 nM) causes a reduction of
radial glia cell migration between 0 and 60 % of solvent control. All other primary endpoints are assessed based
on an immunocytochemical staining (ICC) image for each sphere. Cells are fixated after 120 hours in culture and
an ICC staining with Hoechst for nuclei, TUBB3 for neurons and 04 for oligodendrocytes is performed. The plates
are scanned using an automated high content imaging device and all nuclei and their positions are determined
automatically based on their intensity and size. Images are imported to the Omnisphero software to run the
image analysis for the following endpoints: Migration distance radial glia 120h is assessed as the migration
distance in um between the sphere core and the edge of the migration area based on ICC images of Hoechst
positives nuclei. With the identification of each nuclei positions around the sphere core, the migration distance
can be calculated. Therefore, a density distribution mask is calculated. By scanning the images of the nuclei
channel, the algorithm can determine relatively more and less dense image areas. By identifying the densest
area in the image, the sphere core can be detected. For identification of the migration, it is assumed, that the
nuclei density decreases with increasing distance to the sphere core. Once the density hits a pre-defined
threshold, the outer boundaries can be determined and the sphere itself can be mapped out in a polynomial
bounding box. Derived from this box, the actual size and migration distance can be calculated for each well.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

-------
2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The raw data format is different depending on the endpoints. For all endpoints assessed in a
multiplate reader (viability and cytotoxicity) the raw data format are excel files containing values (one for each
endpoint, timepoint and well) measured as relative fluorescence units. These values are transferred from the
original excel file into the AXES sheet. The original excel output files is saved for traceability of the data. The
migration distance of radial glia after 72 h which is measured manual in ImageJ is directly copied into the AXES
sheet as value in nm. Original brightfield images are archived for 10 years. All other raw data is computed from
the ICC images in the Omnisphero software and is exported and saved as one csv file. From there the values are
again transferred to the AXES sheets. The following data is exported from Omnisphero: number of all cells in
migration area, number of neurons all neurons in migration area, number of all oligodendrocytes in migration
area, migration distance radial glia (nm), mean migration distance all neurons (nm), mean migration distance
all oligodendrocytes (nm), neurite length (nm), and neurite area (pixel). All original ICC images are archived for
10 years. If not otherwise stated, all data processing steps are performed in a R based evaluation tool that was
designed for data processing, curve fitting and point of departure evaluation of in vitro concentration response
toxicity data. Data processing describes all processing steps of raw data that are necessary to obtain the final
response values including the normalization, curve fitting and benchmark concentration calculation. Raw data
processing to summary data for the migration distance of radial glia at 72 hours involves taking the mean of four
replicate measures of each sphere. The mean of four measures per well is used as raw data input and is not
calculated in the R based evaluation tool.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.), 6: resp.multnegl (Multiply the
normalized response value (resp) by -1; -l*resp.)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 22: pclO (Add a cutoff value of 10. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 238	Number of chemicals tested: 223

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
39

Inactive hit count: Oihitc 0.9
199

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

20
12

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

26

51

-------
quadratic-polynomialfpoly2) model: 37

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

8

3

51

30

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

0

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 30.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Baumann J, Gassmann K, Masjosthusmann S, DeBoer D, Bendt F, Giersiefer S, Fritsche E.
Comparative human and rat neurospheres reveal species differences in chemical effects on
neurodevelopmental key events. Arch Toxicol. 2016 Jun;90(6):1415-27. doi: 10.1007/s00204-015-1568-8. Epub
2015 Jul 28. PMID: 26216354.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2940

IUF_NPC2a_radial^glia_migration_120hr

1. General Information

1.1	Assay Title: Leibniz Research Institute for Environmental Medicine (IUF) 120-hour Neural Progenitor Cell
Migration Assay for Radial Glia Migration (NPC2a)

1.2	Assay Summary: IUF_NPC2-5 is a cell-based, multiplexed assay that uses hNPC, a human primary neural
progenitor cells, with measurements taken at 72 or 120 hours after chemical dosing in a 96-well plate. Human
neural progenitor cells (hNPC) are generated from human fetal brain cortex at gestational week (GW) 16-19. In
the neural progenitor cell migration and differentiation assay (NPC2-5), hNPCa€™s grown as spheres are plated
on an extracellular matrix and migrate and differentiate out of the sphere core. The processes radial glia
migration, neuronal and oligodendrocyte migration as well as neuronal differentiation, neuronal morphology
and oligodendrocyte differentiation are studied using automated phase contrast imaging and automated
fluorescence imaging in combination with high content image analysis. In parallel, the cell viability is assessed
using an alamar blue viability assay and the cytotoxicity using a lactate dehydrogenase dependent membrane
integrity assay. IUF_NPC2a_radial_glia_migration_120hr is one of 12 assay components measured from the
IUF_NPC2-5 assay. The migration distance of radial glia at 120 hr is assessed in nm from the edge of the sphere
core to the edge of the migration area based on fluorescent images of Hoechst-positive nuclei. With the
identification of each nucleus position around the sphere core, migration-related parameters can be calculated.
First, a nuclei density distribution is calculated for which an algorithm determines relatively more and less dense
nuclei areas. This calculation identifies the sphere core by the densest nuclei area in the image. The algorithm
further assumes that nuclei density decreases from the sphere core to the periphery. When the nuclei density
hits a pre-defined threshold, the outer boundaries of the migration area are determined and the sphere itself is
mapped out in a polynomial bounding box. Derived from this box, the size of the migration area and the
migration distance are calculated for each well. Data from the IUF_NPC2a_radial_glia_migration_120hr
component was analyzed at the endpoint IUF_NPC2a_ radial_glia_migration_120hr in the positive analysis
fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of migration
reporter, gain or loss-of-signal activity can be used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Leibniz Research Institute for Environmental Medicine (IUF) is a German research institution in
Diisseldorf. The IUF research mission is the molecular prevention of environmentally induced health disorders.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: For the source cells, Lonza holds donor consent and legal authorization that provides
permission for all research use.

1.9	Assay Throughput: 96-well plate. The methods described are set up in a 96 well plate format with automated
image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates, compound
dilutions, feeding, cell viability and cytotoxicity assay can be automated using a liquid handling system. In the
fully automated set up 10 plates with 8 conditions and 5 replicates per condition can be run in one week. This
results in the generation of 400 data points for each endpoint within one week (excluding all controls). The
throughput is therefore estimated as medium to high.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	CV: Coefficient of Variation

AOP: Adverse Outcome Pathway	DMSO: Dimethyl Sulfoxide

-------
ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package

SSMD: Strictly Standardized Mean Difference

2. Test Method Description

2.1	Purpose: Changes in the migration distance of radial glia are indicative of radial glia migration during
neurodevelopment.

The human developing central nervous system is more vulnerable to the adverse effects of chemical agents than
the adult brain. At present, due to the knowledge gap concerning hazard identification for human
neurodevelopmental toxicity (DNT), there is an urgent need fortestingand subsequently regulation of chemicals
for their potential to interfere with the developing nervous system. Primary human neural progenitor cells
(hNPC) cultivated as three-dimensional floating spheres are able to represent several key processes during brain
development. In the NPC1 assay, hNPC are exposed to the test compound for 72 hours. Cell proliferation is
assessed as an increase in sphere area using automated phase contrast imaging and as incorporation of
Bromodeoxyuridine (Brdll) using a luminescence-based cell proliferation ELISA. In parallel, the cell viability is
assessed using an alamar blue viability assay. In the neural progenitor cell migration and differentiation assay
(NPC2-5), hNPC are plated on an extracellular matrix, and migrate and differentiate out of the sphere core.
Thereby the processes radial glia migration, migration of neurons and oligodendrocytes as well as differentiation
into neurons and oligodendrocytes can be studied in combination with general cell viability and cytotoxicity.
Cell migration and differentiation are critical process during brain development that, if disturbed lead to
alterations in brain development and causes cognitive dysfunction. Currently these processes are assessed in
the OECD TG426 by neuropathological evaluation of certain brain regions as well as neurobehavioral tests.

2.2	Scientific Principles: Primary hNPC are isolated from the fetal brain and represent the process cell migration
and differentiation into neurons, oligodendrocytes within this test method. The test method predicts the hazard
to induce developmental neurotoxicity in the form of neurophysiological and functional changes in the
developing nervous system. The NPC2a radial glia cell migration correspond to radial glia cell migration during
corticogenesis in vivo.

2.3	Experimental System: suspension hNPC primary cell used. Primary human neural progenitor cells (hNPC) are
provided as cryopreserved 3D neurospheres from Lonza, Verviers, Belgium. Material originates from the human
brain cortex of different gestational ages (GW16-19). Sex is either specified or determined before the cells are
used. Grown in suspension culture and under proliferative conditions (proliferation media and growth factors)
these cells represent neural progenitor cell proliferation. 1x106 hNPC are obtained from Lonza (#PT-2599) and
expanded. Lonza provides the cells with a viability of at least 20% FACS analysis indicates proliferating
neurospheres positive for nestin and Ki67. Following differentiation on a poly D-lysin/laminin matrix in the
absence of growth factors, the cells test positive for TUBB3, GFAP, Nestin and 04 (Baumann et al., 2015;
Schmuck et al., 2017). Within the first three days after thawing, 100 uL of spheres (at least 20) are plated on
poly Dlysin/laminin matrix in an 8-chamber slide with 500 uL N2 Media. On day one after plating greater than
80% of spheres need to be differentiated for the cell to be used in experiments. Differentiation towards the
final test system: Cells are frozen in liquid nitrogen and after thawing have to be cultivated in proliferation
medium at 37C and 5 % C02. The medium contains Dulbecco's modified Eagle medium and Hams F12 (3:1)
supplemented with 2% B27, 20 ng/mL EGF, 20 ng/mL recombinant human FGF, 1% penicillin and streptomycin.
The thawing is performed by repeated addition and removal of proliferation medium until all cells are
transferred into suspension in a tissue culture flask. The cells are carefully resuspended and distributed to 10
cm petri dishes filled with fresh, prewarmed proliferation medium. The cells are fed by replacing half the
medium with new medium every two to three days. At each feeding day the culture is checked for impurities
which are removed into a new petri dish. From this dish mistakenly, sorted spheres can be rescued and placed
back in the original culture dish. After 3-4 weeks neurospheres reach the acceptable size of 0.2 - 0.5 mm for
passaging by mechanical dissociation. Thereby, neurospheres are cut into small pieces (0.15 - 0.25 mm;
depending on the desired sphere size), which round up again to uniform sized neurospheres within 1 day in
proliferation medium. By using this method neurospheres are expanded every week. Starting at week 2 poly
hema coated dishes are used for the cultivation procedure. Critical consumable: the cultivation medium does
not contain serum or serum replacement. The use of epidermal growth factor (EGF) and recombinant human

-------
fibroblast growth factor (FGF) is critical for sphere growth. FGF contains 1% bovine serum albumin and is thus
prone to batch effects. Critical handling: The thawing media contains DMSO in a concentration that affects cell
health which is why thawed cells should quickly be diluted in proliferation media (30 mL of media for one vial
of cells). It is recommended to add FGF into medium directly before thawing. At the end of week two of the
expansion period (see below), the spheres should be transferred to petri dishes coated with poly hema to
prevent cell attachment, attached cells that are not differentiated can be detached using a 1000 uL pipet. To
avoid repeated attachment, all cells should be transferred to a new poly hema coated petri dish. Medium that
contains FGF should not be stored longer than 1 week at 4C. During the first two weeks, medium should be
removed using a 1000 uL pipet to keep the accidental removal of small spheres to a minimum. The neurospheres
should be well distributed in the petri dish to circumvent aggregation, which is especially important after
mechanical dissociations.

2.4	Metabolic Competence: Primary hNPC under proliferating and differentiating conditions do not express CYP1A1
and CYP1B1 (Gassmann et al., 2010). Primary hNPC during differentiation, have the capacity to up-regulate
glutathione-dependent protective strategies upon reactive oxygen species (ROS) exposure Masjosthusmann et
al, 2019). Gene expression levels of genes involved in the antioxidative defense (glutathione peroxidase 1
(GPX1), superoxide dismutase 1 (SOD1), catalase (CAT)) were comparable between the in vitro system and
developing human brains in vivo and show similar expression levels (Masjosthusmann et al., 2019). Other
metabolic pathways are not characterized.

2.5	Exposure Regime: Culture protocol: After the cell expansion period the cells are cultured for up to four weeks
in which they are passaged every week as described in 3.6. Between one to three days after passaging spheres
at a size of 0.3 mm are picked for the test method. For the assessment of neural progenitor cell migration and
differentiation, the spheres are plated on poly-D-lysine/laminin coated 96-well flat bottom plates in
differentiation medium (N2) to initiate migration and differentiation. Thereby one 0.3 mm big sphere is plated
in the middle of a well. The differentiation medium consists of DMEM and Ham's F12 at a ratio of 3 to 1
supplemented with 1% N2 and 1% penicillin and streptomycin. Within 5 days NPCs radially migrate out of the
sphere core and differentiate into radial glia cells (nestin positive), neurons (TUBB3 positive), oligodendrocytes
(04 positive) and astrocytes (GFAP positive). Cultivation during the test method is performed at 37C and 5%
C02 at a ph of 7.2-7.6. Exposure: 0.3 mm big hNPC are plated in the already prepared test conditions. Exposure
starts at day 0 of differentiation and is continued over five days of differentiation until the experiment is
terminated. Cells are fed with fresh medium at day 3 of differentiation. Therefore, half of the test condition
solution (e.g. solvent control or compound dilution) is replaced by freshly prepared test condition solution. Test
Method: Migration distance and cytotoxicity is determined after 72 hours based on brightfield images of each
well. The assay is terminated by the assessment of cell viability and cytotoxicity as well as cell fixation after 120
h. Immunocytochemistry is performed for Hoechst positive nuclei, TUBB3 positive neurons and 04 positive
oligodendrocytes. On the ICC images, migration after 120 hours, neuronal and oligodendrocyte number,
neuronal morphology and neuron/oligodendrocyte specific migration is assessed, immunocytochemical staining
(ICC) generates an image for each sphere. The plates are scanned using an automated high content imaging
device and all nuclei and their positions are determined automatically based on their intensity and size. Images
are imported to the Omnisphero software to run the image analysis that measures the following endpoints.
The method is set up for 8 test conditions including 7 compound concentrations and one SC. The test conditions
are prepared in a serial dilution from the stock solution. Stock solutions are prepared by diluting the compound
in the solvent (e.g. DMSO) in a concentration that allows the preparation of the highest test concentration
without exceeding the highest acceptable solvent concentration. For DMSO the highest acceptable solvent
concentration is 0.1% which means that the stock concentration needs to be at least lOOOx higher than the
highest test concentration. Stock solutions in non-sterile solvents (e.g. water or PBS) have to be sterile filtrated
using a sterile syringe filter (0.2 um). Stock solutions are aliquoted and stored at -20C. A stock solution is not
thawed more than three times. For the preparation of the test condition the stock solution is diluted to the
highest test concentration (default 1:1000) in differentiation media. All following dilutions are prepared by serial
dilution of the highest concentration in differentiation media with solvent (in the concentration of the highest
test concentration). The default serial dilution is 1:3 which covers a concentration range from e.g. 20 uM to 27
nM (729 fold). Depending on the desired concentration range the dilution can be changed to 1:2, 1:5, 1:10 or

-------
other. The SC is prepared by adding the solvent to differentiation media in the same concentration as the highest
test concentration. 100 uLof the compound dilutions and the SC are added to a 96 well plate. The serial dilution
can also be prepared directly in the 96 well plates. hNPC are added to each well after a 15 to 30 min equilibration
period at 37C and 5 % C02.

Baseline median absolute deviation for the assay (bmad): 5.431

Response cutoff threshold used to determine hit calls: 10.863

Detection technology used: High content image analysis (Fluorescence microscopy)

2.6	Response: All endpoints are generated from the same experimental run and from each well/sphere in the 96
well plate. Primary DNT specific endpoints of the test method are: migration distance radial glia at 72 hours
(NPC2a), migration distance radial glia at 120 hours (NPC2a), migration distance neurons at 120 hours (NPC2b),
migration distance oligodendrocytes at 120 hours (NPC2c), neuronal differentiation at 120 hours (NPC3),
neurite length at 120 hours (NPC4), neurite area at 120 hours (NPC4), and oligodendrocyte differentiation 120
hours (NPC5). Secondary endpoints are: cell number at 120 hours (which is used for normalization of neuronal
and oligodendrocyte differentiation), cytotoxicity at 72 hours, cytotoxicity at 120 hours, and viability at 120
hours. Migration distance radial glia 72 hours is assessed as migration distance in um from the edge of the
sphere core and the edge of the migration area based on brightfield images of each well. Each plate is scanned
after 72 hours in culture in an automated high content imaging device and the sphere size in four directions is
measured manually using ImageJ. The endpoint-specific control for radial glia migration after 72 hours is the
SRC kinase inhibitor PP2. SRC family kinases represent one pathway that regulates radial gilia migration of
differentiating hNPC (Moors et al., 2007). Inhibition of this pathway with PP2 (10 nM) causes a reduction of
radial glia cell migration between 0 and 60 % of solvent control. All other primary endpoints are assessed based
on an immunocytochemical staining (ICC) image for each sphere. Cells are fixated after 120 hours in culture and
an ICC staining with Hoechst for nuclei, TUBB3 for neurons and 04 for oligodendrocytes is performed. The plates
are scanned using an automated high content imaging device and all nuclei and their positions are determined
automatically based on their intensity and size. Images are imported to the Omnisphero software to run the
image analysis for the following endpoints: Migration distance radial glia 120h is assessed as the migration
distance in um between the sphere core and the edge of the migration area based on ICC images of Hoechst
positives nuclei. With the identification of each nuclei positions around the sphere core, the migration distance
can be calculated. Therefore, a density distribution mask is calculated. By scanning the images of the nuclei
channel, the algorithm can determine relatively more and less dense image areas. By identifying the densest
area in the image, the sphere core can be detected. For identification of the migration, it is assumed, that the
nuclei density decreases with increasing distance to the sphere core. Once the density hits a pre-defined
threshold, the outer boundaries can be determined and the sphere itself can be mapped out in a polynomial
bounding box. Derived from this box, the actual size and migration distance can be calculated for each well.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.143263374485596 nM
Key positive control:

NA

I arget (nominal) number of replicates:

17

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO

-------
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The raw data format is different depending on the endpoints. For all endpoints assessed in a
multiplate reader (viability and cytotoxicity) the raw data format are excel files containing values (one for each
endpoint, timepoint and well) measured as relative fluorescence units. These values are transferred from the
original excel file into the AXES sheet. The original excel output files is saved for traceability of the data. The
migration distance of radial glia after 72 h which is measured manual in ImageJ is directly copied into the AXES
sheet as value in nm. Original brightfield images are archived for 10 years. All other raw data is computed from
the ICC images in the Omnisphero software and is exported and saved as one csv file. From there the values are
again transferred to the AXES sheets. The following data is exported from Omnisphero: number of all cells in
migration area, number of neurons all neurons in migration area, number of all oligodendrocytes in migration
area, migration distance radial glia (nm), mean migration distance all neurons (nm), mean migration distance
all oligodendrocytes (nm), neurite length (nm), and neurite area (pixel). All original ICC images are archived for
10 years. If not otherwise stated, all data processing steps are performed in a R based evaluation tool that was
designed for data processing, curve fitting and point of departure evaluation of in vitro concentration response
toxicity data. Data processing describes all processing steps of raw data that are necessary to obtain the final
response values including the normalization, curve fitting and benchmark concentration calculation. Migration
distance of radial glia at 120 hours involves no pre-processing. The data is normalized to the solvent control. For
the normalization to the solvent control, each replicate data point is normalized to the median of the solvent
control in the respective experiment.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.), 6: resp.multnegl (Multiply the
normalized response value (resp) by -1; -l*resp.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

21: bmad2 (Add a cutoff value of 2 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.), 22: pclO (Add a cutoff
value of 10. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 238	Number of chemicals tested: 223

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
65

Inactive hit count: 0
-------
Number of sample-assay endpoints with winning hill model:

19
24

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

31

38

quadratic-polynomialfpoly2) model: 35

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

4

24

52

11

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

-------
NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 24.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)

-------
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Baumann J, Gassmann K, Masjosthusmann S, DeBoer D, Bendt F, Giersiefer S, Fritsche E.
Comparative human and rat neurospheres reveal species differences in chemical effects on
neurodevelopmental key events. Arch Toxicol. 2016 Jun;90(6):1415-27. doi: 10.1007/s00204-015-1568-8. Epub
2015 Jul 28. PMID: 26216354.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2942

IUF_NPC2b_neuronal_migration_120hr

1.	General Information

1.1	Assay Title: Leibniz Research Institute for Environmental Medicine (IUF) 120-hour Neural Progenitor Cell
Migration Assay for Neuron Migration (NPC2b)

1.2	Assay Summary: IUF_NPC2-5 is a cell-based, multiplexed assay that uses hNPC, a human primary neural
progenitor cells, with measurements taken at 72 or 120 hours after chemical dosing in a 96-well plate. Human
neural progenitor cells (hNPC) are generated from human fetal brain cortex at gestational week (GW) 16-19. In
the neural progenitor cell migration and differentiation assay (NPC2-5), hNPCa€™s grown as spheres are plated
on an extracellular matrix and migrate and differentiate out of the sphere core. The processes radial glia
migration, neuronal and oligodendrocyte migration as well as neuronal differentiation, neuronal morphology
and oligodendrocyte differentiation are studied using automated phase contrast imaging and automated
fluorescence imaging in combination with high content image analysis. In parallel, the cell viability is assessed
using an alamar blue viability assay and the cytotoxicity using a lactate dehydrogenase dependent membrane
integrity assay. IUF_NPC2b_neuronal_migration_120hr is one of 12 assay components measured from the
IUF_NPC2-5 assay. Neuronal migration distance at 120 hr is the mean distance of all neurons from the edge of
the sphere core to each individual neuron (see IUF_NPC3_neuronal_differentiation_120hr). Neuronal migration
is normalized to radial glia migration distance at 120 hr. Data from the IUF_NPC2b_neuronal_migration_120hr
component was analyzed at the endpoint IUF_NPC2b_neuronal_migration_120hr in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of migration reporter,
gain or loss-of-signal activity can be used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Leibniz Research Institute for Environmental Medicine (IUF) is a German research institution in
Diisseldorf. The IUF research mission is the molecular prevention of environmentally induced health disorders.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: For the source cells, Lonza holds donor consent and legal authorization that provides
permission for all research use.

1.9	Assay Throughput: 96-well plate. The methods described are set up in a 96 well plate format with automated
image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates, compound
dilutions, feeding, cell viability and cytotoxicity assay can be automated using a liquid handling system. In the
fully automated set up 10 plates with 8 conditions and 5 replicates per condition can be run in one week. This
results in the generation of 400 data points for each endpoint within one week (excluding all controls). The
throughput is therefore estimated as medium to high.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in the migration distance of neurons are indicative of neurodevelopment.

-------
The human developing central nervous system is more vulnerable to the adverse effects of chemical agents than
the adult brain. At present, due to the knowledge gap concerning hazard identification for human
neurodevelopmental toxicity (DNT), there is an urgent need fortestingand subsequently regulation of chemicals
for their potential to interfere with the developing nervous system. Primary human neural progenitor cells
(hNPC) cultivated as three-dimensional floating spheres are able to represent several key processes during brain
development. In the NPC1 assay, hNPC are exposed to the test compound for 72 hours. Cell proliferation is
assessed as an increase in sphere area using automated phase contrast imaging and as incorporation of
Bromodeoxyuridine (Brdll) using a luminescence-based cell proliferation ELISA. In parallel, the cell viability is
assessed using an alamar blue viability assay. In the neural progenitor cell migration and differentiation assay
(NPC2-5), hNPC are plated on an extracellular matrix, and migrate and differentiate out of the sphere core.
Thereby the processes radial glia migration, migration of neurons and oligodendrocytes as well as differentiation
into neurons and oligodendrocytes can be studied in combination with general cell viability and cytotoxicity.
Cell migration and differentiation are critical process during brain development that, if disturbed lead to
alterations in brain development and causes cognitive dysfunction. Currently these processes are assessed in
the OECD TG426 by neuropathological evaluation of certain brain regions as well as neurobehavioral tests.

2.2	Scientific Principles: Primary hNPC are isolated from the fetal brain and represent the process cell migration
and differentiation into neurons, oligodendrocytes within this test method. The test method predicts the hazard
to induce developmental neurotoxicity in the form of neurophysiological and functional changes in the
developing nervous system. The NPC2b migration of young cortical neurons on a radial glia carpet correspond
to cortical neuronal radial migration in vivo.

2.3	Experimental System: suspension hNPC primary cell used. Primary human neural progenitor cells (hNPC) are
provided as cryopreserved 3D neurospheres from Lonza, Verviers, Belgium. Material originates from the human
brain cortex of different gestational ages (GW16-19). Sex is either specified or determined before the cells are
used. Grown in suspension culture and under proliferative conditions (proliferation media and growth factors)
these cells represent neural progenitor cell proliferation. 1x106 hNPC are obtained from Lonza (#PT-2599) and
expanded. Lonza provides the cells with a viability of at least 20% FACS analysis indicates proliferating
neurospheres positive for nestin and Ki67. Following differentiation on a poly D-lysin/laminin matrix in the
absence of growth factors, the cells test positive for TUBB3, GFAP, Nestin and 04 (Baumann et al., 2015;
Schmuck et al., 2017). Within the first three days after thawing, 100 uL of spheres (at least 20) are plated on
poly Dlysin/laminin matrix in an 8-chamber slide with 500 uL N2 Media. On day one after plating greater than
80% of spheres need to be differentiated for the cell to be used in experiments. Differentiation towards the
final test system: Cells are frozen in liquid nitrogen and after thawing have to be cultivated in proliferation
medium at 37C and 5 % C02. The medium contains Dulbecco's modified Eagle medium and Hams F12 (3:1)
supplemented with 2% B27, 20 ng/mL EGF, 20 ng/mL recombinant human FGF, 1% penicillin and streptomycin.
The thawing is performed by repeated addition and removal of proliferation medium until all cells are
transferred into suspension in a tissue culture flask. The cells are carefully resuspended and distributed to 10
cm petri dishes filled with fresh, prewarmed proliferation medium. The cells are fed by replacing half the
medium with new medium every two to three days. At each feeding day the culture is checked for impurities
which are removed into a new petri dish. From this dish mistakenly, sorted spheres can be rescued and placed
back in the original culture dish. After 3-4 weeks neurospheres reach the acceptable size of 0.2 - 0.5 mm for
passaging by mechanical dissociation. Thereby, neurospheres are cut into small pieces (0.15 - 0.25 mm;
depending on the desired sphere size), which round up again to uniform sized neurospheres within 1 day in
proliferation medium. By using this method neurospheres are expanded every week. Starting at week 2 poly
hema coated dishes are used for the cultivation procedure. Critical consumable: the cultivation medium does
not contain serum or serum replacement. The use of epidermal growth factor (EGF) and recombinant human
fibroblast growth factor (FGF) is critical for sphere growth. FGF contains 1% bovine serum albumin and is thus
prone to batch effects. Critical handling: The thawing media contains DMSO in a concentration that affects cell
health which is why thawed cells should quickly be diluted in proliferation media (30 mL of media for one vial
of cells). It is recommended to add FGF into medium directly before thawing. At the end of week two of the
expansion period (see below), the spheres should be transferred to petri dishes coated with poly hema to
prevent cell attachment, attached cells that are not differentiated can be detached using a 1000 uL pipet. To

-------
avoid repeated attachment, all cells should be transferred to a new poly hema coated petri dish. Medium that
contains FGF should not be stored longer than 1 week at 4C. During the first two weeks, medium should be
removed using a 1000 uLpipet to keep the accidental removal of small spheres to a minimum. The neurospheres
should be well distributed in the petri dish to circumvent aggregation, which is especially important after
mechanical dissociations.

2.4	Metabolic Competence: Primary hNPC under proliferating and differentiating conditions do not express CYP1A1
and CYP1B1 (Gassmann et al., 2010). Primary hNPC during differentiation, have the capacity to up-regulate
glutathione-dependent protective strategies upon reactive oxygen species (ROS) exposure Masjosthusmann et
al, 2019). Gene expression levels of genes involved in the antioxidative defense (glutathione peroxidase 1
(GPX1), superoxide dismutase 1 (SOD1), catalase (CAT)) were comparable between the in vitro system and
developing human brains in vivo and show similar expression levels (Masjosthusmann et al., 2019). Other
metabolic pathways are not characterized.

2.5	Exposure Regime: Culture protocol: After the cell expansion period the cells are cultured for up to four weeks
in which they are passaged every week as described in 3.6. Between one to three days after passaging spheres
at a size of 0.3 mm are picked for the test method. For the assessment of neural progenitor cell migration and
differentiation, the spheres are plated on poly-D-lysine/laminin coated 96-well flat bottom plates in
differentiation medium (N2) to initiate migration and differentiation. Thereby one 0.3 mm big sphere is plated
in the middle of a well. The differentiation medium consists of DMEM and Ham's F12 at a ratio of 3 to 1
supplemented with 1% N2 and 1% penicillin and streptomycin. Within 5 days NPCs radially migrate out of the
sphere core and differentiate into radial glia cells (nestin positive), neurons (TUBB3 positive), oligodendrocytes
(04 positive) and astrocytes (GFAP positive). Cultivation during the test method is performed at 37C and 5%
C02 at a ph of 7.2-7.6. Exposure: 0.3 mm big hNPC are plated in the already prepared test conditions. Exposure
starts at day 0 of differentiation and is continued over five days of differentiation until the experiment is
terminated. Cells are fed with fresh medium at day 3 of differentiation. Therefore, half of the test condition
solution (e.g. solvent control or compound dilution) is replaced by freshly prepared test condition solution. Test
Method: Migration distance and cytotoxicity is determined after 72 hours based on brightfield images of each
well. The assay is terminated by the assessment of cell viability and cytotoxicity as well as cell fixation after 120
h. Immunocytochemistry is performed for Hoechst positive nuclei, TUBB3 positive neurons and 04 positive
oligodendrocytes. On the ICC images, migration after 120 hours, neuronal and oligodendrocyte number,
neuronal morphology and neuron/oligodendrocyte specific migration is assessed, immunocytochemical staining
(ICC) generates an image for each sphere. The plates are scanned using an automated high content imaging
device and all nuclei and their positions are determined automatically based on their intensity and size. Images
are imported to the Omnisphero software to run the image analysis that measures the following endpoints.
The method is set up for 8 test conditions including 7 compound concentrations and one SC. The test conditions
are prepared in a serial dilution from the stock solution. Stock solutions are prepared by diluting the compound
in the solvent (e.g. DMSO) in a concentration that allows the preparation of the highest test concentration
without exceeding the highest acceptable solvent concentration. For DMSO the highest acceptable solvent
concentration is 0.1% which means that the stock concentration needs to be at least lOOOx higher than the
highest test concentration. Stock solutions in non-sterile solvents (e.g. water or PBS) have to be sterile filtrated
using a sterile syringe filter (0.2 um). Stock solutions are aliquoted and stored at -20C. A stock solution is not
thawed more than three times. For the preparation of the test condition the stock solution is diluted to the
highest test concentration (default 1:1000) in differentiation media. All following dilutions are prepared by serial
dilution of the highest concentration in differentiation media with solvent (in the concentration of the highest
test concentration). The default serial dilution is 1:3 which covers a concentration range from e.g. 20 uM to 27
nM (729 fold). Depending on the desired concentration range the dilution can be changed to 1:2, 1:5, 1:10 or
other. The SC is prepared by adding the solvent to differentiation media in the same concentration as the highest
test concentration. 100 uLof the compound dilutions and the SC are added to a 96 well plate. The serial dilution
can also be prepared directly in the 96 well plates. hNPC are added to each well after a 15 to 30 min equilibration
period at 37C and 5 % C02.

ASSAY DESIGN SUMMARY

-------
Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.143263374485596 nM
Key positive control:

NA

Target (nominal) number of replicates:

16

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 8.1
Response cutoff threshold used to determine hit calls: 30

Detection technology used: High content image analysis (Fluorescence microscopy)

2.6	Response: All endpoints are generated from the same experimental run and from each well/sphere in the 96
well plate. Primary DNT specific endpoints of the test method are: migration distance radial glia at 72 hours
(NPC2a), migration distance radial glia at 120 hours (NPC2a), migration distance neurons at 120 hours (NPC2b),
migration distance oligodendrocytes at 120 hours (NPC2c), neuronal differentiation at 120 hours (NPC3),
neurite length at 120 hours (NPC4), neurite area at 120 hours (NPC4), and oligodendrocyte differentiation 120
hours (NPC5). Secondary endpoints are: cell number at 120 hours (which is used for normalization of neuronal
and oligodendrocyte differentiation), cytotoxicity at 72 hours, cytotoxicity at 120 hours, and viability at 120
hours. Migration distance neurons at 120 hours is the mean distance of all neurons from the edge of the sphere
core in relation to the radial glia migration and is determined based on the position of each neuron. The
identification of neurons is done automatically using a convolutional neural network (CNN).

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The raw data format is different depending on the endpoints. For all endpoints assessed in a
multiplate reader (viability and cytotoxicity) the raw data format are excel files containing values (one for each
endpoint, timepoint and well) measured as relative fluorescence units. These values are transferred from the
original excel file into the AXES sheet. The original excel output files is saved for traceability of the data. The
migration distance of radial glia after 72 h which is measured manual in ImageJ is directly copied into the AXES
sheet as value in nm. Original brightfield images are archived for 10 years. All other raw data is computed from

-------
the ICC images in the Omnisphero software and is exported and saved as one csv file. From there the values are
again transferred to the AXES sheets. The following data is exported from Omnisphero: number of all cells in
migration area, number of neurons all neurons in migration area, number of all oligodendrocytes in migration
area, migration distance radial glia (um), mean migration distance all neurons (um), mean migration distance
all oligodendrocytes (um), neurite length (um), and neurite area (pixel). All original ICC images are archived for
10 years. If not otherwise stated, all data processing steps are performed in a R based evaluation tool that was
designed for data processing, curve fitting and point of departure evaluation of in vitro concentration response
toxicity data. Data processing describes all processing steps of raw data that are necessary to obtain the final
response values including the normalization, curve fitting and benchmark concentration calculation. Raw data
processing to summary data for the migration distance neurons at 120 hours endpoint takes the mean migration
distance of all neurons (in the migration area) divided by migration distance radial glia at 120 hours, migration
distance neurons % = mean migration distance of all neurons (um) / migration distance of radial glia (um) x 100.
For the normalization to the solvent control, each replicate data point is normalized to the median of the solvent
control in the respective experiment.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.), 6: resp.multnegl (Multiply the
normalized response value (resp) by -1; -l*resp.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 21: bmad2 (Add a cutoff value
of 2 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using
test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one

-------
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 238	Number of chemicals tested: 223

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

-------
Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 38.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.

Bibliography: Baumann J, Gassmann K, Masjosthusmann S, DeBoer D, Bendt F, Giersiefer S, Fritsche E.
Comparative human and rat neurospheres reveal species differences in chemical effects on

-------
neurodevelopmental key events. Arch Toxicol. 2016 Jun;90(6):1415-27. doi: 10.1007/s00204-015-1568-8. Epub
2015 Jul 28. PMID: 26216354.

7. Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2944

IU F_N PC2c_oligodend rocyte_migration_120h r

1.	General Information

1.1	Assay Title: Leibniz Research Institute for Environmental Medicine (IUF) 120-hour Neural Progenitor Cell
Migration Assay for Oligodendrocyte Migration (NPC2c)

1.2	Assay Summary: IUF_NPC2-5 is a cell-based, multiplexed assay that uses hNPC, a human primary neural
progenitor cells, with measurements taken at 72 or 120 hours after chemical dosing in a 96-well plate. Human
neural progenitor cells (hNPC) are generated from human fetal brain cortex at gestational week (GW) 16-19. In
the neural progenitor cell migration and differentiation assay (NPC2-5), hNPCa€™s grown as spheres are plated
on an extracellular matrix and migrate and differentiate out of the sphere core. The processes radial glia
migration, neuronal and oligodendrocyte migration as well as neuronal differentiation, neuronal morphology
and oligodendrocyte differentiation are studied using automated phase contrast imaging and automated
fluorescence imaging in combination with high content image analysis. In parallel, the cell viability is assessed
using an alamar blue viability assay and the cytotoxicity using a lactate dehydrogenase dependent membrane
integrity assay. IUF_NPC2b_oligodendrocyte_migration_120hr is one of 12 assay components measured from
the IUF_NPC2-5 assay. Oligodendrocyte migration distance at 120 hr is the mean distance of all
oligodendrocytes from the edge of the sphere core to each individual oligodendrocyte (see
IUF_NPC5_oligodendrocyte_differentiation_120hr). Oligodendrocyte migration is normalized to radial glia
migration distance at 120 hr. Data from the IUF_NPC2c_oligodendrocyte_migration_120hr component was
analyzed at the endpoint IUF_NPC2c_oligodendrocyte_migration_120hr in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of migration reporter, gain or
loss-of-signal activity can be used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Leibniz Research Institute for Environmental Medicine (IUF) is a German research institution in
Diisseldorf. The IUF research mission is the molecular prevention of environmentally induced health disorders.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: For the source cells, Lonza holds donor consent and legal authorization that provides
permission for all research use.

1.9	Assay Throughput: 96-well plate. The methods described are set up in a 96 well plate format with automated
image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates, compound
dilutions, feeding, cell viability and cytotoxicity assay can be automated using a liquid handling system. In the
fully automated set up 10 plates with 8 conditions and 5 replicates per condition can be run in one week. This
results in the generation of 400 data points for each endpoint within one week (excluding all controls). The
throughput is therefore estimated as medium to high.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in the migration distance of oligodendrocytes are indicative of neurodevelopment.

-------
The human developing central nervous system is more vulnerable to the adverse effects of chemical agents than
the adult brain. At present, due to the knowledge gap concerning hazard identification for human
neurodevelopmental toxicity (DNT), there is an urgent need fortestingand subsequently regulation of chemicals
for their potential to interfere with the developing nervous system. Primary human neural progenitor cells
(hNPC) cultivated as three-dimensional floating spheres are able to represent several key processes during brain
development. In the NPC1 assay, hNPC are exposed to the test compound for 72 hours. Cell proliferation is
assessed as an increase in sphere area using automated phase contrast imaging and as incorporation of
Bromodeoxyuridine (Brdll) using a luminescence-based cell proliferation ELISA. In parallel, the cell viability is
assessed using an alamar blue viability assay. In the neural progenitor cell migration and differentiation assay
(NPC2-5), hNPC are plated on an extracellular matrix, and migrate and differentiate out of the sphere core.
Thereby the processes radial glia migration, migration of neurons and oligodendrocytes as well as differentiation
into neurons and oligodendrocytes can be studied in combination with general cell viability and cytotoxicity.
Cell migration and differentiation are critical process during brain development that, if disturbed lead to
alterations in brain development and causes cognitive dysfunction. Currently these processes are assessed in
the OECD TG426 by neuropathological evaluation of certain brain regions as well as neurobehavioral tests.

2.2	Scientific Principles: Primary hNPC are isolated from the fetal brain and represent the process cell migration
and differentiation into neurons, oligodendrocytes within this test method. The test method predicts the hazard
to induce developmental neurotoxicity in the form of neurophysiological and functional changes in the
developing nervous system. The NPC2c oligodendrocyte migration correspond to oligodendrocyte migration
during corticogenesis in vivo.

2.3	Experimental System: suspension hNPC primary cell used. Primary human neural progenitor cells (hNPC) are
provided as cryopreserved 3D neurospheres from Lonza, Verviers, Belgium. Material originates from the human
brain cortex of different gestational ages (GW16-19). Sex is either specified or determined before the cells are
used. Grown in suspension culture and under proliferative conditions (proliferation media and growth factors)
these cells represent neural progenitor cell proliferation. 1x106 hNPC are obtained from Lonza (#PT-2599) and
expanded. Lonza provides the cells with a viability of at least 20% FACS analysis indicates proliferating
neurospheres positive for nestin and Ki67. Following differentiation on a poly D-lysin/laminin matrix in the
absence of growth factors, the cells test positive for TUBB3, GFAP, Nestin and 04 (Baumann et al., 2015;
Schmuck et al., 2017). Within the first three days after thawing, 100 uL of spheres (at least 20) are plated on
poly Dlysin/laminin matrix in an 8-chamber slide with 500 uL N2 Media. On day one after plating greater than
80% of spheres need to be differentiated for the cell to be used in experiments. Differentiation towards the
final test system: Cells are frozen in liquid nitrogen and after thawing have to be cultivated in proliferation
medium at 37C and 5 % C02. The medium contains Dulbecco's modified Eagle medium and Hams F12 (3:1)
supplemented with 2% B27, 20 ng/mL EGF, 20 ng/mL recombinant human FGF, 1% penicillin and streptomycin.
The thawing is performed by repeated addition and removal of proliferation medium until all cells are
transferred into suspension in a tissue culture flask. The cells are carefully resuspended and distributed to 10
cm petri dishes filled with fresh, prewarmed proliferation medium. The cells are fed by replacing half the
medium with new medium every two to three days. At each feeding day the culture is checked for impurities
which are removed into a new petri dish. From this dish mistakenly, sorted spheres can be rescued and placed
back in the original culture dish. After 3-4 weeks neurospheres reach the acceptable size of 0.2 - 0.5 mm for
passaging by mechanical dissociation. Thereby, neurospheres are cut into small pieces (0.15 - 0.25 mm;
depending on the desired sphere size), which round up again to uniform sized neurospheres within 1 day in
proliferation medium. By using this method neurospheres are expanded every week. Starting at week 2 poly
hema coated dishes are used for the cultivation procedure. Critical consumable: the cultivation medium does
not contain serum or serum replacement. The use of epidermal growth factor (EGF) and recombinant human
fibroblast growth factor (FGF) is critical for sphere growth. FGF contains 1% bovine serum albumin and is thus
prone to batch effects. Critical handling: The thawing media contains DMSO in a concentration that affects cell
health which is why thawed cells should quickly be diluted in proliferation media (30 mL of media for one vial
of cells). It is recommended to add FGF into medium directly before thawing. At the end of week two of the
expansion period (see below), the spheres should be transferred to petri dishes coated with poly hema to
prevent cell attachment, attached cells that are not differentiated can be detached using a 1000 uL pipet. To

-------
avoid repeated attachment, all cells should be transferred to a new poly hema coated petri dish. Medium that
contains FGF should not be stored longer than 1 week at 4C. During the first two weeks, medium should be
removed using a 1000 uLpipet to keep the accidental removal of small spheres to a minimum. The neurospheres
should be well distributed in the petri dish to circumvent aggregation, which is especially important after
mechanical dissociations.

2.4	Metabolic Competence: Primary hNPC under proliferating and differentiating conditions do not express CYP1A1
and CYP1B1 (Gassmann et al., 2010). Primary hNPC during differentiation, have the capacity to up-regulate
glutathione-dependent protective strategies upon reactive oxygen species (ROS) exposure Masjosthusmann et
al, 2019). Gene expression levels of genes involved in the antioxidative defense (glutathione peroxidase 1
(GPX1), superoxide dismutase 1 (SOD1), catalase (CAT)) were comparable between the in vitro system and
developing human brains in vivo and show similar expression levels (Masjosthusmann et al., 2019). Other
metabolic pathways are not characterized.

2.5	Exposure Regime: Culture protocol: After the cell expansion period the cells are cultured for up to four weeks
in which they are passaged every week as described in 3.6. Between one to three days after passaging spheres
at a size of 0.3 mm are picked for the test method. For the assessment of neural progenitor cell migration and
differentiation, the spheres are plated on poly-D-lysine/laminin coated 96-well flat bottom plates in
differentiation medium (N2) to initiate migration and differentiation. Thereby one 0.3 mm big sphere is plated
in the middle of a well. The differentiation medium consists of DMEM and Ham's F12 at a ratio of 3 to 1
supplemented with 1% N2 and 1% penicillin and streptomycin. Within 5 days NPCs radially migrate out of the
sphere core and differentiate into radial glia cells (nestin positive), neurons (TUBB3 positive), oligodendrocytes
(04 positive) and astrocytes (GFAP positive). Cultivation during the test method is performed at 37C and 5%
C02 at a ph of 7.2-7.6. Exposure: 0.3 mm big hNPC are plated in the already prepared test conditions. Exposure
starts at day 0 of differentiation and is continued over five days of differentiation until the experiment is
terminated. Cells are fed with fresh medium at day 3 of differentiation. Therefore, half of the test condition
solution (e.g. solvent control or compound dilution) is replaced by freshly prepared test condition solution. Test
Method: Migration distance and cytotoxicity is determined after 72 hours based on brightfield images of each
well. The assay is terminated by the assessment of cell viability and cytotoxicity as well as cell fixation after 120
h. Immunocytochemistry is performed for Hoechst positive nuclei, TUBB3 positive neurons and 04 positive
oligodendrocytes. On the ICC images, migration after 120 hours, neuronal and oligodendrocyte number,
neuronal morphology and neuron/oligodendrocyte specific migration is assessed, immunocytochemical staining
(ICC) generates an image for each sphere. The plates are scanned using an automated high content imaging
device and all nuclei and their positions are determined automatically based on their intensity and size. Images
are imported to the Omnisphero software to run the image analysis that measures the following endpoints.
The method is set up for 8 test conditions including 7 compound concentrations and one SC. The test conditions
are prepared in a serial dilution from the stock solution. Stock solutions are prepared by diluting the compound
in the solvent (e.g. DMSO) in a concentration that allows the preparation of the highest test concentration
without exceeding the highest acceptable solvent concentration. For DMSO the highest acceptable solvent
concentration is 0.1% which means that the stock concentration needs to be at least lOOOx higher than the
highest test concentration. Stock solutions in non-sterile solvents (e.g. water or PBS) have to be sterile filtrated
using a sterile syringe filter (0.2 um). Stock solutions are aliquoted and stored at -20C. A stock solution is not
thawed more than three times. For the preparation of the test condition the stock solution is diluted to the
highest test concentration (default 1:1000) in differentiation media. All following dilutions are prepared by serial
dilution of the highest concentration in differentiation media with solvent (in the concentration of the highest
test concentration). The default serial dilution is 1:3 which covers a concentration range from e.g. 20 uM to 27
nM (729 fold). Depending on the desired concentration range the dilution can be changed to 1:2, 1:5, 1:10 or
other. The SC is prepared by adding the solvent to differentiation media in the same concentration as the highest
test concentration. 100 uLof the compound dilutions and the SC are added to a 96 well plate. The serial dilution
can also be prepared directly in the 96 well plates. hNPC are added to each well after a 15 to 30 min equilibration
period at 37C and 5 % C02.

ASSAY DESIGN SUMMARY

-------
Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.143263374485596 nM
Key positive control:

NA

Target (nominal) number of replicates:

15

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 7.118
Response cutoff threshold used to determine hit calls: 30

Detection technology used: High content image analysis (Fluorescence microscopy)

2.6	Response: All endpoints are generated from the same experimental run and from each well/sphere in the 96
well plate. Primary DNT specific endpoints of the test method are: migration distance radial glia at 72 hours
(NPC2a), migration distance radial glia at 120 hours (NPC2a), migration distance neurons at 120 hours (NPC2b),
migration distance oligodendrocytes at 120 hours (NPC2c), neuronal differentiation at 120 hours (NPC3),
neurite length at 120 hours (NPC4), neurite area at 120 hours (NPC4), and oligodendrocyte differentiation 120
hours (NPC5). Secondary endpoints are: cell number at 120 hours (which is used for normalization of neuronal
and oligodendrocyte differentiation), cytotoxicity at 72 hours, cytotoxicity at 120 hours, and viability at 120
hours. Migration distance oligodendrocytes at 120 hours is the mean distance of all oligodendrocytes from the
edge of the sphere core in relation to the radial glia migration and is determined based on the position of each
oligodendrocytes. The identification of oligodendrocytes is done automatically using a convolutional neural
network (CNN). Training of the CNN was done based on manually annotated experiments.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The raw data format is different depending on the endpoints. For all endpoints assessed in a
multiplate reader (viability and cytotoxicity) the raw data format are excel files containing values (one for each
endpoint, timepoint and well) measured as relative fluorescence units. These values are transferred from the
original excel file into the AXES sheet. The original excel output files is saved for traceability of the data. The
migration distance of radial glia after 72 h which is measured manual in ImageJ is directly copied into the AXES

-------
sheet as value in urn. Original brightfield images are archived for 10 years. All other raw data is computed from
the ICC images in the Omnisphero software and is exported and saved as one csv file. From there the values are
again transferred to the AXES sheets. The following data is exported from Omnisphero: number of all cells in
migration area, number of neurons all neurons in migration area, number of all oligodendrocytes in migration
area, migration distance radial glia (um), mean migration distance all neurons (um), mean migration distance
all oligodendrocytes (um), neurite length (um), and neurite area (pixel). All original ICC images are archived for
10 years. If not otherwise stated, all data processing steps are performed in a R based evaluation tool that was
designed for data processing, curve fitting and point of departure evaluation of in vitro concentration response
toxicity data. Data processing describes all processing steps of raw data that are necessary to obtain the final
response values including the normalization, curve fitting and benchmark concentration calculation. Raw data
processing to summary data for the migration distance oligodendrocytes at 120 hours endpoint takes the mean
migration distance of all oligodendrocytes (in the migration area) divided by the migration distance radial glia
120 h. migration distance neurons % = mean migration distance of all oligo (um) / migration distance of radial
glia (um) x 100. For the normalization to the solvent control, each replicate data point is normalized to the
median of the solvent control in the respective experiment.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.), 6: resp.multnegl (Multiply the
normalized response value (resp) by -1; -l*resp.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 21: bmad2 (Add a cutoff value
of 2 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using
test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active

-------
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 238	Number of chemicals tested: 223

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

-------
Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 26.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.

Bibliography: Baumann J, Gassmann K, Masjosthusmann S, DeBoer D, Bendt F, Giersiefer S, Fritsche E.
Comparative human and rat neurospheres reveal species differences in chemical effects on

-------
neurodevelopmental key events. Arch Toxicol. 2016 Jun;90(6):1415-27. doi: 10.1007/s00204-015-1568-8. Epub
2015 Jul 28. PMID: 26216354.

7. Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2946

IU F_N PC3_neu rona l_differentiation_120h r

1. General Information

1.1	Assay Title: Leibniz Research Institute for Environmental Medicine (IUF) 120-hour Neuronal Differentiation
Assay (NPC3)

1.2	Assay Summary: IUF_NPC2-5 is a cell-based, multiplexed assay that uses hNPC, a human primary neural
progenitor cells, with measurements taken at 72 or 120 hours after chemical dosing in a 96-well plate. Human
neural progenitor cells (hNPC) are generated from human fetal brain cortex at gestational week (GW) 16-19. In
the neural progenitor cell migration and differentiation assay (NPC2-5), hNPCa€™s grown as spheres are plated
on an extracellular matrix and migrate and differentiate out of the sphere core. The processes radial glia
migration, neuronal and oligodendrocyte migration as well as neuronal differentiation, neuronal morphology
and oligodendrocyte differentiation are studied using automated phase contrast imaging and automated
fluorescence imaging in combination with high content image analysis. In parallel, the cell viability is assessed
using an alamar blue viability assay and the cytotoxicity using a lactate dehydrogenase dependent membrane
integrity assay. IUF_NPC3_neuronal_differentiation_120hr is one of 12 assay components measured from the
IUF_NPC2-5 assay. Neuronal differentiation is determined as the number of all TUBB3 positive cells in percent
of the number of Hoechst-positive nuclei in the total neurosphere migration area (see IUF_NPC2a_
glia_migration_120hr) after 120 hr of migration/differentiation. Neurons are automatically identified using a
convolutional neural network (CNN) that was trained using manually annotated images of differentiated
neurons. Data from the IUF_NPC3_neuronal_differentiation_120hr component was analyzed at the endpoint
IUF_NPC3_neuronal_differentiation_120hr in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of differentiation reporter, gain or loss-of-signal activity
can be used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Leibniz Research Institute for Environmental Medicine (IUF) is a German research institution in
Diisseldorf. The IUF research mission is the molecular prevention of environmentally induced health disorders.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: For the source cells, Lonza holds donor consent and legal authorization that provides
permission for all research use.

1.9	Assay Throughput: 96-well plate. The methods described are set up in a 96 well plate format with automated
image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates, compound
dilutions, feeding, cell viability and cytotoxicity assay can be automated using a liquid handling system. In the
fully automated set up 10 plates with 8 conditions and 5 replicates per condition can be run in one week. This
results in the generation of 400 data points for each endpoint within one week (excluding all controls). The
throughput is therefore estimated as medium to high.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

2.1 Purpose: Changes in the neuronal differentiation are indicative of neurodevelopment.

-------
The human developing central nervous system is more vulnerable to the adverse effects of chemical agents than
the adult brain. At present, due to the knowledge gap concerning hazard identification for human
neurodevelopmental toxicity (DNT), there is an urgent need fortestingand subsequently regulation of chemicals
for their potential to interfere with the developing nervous system. Primary human neural progenitor cells
(hNPC) cultivated as three-dimensional floating spheres are able to represent several key processes during brain
development. In the NPC1 assay, hNPC are exposed to the test compound for 72 hours. Cell proliferation is
assessed as an increase in sphere area using automated phase contrast imaging and as incorporation of
Bromodeoxyuridine (Brdll) using a luminescence-based cell proliferation ELISA. In parallel, the cell viability is
assessed using an alamar blue viability assay. In the neural progenitor cell migration and differentiation assay
(NPC2-5), hNPC are plated on an extracellular matrix, and migrate and differentiate out of the sphere core.
Thereby the processes radial glia migration, migration of neurons and oligodendrocytes as well as differentiation
into neurons and oligodendrocytes can be studied in combination with general cell viability and cytotoxicity.
Cell migration and differentiation are critical process during brain development that, if disturbed lead to
alterations in brain development and causes cognitive dysfunction. Currently these processes are assessed in
the OECD TG426 by neuropathological evaluation of certain brain regions as well as neurobehavioral tests.

2.2	Scientific Principles: Primary hNPC are isolated from the fetal brain and represent the process cell migration
and differentiation into neurons, oligodendrocytes within this test method. The test method predicts the hazard
to induce developmental neurotoxicity in the form of neurophysiological and functional changes in the
developing nervous system. The NPC3 fetal neuronal differentiation into young neurons correspond to cortical
neurogenesis in vivo.

2.3	Experimental System: suspension hNPC primary cell used. Primary human neural progenitor cells (hNPC) are
provided as cryopreserved 3D neurospheres from Lonza, Verviers, Belgium. Material originates from the human
brain cortex of different gestational ages (GW16-19). Sex is either specified or determined before the cells are
used. Grown in suspension culture and under proliferative conditions (proliferation media and growth factors)
these cells represent neural progenitor cell proliferation. 1x106 hNPC are obtained from Lonza (#PT-2599) and
expanded. Lonza provides the cells with a viability of at least 20% FACS analysis indicates proliferating
neurospheres positive for nestin and Ki67. Following differentiation on a poly D-lysin/laminin matrix in the
absence of growth factors, the cells test positive for TUBB3, GFAP, Nestin and 04 (Baumann et al., 2015;
Schmuck et al., 2017). Within the first three days after thawing, 100 uL of spheres (at least 20) are plated on
poly Dlysin/laminin matrix in an 8-chamber slide with 500 uL N2 Media. On day one after plating greater than
80% of spheres need to be differentiated for the cell to be used in experiments. Differentiation towards the
final test system: Cells are frozen in liquid nitrogen and after thawing have to be cultivated in proliferation
medium at 37C and 5 % C02. The medium contains Dulbecco's modified Eagle medium and Hams F12 (3:1)
supplemented with 2% B27, 20 ng/mL EGF, 20 ng/mL recombinant human FGF, 1% penicillin and streptomycin.
The thawing is performed by repeated addition and removal of proliferation medium until all cells are
transferred into suspension in a tissue culture flask. The cells are carefully resuspended and distributed to 10
cm petri dishes filled with fresh, prewarmed proliferation medium. The cells are fed by replacing half the
medium with new medium every two to three days. At each feeding day the culture is checked for impurities
which are removed into a new petri dish. From this dish mistakenly, sorted spheres can be rescued and placed
back in the original culture dish. After 3-4 weeks neurospheres reach the acceptable size of 0.2 - 0.5 mm for
passaging by mechanical dissociation. Thereby, neurospheres are cut into small pieces (0.15 - 0.25 mm;
depending on the desired sphere size), which round up again to uniform sized neurospheres within 1 day in
proliferation medium. By using this method neurospheres are expanded every week. Starting at week 2 poly
hema coated dishes are used for the cultivation procedure. Critical consumable: the cultivation medium does
not contain serum or serum replacement. The use of epidermal growth factor (EGF) and recombinant human
fibroblast growth factor (FGF) is critical for sphere growth. FGF contains 1% bovine serum albumin and is thus
prone to batch effects. Critical handling: The thawing media contains DMSO in a concentration that affects cell
health which is why thawed cells should quickly be diluted in proliferation media (30 mL of media for one vial
of cells). It is recommended to add FGF into medium directly before thawing. At the end of week two of the
expansion period (see below), the spheres should be transferred to petri dishes coated with poly hema to
prevent cell attachment, attached cells that are not differentiated can be detached using a 1000 uL pipet. To

-------
avoid repeated attachment, all cells should be transferred to a new poly hema coated petri dish. Medium that
contains FGF should not be stored longer than 1 week at 4C. During the first two weeks, medium should be
removed using a 1000 uLpipet to keep the accidental removal of small spheres to a minimum. The neurospheres
should be well distributed in the petri dish to circumvent aggregation, which is especially important after
mechanical dissociations.

2.4	Metabolic Competence: Primary hNPC under proliferating and differentiating conditions do not express CYP1A1
and CYP1B1 (Gassmann et al., 2010). Primary hNPC during differentiation, have the capacity to up-regulate
glutathione-dependent protective strategies upon reactive oxygen species (ROS) exposure Masjosthusmann et
al, 2019). Gene expression levels of genes involved in the antioxidative defense (glutathione peroxidase 1
(GPX1), superoxide dismutase 1 (SOD1), catalase (CAT)) were comparable between the in vitro system and
developing human brains in vivo and show similar expression levels (Masjosthusmann et al., 2019). Other
metabolic pathways are not characterized.

2.5	Exposure Regime: Culture protocol: After the cell expansion period the cells are cultured for up to four weeks
in which they are passaged every week as described in 3.6. Between one to three days after passaging spheres
at a size of 0.3 mm are picked for the test method. For the assessment of neural progenitor cell migration and
differentiation, the spheres are plated on poly-D-lysine/laminin coated 96-well flat bottom plates in
differentiation medium (N2) to initiate migration and differentiation. Thereby one 0.3 mm big sphere is plated
in the middle of a well. The differentiation medium consists of DMEM and Ham's F12 at a ratio of 3 to 1
supplemented with 1% N2 and 1% penicillin and streptomycin. Within 5 days NPCs radially migrate out of the
sphere core and differentiate into radial glia cells (nestin positive), neurons (TUBB3 positive), oligodendrocytes
(04 positive) and astrocytes (GFAP positive). Cultivation during the test method is performed at 37C and 5%
C02 at a ph of 7.2-7.6. Exposure: 0.3 mm big hNPC are plated in the already prepared test conditions. Exposure
starts at day 0 of differentiation and is continued over five days of differentiation until the experiment is
terminated. Cells are fed with fresh medium at day 3 of differentiation. Therefore, half of the test condition
solution (e.g. solvent control or compound dilution) is replaced by freshly prepared test condition solution. Test
Method: Migration distance and cytotoxicity is determined after 72 hours based on brightfield images of each
well. The assay is terminated by the assessment of cell viability and cytotoxicity as well as cell fixation after 120
h. Immunocytochemistry is performed for Hoechst positive nuclei, TUBB3 positive neurons and 04 positive
oligodendrocytes. On the ICC images, migration after 120 hours, neuronal and oligodendrocyte number,
neuronal morphology and neuron/oligodendrocyte specific migration is assessed, immunocytochemical staining
(ICC) generates an image for each sphere. The plates are scanned using an automated high content imaging
device and all nuclei and their positions are determined automatically based on their intensity and size. Images
are imported to the Omnisphero software to run the image analysis that measures the following endpoints.
The method is set up for 8 test conditions including 7 compound concentrations and one SC. The test conditions
are prepared in a serial dilution from the stock solution. Stock solutions are prepared by diluting the compound
in the solvent (e.g. DMSO) in a concentration that allows the preparation of the highest test concentration
without exceeding the highest acceptable solvent concentration. For DMSO the highest acceptable solvent
concentration is 0.1% which means that the stock concentration needs to be at least lOOOx higher than the
highest test concentration. Stock solutions in non-sterile solvents (e.g. water or PBS) have to be sterile filtrated
using a sterile syringe filter (0.2 um). Stock solutions are aliquoted and stored at -20C. A stock solution is not
thawed more than three times. For the preparation of the test condition the stock solution is diluted to the
highest test concentration (default 1:1000) in differentiation media. All following dilutions are prepared by serial
dilution of the highest concentration in differentiation media with solvent (in the concentration of the highest
test concentration). The default serial dilution is 1:3 which covers a concentration range from e.g. 20 uM to 27
nM (729 fold). Depending on the desired concentration range the dilution can be changed to 1:2, 1:5, 1:10 or
other. The SC is prepared by adding the solvent to differentiation media in the same concentration as the highest
test concentration. 100 uLof the compound dilutions and the SC are added to a 96 well plate. The serial dilution
can also be prepared directly in the 96 well plates. hNPC are added to each well after a 15 to 30 min equilibration
period at 37C and 5 % C02.

ASSAY DESIGN SUMMARY

-------
Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.143263374485596 nM
Key positive control:

EGF

Target (nominal) number of replicates:

16

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 22.615

Response cutoff threshold used to determine hit calls: 45.23

Detection technology used: High content image analysis (Fluorescence microscopy)

2.6	Response: All endpoints are generated from the same experimental run and from each well/sphere in the 96
well plate. Primary DNT specific endpoints of the test method are: migration distance radial glia at 72 hours
(NPC2a), migration distance radial glia at 120 hours (NPC2a), migration distance neurons at 120 hours (NPC2b),
migration distance oligodendrocytes at 120 hours (NPC2c), neuronal differentiation at 120 hours (NPC3),
neurite length at 120 hours (NPC4), neurite area at 120 hours (NPC4), and oligodendrocyte differentiation 120
hours (NPC5). Secondary endpoints are: cell number at 120 hours (which is used for normalization of neuronal
and oligodendrocyte differentiation), cytotoxicity at 72 hours, cytotoxicity at 120 hours, and viability at 120
hours. Neuronal differentiation is determined as the number of all TUBB3 positive cells in percent of the
amount of Hoechst positive nuclei in the migration area after 120 hours of differentiation. The identification of
neurons is done automatically using a convolutional neural network (CNN). The endpoint specific control for
neuronal differentiation is EGF. EGF is a growth factor that stimulates radial glia proliferation and migration and
inhibits neuronal differentiation (Ayuso-Sacido et al., 2010; Baumann et al., 2015). 20 ng/mL EGF inhibit
neuronal differentiation to between 0 and 50% of control.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The raw data format is different depending on the endpoints. For all endpoints assessed in a
multiplate reader (viability and cytotoxicity) the raw data format are excel files containing values (one for each
endpoint, timepoint and well) measured as relative fluorescence units. These values are transferred from the

-------
original excel file into the AXES sheet. The original excel output files is saved for traceability of the data. The
migration distance of radial glia after 72 h which is measured manual in ImageJ is directly copied into the AXES
sheet as value in nm. Original brightfield images are archived for 10 years. All other raw data is computed from
the ICC images in the Omnisphero software and is exported and saved as one csv file. From there the values are
again transferred to the AXES sheets. The following data is exported from Omnisphero: number of all cells in
migration area, number of neurons all neurons in migration area, number of all oligodendrocytes in migration
area, migration distance radial glia (nm), mean migration distance all neurons (nm), mean migration distance
all oligodendrocytes (nm), neurite length (nm), and neurite area (pixel). All original ICC images are archived for
10 years. If not otherwise stated, all data processing steps are performed in a R based evaluation tool that was
designed for data processing, curve fitting and point of departure evaluation of in vitro concentration response
toxicity data. Data processing describes all processing steps of raw data that are necessary to obtain the final
response values including the normalization, curve fitting and benchmark concentration calculation. Raw data
processing to summary data for the neuronal differentiation endpoint takes the number of all neurons divided
by the number of all cells (in the migration area), neuronal differentiation % = number of neurons / number of
cells x 100. For the normalization to the solvent control, each replicate data point is normalized to the median
of the solvent control in the respective experiment.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.), 6: resp.multnegl (Multiply the
normalized response value (resp) by -1; -l*resp.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 21: bmad2 (Add a cutoff value
of 2 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using
test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.

-------
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 238	Number of chemicals tested: 223

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

-------
Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 31.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

-------
6.	Bibliography: Baumann J, Gassmann K, Masjosthusmann S, DeBoer D, Bendt F, Giersiefer S, Fritsche E.
Comparative human and rat neurospheres reveal species differences in chemical effects on
neurodevelopmental key events. Arch Toxicol. 2016 Jun;90(6):1415-27. doi: 10.1007/s00204-015-1568-8. Epub
2015 Jul 28. PMID: 26216354.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2948

IU F_N PC4_neu rite_length_120h r

1.	General Information

1.1	Assay Title: Leibniz Research Institute for Environmental Medicine (IUF) 120-hour Neuronal Morphology Assay
for Neurite Length (NPC4)

1.2	Assay Summary: IUF_NPC2-5 is a cell-based, multiplexed assay that uses hNPC, a human primary neural
progenitor cells, with measurements taken at 72 or 120 hours after chemical dosing in a 96-well plate. Human
neural progenitor cells (hNPC) are generated from human fetal brain cortex at gestational week (GW) 16-19. In
the neural progenitor cell migration and differentiation assay (NPC2-5), hNPCa€™s grown as spheres are plated
on an extracellular matrix and migrate and differentiate out of the sphere core. The processes radial glia
migration, neuronal and oligodendrocyte migration as well as neuronal differentiation, neuronal morphology
and oligodendrocyte differentiation are studied using automated phase contrast imaging and automated
fluorescence imaging in combination with high content image analysis. In parallel, the cell viability is assessed
using an alamar blue viability assay and the cytotoxicity using a lactate dehydrogenase dependent membrane
integrity assay. IUF_NPC4_neurite_length_120hr is one of 12 assay components measured from the IUF_NPC2-
5 assay. The neurite length at 120 hr is the mean length in um of all neurons (see
IUF_NPC3_neuronal_differentiation_120hr) that are identified by the skeletonization algorithm in Omnisphero.
Data from the IUF_NPC4_neurite_length_120hr component was analyzed at the endpoint
IUF_NPC4_neurite_length_120hr in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of differentiation reporter, gain or loss-of-signal activity can be
used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Leibniz Research Institute for Environmental Medicine (IUF) is a German research institution in
Diisseldorf. The IUF research mission is the molecular prevention of environmentally induced health disorders.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: For the source cells, Lonza holds donor consent and legal authorization that provides
permission for all research use.

1.9	Assay Throughput: 96-well plate. The methods described are set up in a 96 well plate format with automated
image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates, compound
dilutions, feeding, cell viability and cytotoxicity assay can be automated using a liquid handling system. In the
fully automated set up 10 plates with 8 conditions and 5 replicates per condition can be run in one week. This
results in the generation of 400 data points for each endpoint within one week (excluding all controls). The
throughput is therefore estimated as medium to high.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in the neurite length are indicative of neurodevelopment.

-------
The human developing central nervous system is more vulnerable to the adverse effects of chemical agents than
the adult brain. At present, due to the knowledge gap concerning hazard identification for human
neurodevelopmental toxicity (DNT), there is an urgent need fortestingand subsequently regulation of chemicals
for their potential to interfere with the developing nervous system. Primary human neural progenitor cells
(hNPC) cultivated as three-dimensional floating spheres are able to represent several key processes during brain
development. In the NPC1 assay, hNPC are exposed to the test compound for 72 hours. Cell proliferation is
assessed as an increase in sphere area using automated phase contrast imaging and as incorporation of
Bromodeoxyuridine (Brdll) using a luminescence-based cell proliferation ELISA. In parallel, the cell viability is
assessed using an alamar blue viability assay. In the neural progenitor cell migration and differentiation assay
(NPC2-5), hNPC are plated on an extracellular matrix, and migrate and differentiate out of the sphere core.
Thereby the processes radial glia migration, migration of neurons and oligodendrocytes as well as differentiation
into neurons and oligodendrocytes can be studied in combination with general cell viability and cytotoxicity.
Cell migration and differentiation are critical process during brain development that, if disturbed lead to
alterations in brain development and causes cognitive dysfunction. Currently these processes are assessed in
the OECD TG426 by neuropathological evaluation of certain brain regions as well as neurobehavioral tests.

2.2	Scientific Principles: Primary hNPC are isolated from the fetal brain and represent the process cell migration
and differentiation into neurons, oligodendrocytes within this test method. The test method predicts the hazard
to induce developmental neurotoxicity in the form of neurophysiological and functional changes in the
developing nervous system. The NPC4 neurite length of young primary fetal neurons correspond to
axon/dendrite formation in vivo.

2.3	Experimental System: suspension hNPC primary cell used. Primary human neural progenitor cells (hNPC) are
provided as cryopreserved 3D neurospheres from Lonza, Verviers, Belgium. Material originates from the human
brain cortex of different gestational ages (GW16-19). Sex is either specified or determined before the cells are
used. Grown in suspension culture and under proliferative conditions (proliferation media and growth factors)
these cells represent neural progenitor cell proliferation. 1x106 hNPC are obtained from Lonza (#PT-2599) and
expanded. Lonza provides the cells with a viability of at least 20% FACS analysis indicates proliferating
neurospheres positive for nestin and Ki67. Following differentiation on a poly D-lysin/laminin matrix in the
absence of growth factors, the cells test positive for TUBB3, GFAP, Nestin and 04 (Baumann et al., 2015;
Schmuck et al., 2017). Within the first three days after thawing, 100 uL of spheres (at least 20) are plated on
poly Dlysin/laminin matrix in an 8-chamber slide with 500 uL N2 Media. On day one after plating greater than
80% of spheres need to be differentiated for the cell to be used in experiments. Differentiation towards the
final test system: Cells are frozen in liquid nitrogen and after thawing have to be cultivated in proliferation
medium at 37C and 5 % C02. The medium contains Dulbecco's modified Eagle medium and Hams F12 (3:1)
supplemented with 2% B27, 20 ng/mL EGF, 20 ng/mL recombinant human FGF, 1% penicillin and streptomycin.
The thawing is performed by repeated addition and removal of proliferation medium until all cells are
transferred into suspension in a tissue culture flask. The cells are carefully resuspended and distributed to 10
cm petri dishes filled with fresh, prewarmed proliferation medium. The cells are fed by replacing half the
medium with new medium every two to three days. At each feeding day the culture is checked for impurities
which are removed into a new petri dish. From this dish mistakenly, sorted spheres can be rescued and placed
back in the original culture dish. After 3-4 weeks neurospheres reach the acceptable size of 0.2 - 0.5 mm for
passaging by mechanical dissociation. Thereby, neurospheres are cut into small pieces (0.15 - 0.25 mm;
depending on the desired sphere size), which round up again to uniform sized neurospheres within 1 day in
proliferation medium. By using this method neurospheres are expanded every week. Starting at week 2 poly
hema coated dishes are used for the cultivation procedure. Critical consumable: the cultivation medium does
not contain serum or serum replacement. The use of epidermal growth factor (EGF) and recombinant human
fibroblast growth factor (FGF) is critical for sphere growth. FGF contains 1% bovine serum albumin and is thus
prone to batch effects. Critical handling: The thawing media contains DMSO in a concentration that affects cell
health which is why thawed cells should quickly be diluted in proliferation media (30 mL of media for one vial
of cells). It is recommended to add FGF into medium directly before thawing. At the end of week two of the
expansion period (see below), the spheres should be transferred to petri dishes coated with poly hema to
prevent cell attachment, attached cells that are not differentiated can be detached using a 1000 uL pipet. To

-------
avoid repeated attachment, all cells should be transferred to a new poly hema coated petri dish. Medium that
contains FGF should not be stored longer than 1 week at 4C. During the first two weeks, medium should be
removed using a 1000 uLpipet to keep the accidental removal of small spheres to a minimum. The neurospheres
should be well distributed in the petri dish to circumvent aggregation, which is especially important after
mechanical dissociations.

2.4	Metabolic Competence: Primary hNPC under proliferating and differentiating conditions do not express CYP1A1
and CYP1B1 (Gassmann et al., 2010). Primary hNPC during differentiation, have the capacity to up-regulate
glutathione-dependent protective strategies upon reactive oxygen species (ROS) exposure Masjosthusmann et
al, 2019). Gene expression levels of genes involved in the antioxidative defense (glutathione peroxidase 1
(GPX1), superoxide dismutase 1 (SOD1), catalase (CAT)) were comparable between the in vitro system and
developing human brains in vivo and show similar expression levels (Masjosthusmann et al., 2019). Other
metabolic pathways are not characterized.

2.5	Exposure Regime: Culture protocol: After the cell expansion period the cells are cultured for up to four weeks
in which they are passaged every week as described in 3.6. Between one to three days after passaging spheres
at a size of 0.3 mm are picked for the test method. For the assessment of neural progenitor cell migration and
differentiation, the spheres are plated on poly-D-lysine/laminin coated 96-well flat bottom plates in
differentiation medium (N2) to initiate migration and differentiation. Thereby one 0.3 mm big sphere is plated
in the middle of a well. The differentiation medium consists of DMEM and Ham's F12 at a ratio of 3 to 1
supplemented with 1% N2 and 1% penicillin and streptomycin. Within 5 days NPCs radially migrate out of the
sphere core and differentiate into radial glia cells (nestin positive), neurons (TUBB3 positive), oligodendrocytes
(04 positive) and astrocytes (GFAP positive). Cultivation during the test method is performed at 37C and 5%
C02 at a ph of 7.2-7.6. Exposure: 0.3 mm big hNPC are plated in the already prepared test conditions. Exposure
starts at day 0 of differentiation and is continued over five days of differentiation until the experiment is
terminated. Cells are fed with fresh medium at day 3 of differentiation. Therefore, half of the test condition
solution (e.g. solvent control or compound dilution) is replaced by freshly prepared test condition solution. Test
Method: Migration distance and cytotoxicity is determined after 72 hours based on brightfield images of each
well. The assay is terminated by the assessment of cell viability and cytotoxicity as well as cell fixation after 120
h. Immunocytochemistry is performed for Hoechst positive nuclei, TUBB3 positive neurons and 04 positive
oligodendrocytes. On the ICC images, migration after 120 hours, neuronal and oligodendrocyte number,
neuronal morphology and neuron/oligodendrocyte specific migration is assessed, immunocytochemical staining
(ICC) generates an image for each sphere. The plates are scanned using an automated high content imaging
device and all nuclei and their positions are determined automatically based on their intensity and size. Images
are imported to the Omnisphero software to run the image analysis that measures the following endpoints.
The method is set up for 8 test conditions including 7 compound concentrations and one SC. The test conditions
are prepared in a serial dilution from the stock solution. Stock solutions are prepared by diluting the compound
in the solvent (e.g. DMSO) in a concentration that allows the preparation of the highest test concentration
without exceeding the highest acceptable solvent concentration. For DMSO the highest acceptable solvent
concentration is 0.1% which means that the stock concentration needs to be at least lOOOx higher than the
highest test concentration. Stock solutions in non-sterile solvents (e.g. water or PBS) have to be sterile filtrated
using a sterile syringe filter (0.2 um). Stock solutions are aliquoted and stored at -20C. A stock solution is not
thawed more than three times. For the preparation of the test condition the stock solution is diluted to the
highest test concentration (default 1:1000) in differentiation media. All following dilutions are prepared by serial
dilution of the highest concentration in differentiation media with solvent (in the concentration of the highest
test concentration). The default serial dilution is 1:3 which covers a concentration range from e.g. 20 uM to 27
nM (729 fold). Depending on the desired concentration range the dilution can be changed to 1:2, 1:5, 1:10 or
other. The SC is prepared by adding the solvent to differentiation media in the same concentration as the highest
test concentration. 100 uLof the compound dilutions and the SC are added to a 96 well plate. The serial dilution
can also be prepared directly in the 96 well plates. hNPC are added to each well after a 15 to 30 min equilibration
period at 37C and 5 % C02.

ASSAY DESIGN SUMMARY

-------
Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.143263374485596 nM
Key positive control:

NA

Target (nominal) number of replicates:

16

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 14.636
Response cutoff threshold used to determine hit calls: 30

Detection technology used: High content image analysis (Fluorescence microscopy)

2.6	Response: All endpoints are generated from the same experimental run and from each well/sphere in the 96
well plate. Primary DNT specific endpoints of the test method are: migration distance radial glia at 72 hours
(NPC2a), migration distance radial glia at 120 hours (NPC2a), migration distance neurons at 120 hours (NPC2b),
migration distance oligodendrocytes at 120 hours (NPC2c), neuronal differentiation at 120 hours (NPC3),
neurite length at 120 hours (NPC4), neurite area at 120 hours (NPC4), and oligodendrocyte differentiation 120
hours (NPC5). Secondary endpoints are: cell number at 120 hours (which is used for normalization of neuronal
and oligodendrocyte differentiation), cytotoxicity at 72 hours, cytotoxicity at 120 hours, and viability at 120
hours. Neurite length at 120 hours is the mean length in um of all neuron that are identified by the
skeletonization algorithm in Omnisphero. The identification of neurons is done automatically using a
convolutional neural network (CNN).

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The raw data format is different depending on the endpoints. For all endpoints assessed in a
multiplate reader (viability and cytotoxicity) the raw data format are excel files containing values (one for each
endpoint, timepoint and well) measured as relative fluorescence units. These values are transferred from the
original excel file into the AXES sheet. The original excel output files is saved for traceability of the data. The
migration distance of radial glia after 72 h which is measured manual in ImageJ is directly copied into the AXES
sheet as value in um. Original brightfield images are archived for 10 years. All other raw data is computed from

-------
the ICC images in the Omnisphero software and is exported and saved as one csv file. From there the values are
again transferred to the AXES sheets. The following data is exported from Omnisphero: number of all cells in
migration area, number of neurons all neurons in migration area, number of all oligodendrocytes in migration
area, migration distance radial glia (nm), mean migration distance all neurons (nm), mean migration distance
all oligodendrocytes (nm), neurite length (nm), and neurite area (pixel). All original ICC images are archived for
10 years. If not otherwise stated, all data processing steps are performed in a R based evaluation tool that was
designed for data processing, curve fitting and point of departure evaluation of in vitro concentration response
toxicity data. Data processing describes all processing steps of raw data that are necessary to obtain the final
response values including the normalization, curve fitting and benchmark concentration calculation. Neurite
Length assessment involves no pre-processing. The data is normalized to the solvent control. For the
normalization to the solvent control, each replicate data point is normalized to the median of the solvent control
in the respective experiment.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.), 6: resp.multnegl (Multiply the
normalized response value (resp) by -1; -l*resp.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 21: bmad2 (Add a cutoff value
of 2 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using
test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:

-------
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 238	Number of chemicals tested: 223

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

-------
(1 - ((3 * (mmad + nmad)) / abs(mmed - rimed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 26.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Baumann J, Gassmann K, Masjosthusmann S, DeBoer D, Bendt F, Giersiefer S, Fritsche E.
Comparative human and rat neurospheres reveal species differences in chemical effects on
neurodevelopmental key events. Arch Toxicol. 2016 Jun;90(6):1415-27. doi: 10.1007/s00204-015-1568-8. Epub
2015 Jul 28. PMID: 26216354.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2950

IU F_N PC4_neu rite_a rea_120h r

1.	General Information

1.1	Assay Title: Leibniz Research Institute for Environmental Medicine (IUF) 120-hour Neuronal Morphology Assay
for Neurite Area (NPC4)

1.2	Assay Summary: IUF_NPC2-5 is a cell-based, multiplexed assay that uses hNPC, a human primary neural
progenitor cells, with measurements taken at 72 or 120 hours after chemical dosing in a 96-well plate. Human
neural progenitor cells (hNPC) are generated from human fetal brain cortex at gestational week (GW) 16-19. In
the neural progenitor cell migration and differentiation assay (NPC2-5), hNPCa€™s grown as spheres are plated
on an extracellular matrix and migrate and differentiate out of the sphere core. The processes radial glia
migration, neuronal and oligodendrocyte migration as well as neuronal differentiation, neuronal morphology
and oligodendrocyte differentiation are studied using automated phase contrast imaging and automated
fluorescence imaging in combination with high content image analysis. In parallel, the cell viability is assessed
using an alamar blue viability assay and the cytotoxicity using a lactate dehydrogenase dependent membrane
integrity assay. IUF_NPC4_neurite_area_120hr is one of 12 assay components measured from the IUF_NPC2-5
assay. The neurite area at 120 hr is the mean area in pixel (without nuclei) of all neurons (see
IUF_NPC3_neuronal_differentiation_120hr) that are identified by the skeletonization algorithm in Omnisphero.
Data from the IUF_NPC4_neurite_area_120hr component was analyzed at the endpoint
IUF_NPC4_neurite_area_120hr in the positive analysis fitting direction relative to DMSO as the negative control
and baseline of activity. Using a type of differentiation reporter, loss-of-signal activity can be used to understand
developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Leibniz Research Institute for Environmental Medicine (IUF) is a German research institution in
Diisseldorf. The IUF research mission is the molecular prevention of environmentally induced health disorders.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: For the source cells, Lonza holds donor consent and legal authorization that provides
permission for all research use.

1.9	Assay Throughput: 96-well plate. The methods described are set up in a 96 well plate format with automated
image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates, compound
dilutions, feeding, cell viability and cytotoxicity assay can be automated using a liquid handling system. In the
fully automated set up 10 plates with 8 conditions and 5 replicates per condition can be run in one week. This
results in the generation of 400 data points for each endpoint within one week (excluding all controls). The
throughput is therefore estimated as medium to high.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in the neurite area are indicative of neurodevelopment.

-------
The human developing central nervous system is more vulnerable to the adverse effects of chemical agents than
the adult brain. At present, due to the knowledge gap concerning hazard identification for human
neurodevelopmental toxicity (DNT), there is an urgent need fortestingand subsequently regulation of chemicals
for their potential to interfere with the developing nervous system. Primary human neural progenitor cells
(hNPC) cultivated as three-dimensional floating spheres are able to represent several key processes during brain
development. In the NPC1 assay, hNPC are exposed to the test compound for 72 hours. Cell proliferation is
assessed as an increase in sphere area using automated phase contrast imaging and as incorporation of
Bromodeoxyuridine (Brdll) using a luminescence-based cell proliferation ELISA. In parallel, the cell viability is
assessed using an alamar blue viability assay. In the neural progenitor cell migration and differentiation assay
(NPC2-5), hNPC are plated on an extracellular matrix, and migrate and differentiate out of the sphere core.
Thereby the processes radial glia migration, migration of neurons and oligodendrocytes as well as differentiation
into neurons and oligodendrocytes can be studied in combination with general cell viability and cytotoxicity.
Cell migration and differentiation are critical process during brain development that, if disturbed lead to
alterations in brain development and causes cognitive dysfunction. Currently these processes are assessed in
the OECD TG426 by neuropathological evaluation of certain brain regions as well as neurobehavioral tests.

2.2	Scientific Principles: Primary hNPC are isolated from the fetal brain and represent the process cell migration
and differentiation into neurons, oligodendrocytes within this test method. The test method predicts the hazard
to induce developmental neurotoxicity in the form of neurophysiological and functional changes in the
developing nervous system. The NPC4 neurite area of young primary fetal neurons correspond to axon/dendrite
formation in vivo.

2.3	Experimental System: suspension hNPC primary cell used. Primary human neural progenitor cells (hNPC) are
provided as cryopreserved 3D neurospheres from Lonza, Verviers, Belgium. Material originates from the human
brain cortex of different gestational ages (GW16-19). Sex is either specified or determined before the cells are
used. Grown in suspension culture and under proliferative conditions (proliferation media and growth factors)
these cells represent neural progenitor cell proliferation. 1x106 hNPC are obtained from Lonza (#PT-2599) and
expanded. Lonza provides the cells with a viability of at least 20% FACS analysis indicates proliferating
neurospheres positive for nestin and Ki67. Following differentiation on a poly D-lysin/laminin matrix in the
absence of growth factors, the cells test positive for TUBB3, GFAP, Nestin and 04 (Baumann et al., 2015;
Schmuck et al., 2017). Within the first three days after thawing, 100 uL of spheres (at least 20) are plated on
poly Dlysin/laminin matrix in an 8-chamber slide with 500 uL N2 Media. On day one after plating greater than
80% of spheres need to be differentiated for the cell to be used in experiments. Differentiation towards the
final test system: Cells are frozen in liquid nitrogen and after thawing have to be cultivated in proliferation
medium at 37C and 5 % C02. The medium contains Dulbecco's modified Eagle medium and Hams F12 (3:1)
supplemented with 2% B27, 20 ng/mL EGF, 20 ng/mL recombinant human FGF, 1% penicillin and streptomycin.
The thawing is performed by repeated addition and removal of proliferation medium until all cells are
transferred into suspension in a tissue culture flask. The cells are carefully resuspended and distributed to 10
cm petri dishes filled with fresh, prewarmed proliferation medium. The cells are fed by replacing half the
medium with new medium every two to three days. At each feeding day the culture is checked for impurities
which are removed into a new petri dish. From this dish mistakenly, sorted spheres can be rescued and placed
back in the original culture dish. After 3-4 weeks neurospheres reach the acceptable size of 0.2 - 0.5 mm for
passaging by mechanical dissociation. Thereby, neurospheres are cut into small pieces (0.15 - 0.25 mm;
depending on the desired sphere size), which round up again to uniform sized neurospheres within 1 day in
proliferation medium. By using this method neurospheres are expanded every week. Starting at week 2 poly
hema coated dishes are used for the cultivation procedure. Critical consumable: the cultivation medium does
not contain serum or serum replacement. The use of epidermal growth factor (EGF) and recombinant human
fibroblast growth factor (FGF) is critical for sphere growth. FGF contains 1% bovine serum albumin and is thus
prone to batch effects. Critical handling: The thawing media contains DMSO in a concentration that affects cell
health which is why thawed cells should quickly be diluted in proliferation media (30 mL of media for one vial
of cells). It is recommended to add FGF into medium directly before thawing. At the end of week two of the
expansion period (see below), the spheres should be transferred to petri dishes coated with poly hema to
prevent cell attachment, attached cells that are not differentiated can be detached using a 1000 uL pipet. To

-------
avoid repeated attachment, all cells should be transferred to a new poly hema coated petri dish. Medium that
contains FGF should not be stored longer than 1 week at 4C. During the first two weeks, medium should be
removed using a 1000 uLpipet to keep the accidental removal of small spheres to a minimum. The neurospheres
should be well distributed in the petri dish to circumvent aggregation, which is especially important after
mechanical dissociations.

2.4	Metabolic Competence: Primary hNPC under proliferating and differentiating conditions do not express CYP1A1
and CYP1B1 (Gassmann et al., 2010). Primary hNPC during differentiation, have the capacity to up-regulate
glutathione-dependent protective strategies upon reactive oxygen species (ROS) exposure Masjosthusmann et
al, 2019). Gene expression levels of genes involved in the antioxidative defense (glutathione peroxidase 1
(GPX1), superoxide dismutase 1 (SOD1), catalase (CAT)) were comparable between the in vitro system and
developing human brains in vivo and show similar expression levels (Masjosthusmann et al., 2019). Other
metabolic pathways are not characterized.

2.5	Exposure Regime: Culture protocol: After the cell expansion period the cells are cultured for up to four weeks
in which they are passaged every week as described in 3.6. Between one to three days after passaging spheres
at a size of 0.3 mm are picked for the test method. For the assessment of neural progenitor cell migration and
differentiation, the spheres are plated on poly-D-lysine/laminin coated 96-well flat bottom plates in
differentiation medium (N2) to initiate migration and differentiation. Thereby one 0.3 mm big sphere is plated
in the middle of a well. The differentiation medium consists of DMEM and Ham's F12 at a ratio of 3 to 1
supplemented with 1% N2 and 1% penicillin and streptomycin. Within 5 days NPCs radially migrate out of the
sphere core and differentiate into radial glia cells (nestin positive), neurons (TUBB3 positive), oligodendrocytes
(04 positive) and astrocytes (GFAP positive). Cultivation during the test method is performed at 37C and 5%
C02 at a ph of 7.2-7.6. Exposure: 0.3 mm big hNPC are plated in the already prepared test conditions. Exposure
starts at day 0 of differentiation and is continued over five days of differentiation until the experiment is
terminated. Cells are fed with fresh medium at day 3 of differentiation. Therefore, half of the test condition
solution (e.g. solvent control or compound dilution) is replaced by freshly prepared test condition solution. Test
Method: Migration distance and cytotoxicity is determined after 72 hours based on brightfield images of each
well. The assay is terminated by the assessment of cell viability and cytotoxicity as well as cell fixation after 120
h. Immunocytochemistry is performed for Hoechst positive nuclei, TUBB3 positive neurons and 04 positive
oligodendrocytes. On the ICC images, migration after 120 hours, neuronal and oligodendrocyte number,
neuronal morphology and neuron/oligodendrocyte specific migration is assessed, immunocytochemical staining
(ICC) generates an image for each sphere. The plates are scanned using an automated high content imaging
device and all nuclei and their positions are determined automatically based on their intensity and size. Images
are imported to the Omnisphero software to run the image analysis that measures the following endpoints.
The method is set up for 8 test conditions including 7 compound concentrations and one SC. The test conditions
are prepared in a serial dilution from the stock solution. Stock solutions are prepared by diluting the compound
in the solvent (e.g. DMSO) in a concentration that allows the preparation of the highest test concentration
without exceeding the highest acceptable solvent concentration. For DMSO the highest acceptable solvent
concentration is 0.1% which means that the stock concentration needs to be at least lOOOx higher than the
highest test concentration. Stock solutions in non-sterile solvents (e.g. water or PBS) have to be sterile filtrated
using a sterile syringe filter (0.2 um). Stock solutions are aliquoted and stored at -20C. A stock solution is not
thawed more than three times. For the preparation of the test condition the stock solution is diluted to the
highest test concentration (default 1:1000) in differentiation media. All following dilutions are prepared by serial
dilution of the highest concentration in differentiation media with solvent (in the concentration of the highest
test concentration). The default serial dilution is 1:3 which covers a concentration range from e.g. 20 uM to 27
nM (729 fold). Depending on the desired concentration range the dilution can be changed to 1:2, 1:5, 1:10 or
other. The SC is prepared by adding the solvent to differentiation media in the same concentration as the highest
test concentration. 100 uLof the compound dilutions and the SC are added to a 96 well plate. The serial dilution
can also be prepared directly in the 96 well plates. hNPC are added to each well after a 15 to 30 min equilibration
period at 37C and 5 % C02.

ASSAY DESIGN SUMMARY

-------
Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.143263374485596 nM
Key positive control:

NA

Target (nominal) number of replicates:

16

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 14.444
Response cutoff threshold used to determine hit calls: 30

Detection technology used: High content image analysis (Fluorescence microscopy)

2.6	Response: All endpoints are generated from the same experimental run and from each well/sphere in the 96
well plate. Primary DNT specific endpoints of the test method are: migration distance radial glia at 72 hours
(NPC2a), migration distance radial glia at 120 hours (NPC2a), migration distance neurons at 120 hours (NPC2b),
migration distance oligodendrocytes at 120 hours (NPC2c), neuronal differentiation at 120 hours (NPC3),
neurite length at 120 hours (NPC4), neurite area at 120 hours (NPC4), and oligodendrocyte differentiation 120
hours (NPC5). Secondary endpoints are: cell number at 120 hours (which is used for normalization of neuronal
and oligodendrocyte differentiation), cytotoxicity at 72 hours, cytotoxicity at 120 hours, and viability at 120
hours. Neurite area at 120 hours is the mean area in pixel (without nuclei) of all neuron that are identified by
the skeletonization algorithm in Omnisphero. The identification of neurons is done automatically using a
convolutional neural network (CNN).

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The raw data format is different depending on the endpoints. For all endpoints assessed in a
multiplate reader (viability and cytotoxicity) the raw data format are excel files containing values (one for each
endpoint, timepoint and well) measured as relative fluorescence units. These values are transferred from the
original excel file into the AXES sheet. The original excel output files is saved for traceability of the data. The
migration distance of radial glia after 72 h which is measured manual in ImageJ is directly copied into the AXES
sheet as value in nm. Original brightfield images are archived for 10 years. All other raw data is computed from

-------
the ICC images in the Omnisphero software and is exported and saved as one csv file. From there the values are
again transferred to the AXES sheets. The following data is exported from Omnisphero: number of all cells in
migration area, number of neurons all neurons in migration area, number of all oligodendrocytes in migration
area, migration distance radial glia (nm), mean migration distance all neurons (nm), mean migration distance
all oligodendrocytes (nm), neurite length (nm), and neurite area (pixel). All original ICC images are archived for
10 years. If not otherwise stated, all data processing steps are performed in a R based evaluation tool that was
designed for data processing, curve fitting and point of departure evaluation of in vitro concentration response
toxicity data. Data processing describes all processing steps of raw data that are necessary to obtain the final
response values including the normalization, curve fitting and benchmark concentration calculation. Neurite
Area assessment involves no pre-processing. The data is normalized to the solvent control. For the normalization
to the solvent control, each replicate data point is normalized to the median of the solvent control in the
respective experiment.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.), 6: resp.multnegl (Multiply the
normalized response value (resp) by -1; -l*resp.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 21: bmad2 (Add a cutoff value
of 2 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using
test compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model
hitcall (hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where
only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one

-------
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 238	Number of chemicals tested: 223

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

-------
NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 31.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

-------
6.	Bibliography: Baumann J, Gassmann K, Masjosthusmann S, DeBoer D, Bendt F, Giersiefer S, Fritsche E.
Comparative human and rat neurospheres reveal species differences in chemical effects on
neurodevelopmental key events. Arch Toxicol. 2016 Jun;90(6):1415-27. doi: 10.1007/s00204-015-1568-8. Epub
2015 Jul 28. PMID: 26216354.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2952

IU F_N PC5_oligodend rocyte_differentiation_120h r

1.	General Information

1.1	Assay Title: Leibniz Research Institute for Environmental Medicine (IUF) 120-hour Oligodendrocyte
Differentiation Assay (NPC5)

1.2	Assay Summary: IUF_NPC2-5 is a cell-based, multiplexed assay that uses hNPC, a human primary neural
progenitor cells, with measurements taken at 72 or 120 hours after chemical dosing in a 96-well plate. Human
neural progenitor cells (hNPC) are generated from human fetal brain cortex at gestational week (GW) 16-19. In
the neural progenitor cell migration and differentiation assay (NPC2-5), hNPCa€™s grown as spheres are plated
on an extracellular matrix and migrate and differentiate out of the sphere core. The processes radial glia
migration, neuronal and oligodendrocyte migration as well as neuronal differentiation, neuronal morphology
and oligodendrocyte differentiation are studied using automated phase contrast imaging and automated
fluorescence imaging in combination with high content image analysis. In parallel, the cell viability is assessed
using an alamar blue viability assay and the cytotoxicity using a lactate dehydrogenase dependent membrane
integrity assay. IUF_NPC5_oligodendrocyte_differentiation_120hr is one of 12 assay components measured
from the IUF_NPC2-5 assay. Oligodendrocyte differentiation at 120 hr is determined as the number of all 04-
positive cells in percent of the number of Hoechst-positive nuclei in the total neurosphere migration area (see
IUF_NPC2a_ glia_migration_120hr) after 120 hr of migration/differentiation. Oligodendrocytes are
automatically identified using a convolutional neural network (CNN) that was trained using manually annotated
images of differentiated oligodendrocytes. Data from the IUF_NPC5_oligodendrocyte_differentiation_120hr
component was analyzed at the endpoint IUF_NPC5_oligodendrocyte_differentiation_120hr in the positive
analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of
differentiation reporter, loss-of-signal activity can be used to understand developmental effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Leibniz Research Institute for Environmental Medicine (IUF) is a German research institution in
Diisseldorf. The IUF research mission is the molecular prevention of environmentally induced health disorders.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: For the source cells, Lonza holds donor consent and legal authorization that provides
permission for all research use.

1.9	Assay Throughput: 96-well plate. The methods described are set up in a 96 well plate format with automated
image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates, compound
dilutions, feeding, cell viability and cytotoxicity assay can be automated using a liquid handling system. In the
fully automated set up 10 plates with 8 conditions and 5 replicates per condition can be run in one week. This
results in the generation of 400 data points for each endpoint within one week (excluding all controls). The
throughput is therefore estimated as medium to high.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in the oligodendrocyte differentiation are indicative of neurodevelopment.

-------
The human developing central nervous system is more vulnerable to the adverse effects of chemical agents than
the adult brain. At present, due to the knowledge gap concerning hazard identification for human
neurodevelopmental toxicity (DNT), there is an urgent need fortestingand subsequently regulation of chemicals
for their potential to interfere with the developing nervous system. Primary human neural progenitor cells
(hNPC) cultivated as three-dimensional floating spheres are able to represent several key processes during brain
development. In the NPC1 assay, hNPC are exposed to the test compound for 72 hours. Cell proliferation is
assessed as an increase in sphere area using automated phase contrast imaging and as incorporation of
Bromodeoxyuridine (Brdll) using a luminescence-based cell proliferation ELISA. In parallel, the cell viability is
assessed using an alamar blue viability assay. In the neural progenitor cell migration and differentiation assay
(NPC2-5), hNPC are plated on an extracellular matrix, and migrate and differentiate out of the sphere core.
Thereby the processes radial glia migration, migration of neurons and oligodendrocytes as well as differentiation
into neurons and oligodendrocytes can be studied in combination with general cell viability and cytotoxicity.
Cell migration and differentiation are critical process during brain development that, if disturbed lead to
alterations in brain development and causes cognitive dysfunction. Currently these processes are assessed in
the OECD TG426 by neuropathological evaluation of certain brain regions as well as neurobehavioral tests.

2.2	Scientific Principles: Primary hNPC are isolated from the fetal brain and represent the process cell migration
and differentiation into neurons, oligodendrocytes within this test method. The test method predicts the hazard
to induce developmental neurotoxicity in the form of neurophysiological and functional changes in the
developing nervous system. The NPC5 oligodendrocyte formation from fetal NPC correspond to
oligodendrogenesis during the fetal phase of brain development in vivo.

2.3	Experimental System: suspension hNPC primary cell used. Primary human neural progenitor cells (hNPC) are
provided as cryopreserved 3D neurospheres from Lonza, Verviers, Belgium. Material originates from the human
brain cortex of different gestational ages (GW16-19). Sex is either specified or determined before the cells are
used. Grown in suspension culture and under proliferative conditions (proliferation media and growth factors)
these cells represent neural progenitor cell proliferation. 1x106 hNPC are obtained from Lonza (#PT-2599) and
expanded. Lonza provides the cells with a viability of at least 20% FACS analysis indicates proliferating
neurospheres positive for nestin and Ki67. Following differentiation on a poly D-lysin/laminin matrix in the
absence of growth factors, the cells test positive for TUBB3, GFAP, Nestin and 04 (Baumann et al., 2015;
Schmuck et al., 2017). Within the first three days after thawing, 100 uL of spheres (at least 20) are plated on
poly Dlysin/laminin matrix in an 8-chamber slide with 500 uL N2 Media. On day one after plating greater than
80% of spheres need to be differentiated for the cell to be used in experiments. Differentiation towards the
final test system: Cells are frozen in liquid nitrogen and after thawing have to be cultivated in proliferation
medium at 37C and 5 % C02. The medium contains Dulbecco's modified Eagle medium and Hams F12 (3:1)
supplemented with 2% B27, 20 ng/mL EGF, 20 ng/mL recombinant human FGF, 1% penicillin and streptomycin.
The thawing is performed by repeated addition and removal of proliferation medium until all cells are
transferred into suspension in a tissue culture flask. The cells are carefully resuspended and distributed to 10
cm petri dishes filled with fresh, prewarmed proliferation medium. The cells are fed by replacing half the
medium with new medium every two to three days. At each feeding day the culture is checked for impurities
which are removed into a new petri dish. From this dish mistakenly, sorted spheres can be rescued and placed
back in the original culture dish. After 3-4 weeks neurospheres reach the acceptable size of 0.2 - 0.5 mm for
passaging by mechanical dissociation. Thereby, neurospheres are cut into small pieces (0.15 - 0.25 mm;
depending on the desired sphere size), which round up again to uniform sized neurospheres within 1 day in
proliferation medium. By using this method neurospheres are expanded every week. Starting at week 2 poly
hema coated dishes are used for the cultivation procedure. Critical consumable: the cultivation medium does
not contain serum or serum replacement. The use of epidermal growth factor (EGF) and recombinant human
fibroblast growth factor (FGF) is critical for sphere growth. FGF contains 1% bovine serum albumin and is thus
prone to batch effects. Critical handling: The thawing media contains DMSO in a concentration that affects cell
health which is why thawed cells should quickly be diluted in proliferation media (30 mL of media for one vial
of cells). It is recommended to add FGF into medium directly before thawing. At the end of week two of the
expansion period (see below), the spheres should be transferred to petri dishes coated with poly hema to
prevent cell attachment, attached cells that are not differentiated can be detached using a 1000 uL pipet. To

-------
avoid repeated attachment, all cells should be transferred to a new poly hema coated petri dish. Medium that
contains FGF should not be stored longer than 1 week at 4C. During the first two weeks, medium should be
removed using a 1000 uLpipet to keep the accidental removal of small spheres to a minimum. The neurospheres
should be well distributed in the petri dish to circumvent aggregation, which is especially important after
mechanical dissociations.

2.4	Metabolic Competence: Primary hNPC under proliferating and differentiating conditions do not express CYP1A1
and CYP1B1 (Gassmann et al., 2010). Primary hNPC during differentiation, have the capacity to up-regulate
glutathione-dependent protective strategies upon reactive oxygen species (ROS) exposure Masjosthusmann et
al, 2019). Gene expression levels of genes involved in the antioxidative defense (glutathione peroxidase 1
(GPX1), superoxide dismutase 1 (SOD1), catalase (CAT)) were comparable between the in vitro system and
developing human brains in vivo and show similar expression levels (Masjosthusmann et al., 2019). Other
metabolic pathways are not characterized.

2.5	Exposure Regime: Culture protocol: After the cell expansion period the cells are cultured for up to four weeks
in which they are passaged every week as described in 3.6. Between one to three days after passaging spheres
at a size of 0.3 mm are picked for the test method. For the assessment of neural progenitor cell migration and
differentiation, the spheres are plated on poly-D-lysine/laminin coated 96-well flat bottom plates in
differentiation medium (N2) to initiate migration and differentiation. Thereby one 0.3 mm big sphere is plated
in the middle of a well. The differentiation medium consists of DMEM and Ham's F12 at a ratio of 3 to 1
supplemented with 1% N2 and 1% penicillin and streptomycin. Within 5 days NPCs radially migrate out of the
sphere core and differentiate into radial glia cells (nestin positive), neurons (TUBB3 positive), oligodendrocytes
(04 positive) and astrocytes (GFAP positive). Cultivation during the test method is performed at 37C and 5%
C02 at a ph of 7.2-7.6. Exposure: 0.3 mm big hNPC are plated in the already prepared test conditions. Exposure
starts at day 0 of differentiation and is continued over five days of differentiation until the experiment is
terminated. Cells are fed with fresh medium at day 3 of differentiation. Therefore, half of the test condition
solution (e.g. solvent control or compound dilution) is replaced by freshly prepared test condition solution. Test
Method: Migration distance and cytotoxicity is determined after 72 hours based on brightfield images of each
well. The assay is terminated by the assessment of cell viability and cytotoxicity as well as cell fixation after 120
h. Immunocytochemistry is performed for Hoechst positive nuclei, TUBB3 positive neurons and 04 positive
oligodendrocytes. On the ICC images, migration after 120 hours, neuronal and oligodendrocyte number,
neuronal morphology and neuron/oligodendrocyte specific migration is assessed, immunocytochemical staining
(ICC) generates an image for each sphere. The plates are scanned using an automated high content imaging
device and all nuclei and their positions are determined automatically based on their intensity and size. Images
are imported to the Omnisphero software to run the image analysis that measures the following endpoints.
The method is set up for 8 test conditions including 7 compound concentrations and one SC. The test conditions
are prepared in a serial dilution from the stock solution. Stock solutions are prepared by diluting the compound
in the solvent (e.g. DMSO) in a concentration that allows the preparation of the highest test concentration
without exceeding the highest acceptable solvent concentration. For DMSO the highest acceptable solvent
concentration is 0.1% which means that the stock concentration needs to be at least lOOOx higher than the
highest test concentration. Stock solutions in non-sterile solvents (e.g. water or PBS) have to be sterile filtrated
using a sterile syringe filter (0.2 um). Stock solutions are aliquoted and stored at -20C. A stock solution is not
thawed more than three times. For the preparation of the test condition the stock solution is diluted to the
highest test concentration (default 1:1000) in differentiation media. All following dilutions are prepared by serial
dilution of the highest concentration in differentiation media with solvent (in the concentration of the highest
test concentration). The default serial dilution is 1:3 which covers a concentration range from e.g. 20 uM to 27
nM (729 fold). Depending on the desired concentration range the dilution can be changed to 1:2, 1:5, 1:10 or
other. The SC is prepared by adding the solvent to differentiation media in the same concentration as the highest
test concentration. 100 uLof the compound dilutions and the SC are added to a 96 well plate. The serial dilution
can also be prepared directly in the 96 well plates. hNPC are added to each well after a 15 to 30 min equilibration
period at 37C and 5 % C02.

ASSAY DESIGN SUMMARY

-------
Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.143263374485596 nM
Key positive control:

BMP7

Target (nominal) number of replicates:

15

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 28.259

Response cutoff threshold used to determine hit calls: 56.518

Detection technology used: High content image analysis (Fluorescence microscopy)

2.6	Response: All endpoints are generated from the same experimental run and from each well/sphere in the 96
well plate. Primary DNT specific endpoints of the test method are: migration distance radial glia at 72 hours
(NPC2a), migration distance radial glia at 120 hours (NPC2a), migration distance neurons at 120 hours (NPC2b),
migration distance oligodendrocytes at 120 hours (NPC2c), neuronal differentiation at 120 hours (NPC3),
neurite length at 120 hours (NPC4), neurite area at 120 hours (NPC4), and oligodendrocyte differentiation 120
hours (NPC5). Secondary endpoints are: cell number at 120 hours (which is used for normalization of neuronal
and oligodendrocyte differentiation), cytotoxicity at 72 hours, cytotoxicity at 120 hours, and viability at 120
hours. Oligodendrocyte differentiation at 120 hours is determined as number of all 04 positive cells in percent
of the amount of all Hoechst positive nuclei in the migration area after 120 hours of differentiation. The
identification of oligodendrocytes is done automatically using a convolutional neural network (CNN). Training
of the CNN was done based on manually annotated experiments. The endpoint specific control for
oligodendrocyte differentiation is BMP7. BMP7 promotes the BMP signaling cascade which upregulates
astroglia differentiation and maturation and inhibits oligodendrocyte formation (Baumann et al., 2015; Gross et
al., 1996; Mabie et al., 1997). 100 ng/mL BMP7 inhibit oligodendrocyte differentiation to between 0 and 60 %
of SC.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1 Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

-------
3.2 Data Analysis: The raw data format is different depending on the endpoints. For all endpoints assessed in a
multiplate reader (viability and cytotoxicity) the raw data format are excel files containing values (one for each
endpoint, timepoint and well) measured as relative fluorescence units. These values are transferred from the
original excel file into the AXES sheet. The original excel output files is saved for traceability of the data. The
migration distance of radial glia after 72 h which is measured manual in ImageJ is directly copied into the AXES
sheet as value in nm. Original brightfield images are archived for 10 years. All other raw data is computed from
the ICC images in the Omnisphero software and is exported and saved as one csv file. From there the values are
again transferred to the AXES sheets. The following data is exported from Omnisphero: number of all cells in
migration area, number of neurons all neurons in migration area, number of all oligodendrocytes in migration
area, migration distance radial glia (nm), mean migration distance all neurons (nm), mean migration distance
all oligodendrocytes (nm), neurite length (nm), and neurite area (pixel). All original ICC images are archived for
10 years. If not otherwise stated, all data processing steps are performed in a R based evaluation tool that was
designed for data processing, curve fitting and point of departure evaluation of in vitro concentration response
toxicity data. Data processing describes all processing steps of raw data that are necessary to obtain the final
response values including the normalization, curve fitting and benchmark concentration calculation. Raw data
processing to summary data for the oligodendrocyte differentiation endpoint takes the number of all
oligodendrocyte is divided by the number of all cells (in the migration area), oligodendrocyte differentiation %
= number of oligodendrocytes / all cells x 100. For the normalization to the solvent control, each replicate data
point is normalized to the median of the solvent control in the respective experiment.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.), 6: resp.multnegl (Multiply the
normalized response value (resp) by -1; -l*resp.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

21: bmad2 (Add a cutoff value of 2 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.), 27:
ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis
direction. Typically used for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning

-------
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 238	Number of chemicals tested: 223

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
52

Inactive hit count: 0
-------
For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

NA

NA

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

NA

-------
Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 36.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

• Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

-------
•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Baumann J, Gassmann K, Masjosthusmann S, DeBoer D, Bendt F, Giersiefer S, Fritsche E.
Comparative human and rat neurospheres reveal species differences in chemical effects on
neurodevelopmental key events. Arch Toxicol. 2016 Jun;90(6):1415-27. doi: 10.1007/s00204-015-1568-8. Epub
2015 Jul 28. PMID: 26216354.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2954

IU F_N PC2-5_cytotoxicity_72 h r

1.	General Information

1.1	Assay Title: Cytotoxicity Assessment at 72 hours in the Leibniz Research Institute for Environmental Medicine
(IUF) Neural Progenitor Cell Migration and Differentiation Assay for Cytotoxicity (NPC2-5)

1.2	Assay Summary: IUF_NPC2-5 is a cell-based, multiplexed assay that uses hNPC, a human primary neural
progenitor cells, with measurements taken at 72 or 120 hours after chemical dosing in a 96-well plate. Human
neural progenitor cells (hNPC) are generated from human fetal brain cortex at gestational week (GW) 16-19. In
the neural progenitor cell migration and differentiation assay (NPC2-5), hNPCa€™s grown as spheres are plated
on an extracellular matrix and migrate and differentiate out of the sphere core. The processes radial glia
migration, neuronal and oligodendrocyte migration as well as neuronal differentiation, neuronal morphology
and oligodendrocyte differentiation are studied using automated phase contrast imaging and automated
fluorescence imaging in combination with high content image analysis. In parallel, the cell viability is assessed
using an alamar blue viability assay and the cytotoxicity using a lactate dehydrogenase dependent membrane
integrity assay. IUF_NPC2-5_cytotoxicity_72hr is one of 12 assay components measured from the IUF_NPC2-5
assay. It is designed to measure the cytotoxicity at 72 hr as assessed by membrane integrity related to the LDH
dependent reduction of resazurin to resorufin in the supernatant of each well. The cytotoxicity is measured as
fluorescence signal (relative fluorescence unit) in a multiplate reader. Data from the IUF_NPC2-
5_cytotoxicity_72hr component was analyzed at the endpoint IUF_NPC2-5_cytotoxicity_72hr in the positive
analysis fitting direction relative to the dynamic range and baseline of activity. Using a type of viability reporter,
gain-of-signal activity can be used to understand cytotoxicity effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Leibniz Research Institute for Environmental Medicine (IUF) is a German research institution in
Diisseldorf. The IUF research mission is the molecular prevention of environmentally induced health disorders.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: For the source cells, Lonza holds donor consent and legal authorization that provides
permission for all research use.

1.9	Assay Throughput: 96-well plate. The methods described are set up in a 96 well plate format with automated
image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates, compound
dilutions, feeding, cell viability and cytotoxicity assay can be automated using a liquid handling system. In the
fully automated set up 10 plates with 8 conditions and 5 replicates per condition can be run in one week. This
results in the generation of 400 data points for each endpoint within one week (excluding all controls). The
throughput is therefore estimated as medium to high.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in enzymatic activity (increase in lactate dehydrogenase) are indicative of compromised
cell health. Reductions in the total LDH (in cells), indicates cell loss or death.

-------
The human developing central nervous system is more vulnerable to the adverse effects of chemical agents than
the adult brain. At present, due to the knowledge gap concerning hazard identification for human
neurodevelopmental toxicity (DNT), there is an urgent need fortestingand subsequently regulation of chemicals
for their potential to interfere with the developing nervous system. Primary human neural progenitor cells
(hNPC) cultivated as three-dimensional floating spheres are able to represent several key processes during brain
development. In the NPC1 assay, hNPC are exposed to the test compound for 72 hours. Cell proliferation is
assessed as an increase in sphere area using automated phase contrast imaging and as incorporation of
Bromodeoxyuridine (Brdll) using a luminescence-based cell proliferation ELISA. In parallel, the cell viability is
assessed using an alamar blue viability assay. In the neural progenitor cell migration and differentiation assay
(NPC2-5), hNPC are plated on an extracellular matrix, and migrate and differentiate out of the sphere core.
Thereby the processes radial glia migration, migration of neurons and oligodendrocytes as well as differentiation
into neurons and oligodendrocytes can be studied in combination with general cell viability and cytotoxicity.
Cell migration and differentiation are critical process during brain development that, if disturbed lead to
alterations in brain development and causes cognitive dysfunction. Currently these processes are assessed in
the OECD TG426 by neuropathological evaluation of certain brain regions as well as neurobehavioral tests.

2.2	Scientific Principles: Primary hNPC are isolated from the fetal brain and represent the process cell migration
and differentiation into neurons, oligodendrocytes within this test method. The test method predicts the hazard
to induce developmental neurotoxicity in the form of neurophysiological and functional changes in the
developing nervous system. Differentiating hNPC are a multicellular system consisting of radial glia, neurons
(1.5-16 %), oligodendrocytes (1.5-11%) and astrocytes in the migration area. The measurement of cytotoxicity
and viability therefore always represents all cells within the migration area. Because of the higher percentage
of GFAP positive radial glia and astrocytes, these two cell types are overrepresented in the assessment of
cytotoxicity and viability. The measure of cell viability strongly depends on the number of cells in the migration
area. Therefore, a reduction in cell number either as lower migration distance or as less dense migration area
will lead to a reduction in cell viability (Figure 3 in Nimtz, Klose, Masjosthusmann, Barenys, & Fritsche, 2019).
The cytotoxicity assay and the number of nuclei in the migration area can be analyzed to distinguish an effect
specifically on cell viability from a reduction in cell viability as secondary effect.

2.3	Experimental System: suspension hNPC primary cell used. Primary human neural progenitor cells (hNPC) are
provided as cryopreserved 3D neurospheres from Lonza, Verviers, Belgium. Material originates from the human
brain cortex of different gestational ages (GW16-19). Sex is either specified or determined before the cells are
used. Grown in suspension culture and under proliferative conditions (proliferation media and growth factors)
these cells represent neural progenitor cell proliferation. 1x106 hNPC are obtained from Lonza (#PT-2599) and
expanded. Lonza provides the cells with a viability of at least 20% FACS analysis indicates proliferating
neurospheres positive for nestin and Ki67. Following differentiation on a poly D-lysin/laminin matrix in the
absence of growth factors, the cells test positive for TUBB3, GFAP, Nestin and 04 (Baumann et al., 2015;
Schmuck et al., 2017). Within the first three days after thawing, 100 uL of spheres (at least 20) are plated on
poly Dlysin/laminin matrix in an 8-chamber slide with 500 uL N2 Media. On day one after plating greater than
80% of spheres need to be differentiated for the cell to be used in experiments. Differentiation towards the
final test system: Cells are frozen in liquid nitrogen and after thawing have to be cultivated in proliferation
medium at 37C and 5 % C02. The medium contains Dulbecco's modified Eagle medium and Hams F12 (3:1)
supplemented with 2% B27, 20 ng/mL EGF, 20 ng/mL recombinant human FGF, 1% penicillin and streptomycin.
The thawing is performed by repeated addition and removal of proliferation medium until all cells are
transferred into suspension in a tissue culture flask. The cells are carefully resuspended and distributed to 10
cm petri dishes filled with fresh, prewarmed proliferation medium. The cells are fed by replacing half the
medium with new medium every two to three days. At each feeding day the culture is checked for impurities
which are removed into a new petri dish. From this dish mistakenly, sorted spheres can be rescued and placed
back in the original culture dish. After 3-4 weeks neurospheres reach the acceptable size of 0.2 - 0.5 mm for
passaging by mechanical dissociation. Thereby, neurospheres are cut into small pieces (0.15 - 0.25 mm;
depending on the desired sphere size), which round up again to uniform sized neurospheres within 1 day in
proliferation medium. By using this method neurospheres are expanded every week. Starting at week 2 poly
hema coated dishes are used for the cultivation procedure. Critical consumable: the cultivation medium does

-------
not contain serum or serum replacement. The use of epidermal growth factor (EGF) and recombinant human
fibroblast growth factor (FGF) is critical for sphere growth. FGF contains 1% bovine serum albumin and is thus
prone to batch effects. Critical handling: The thawing media contains DMSO in a concentration that affects cell
health which is why thawed cells should quickly be diluted in proliferation media (30 mL of media for one vial
of cells). It is recommended to add FGF into medium directly before thawing. At the end of week two of the
expansion period (see below), the spheres should be transferred to petri dishes coated with poly hema to
prevent cell attachment, attached cells that are not differentiated can be detached using a 1000 uL pipet. To
avoid repeated attachment, all cells should be transferred to a new poly hema coated petri dish. Medium that
contains FGF should not be stored longer than 1 week at 4C. During the first two weeks, medium should be
removed using a 1000 uL pipet to keep the accidental removal of small spheres to a minimum. The neurospheres
should be well distributed in the petri dish to circumvent aggregation, which is especially important after
mechanical dissociations.

2.4	Metabolic Competence: Primary hNPC under proliferating and differentiating conditions do not express CYP1A1
and CYP1B1 (Gassmann et al., 2010). Primary hNPC during differentiation, have the capacity to up-regulate
glutathione-dependent protective strategies upon reactive oxygen species (ROS) exposure Masjosthusmann et
al, 2019). Gene expression levels of genes involved in the antioxidative defense (glutathione peroxidase 1
(GPX1), superoxide dismutase 1 (SOD1), catalase (CAT)) were comparable between the in vitro system and
developing human brains in vivo and show similar expression levels (Masjosthusmann et al., 2019). Other
metabolic pathways are not characterized.

2.5	Exposure Regime: Culture protocol: After the cell expansion period the cells are cultured for up to four weeks
in which they are passaged every week as described in 3.6. Between one to three days after passaging spheres
at a size of 0.3 mm are picked for the test method. For the assessment of neural progenitor cell migration and
differentiation, the spheres are plated on poly-D-lysine/laminin coated 96-well flat bottom plates in
differentiation medium (N2) to initiate migration and differentiation. Thereby one 0.3 mm big sphere is plated
in the middle of a well. The differentiation medium consists of DMEM and Ham's F12 at a ratio of 3 to 1
supplemented with 1% N2 and 1% penicillin and streptomycin. Within 5 days NPCs radially migrate out of the
sphere core and differentiate into radial glia cells (nestin positive), neurons (TUBB3 positive), oligodendrocytes
(04 positive) and astrocytes (GFAP positive). Cultivation during the test method is performed at 37C and 5%
C02 at a ph of 7.2-7.6. Exposure: 0.3 mm big hNPC are plated in the already prepared test conditions. Exposure
starts at day 0 of differentiation and is continued over five days of differentiation until the experiment is
terminated. Cells are fed with fresh medium at day 3 of differentiation. Therefore, half of the test condition
solution (e.g. solvent control or compound dilution) is replaced by freshly prepared test condition solution. Test
Method: Migration distance and cytotoxicity is determined after 72 hours based on brightfield images of each
well. The assay is terminated by the assessment of cell viability and cytotoxicity as well as cell fixation after 120
h. Immunocytochemistry is performed for Hoechst positive nuclei, TUBB3 positive neurons and 04 positive
oligodendrocytes. On the ICC images, migration after 120 hours, neuronal and oligodendrocyte number,
neuronal morphology and neuron/oligodendrocyte specific migration is assessed, immunocytochemical staining
(ICC) generates an image for each sphere. The plates are scanned using an automated high content imaging
device and all nuclei and their positions are determined automatically based on their intensity and size. Images
are imported to the Omnisphero software to run the image analysis that measures the following endpoints.
The method is set up for 8 test conditions including 7 compound concentrations and one SC. The test conditions
are prepared in a serial dilution from the stock solution. Stock solutions are prepared by diluting the compound
in the solvent (e.g. DMSO) in a concentration that allows the preparation of the highest test concentration
without exceeding the highest acceptable solvent concentration. For DMSO the highest acceptable solvent
concentration is 0.1% which means that the stock concentration needs to be at least lOOOx higher than the
highest test concentration. Stock solutions in non-sterile solvents (e.g. water or PBS) have to be sterile filtrated
using a sterile syringe filter (0.2 um). Stock solutions are aliquoted and stored at -20C. A stock solution is not
thawed more than three times. For the preparation of the test condition the stock solution is diluted to the
highest test concentration (default 1:1000) in differentiation media. All following dilutions are prepared by serial
dilution of the highest concentration in differentiation media with solvent (in the concentration of the highest
test concentration). The default serial dilution is 1:3 which covers a concentration range from e.g. 20 uM to 27

-------
nM (729 fold). Depending on the desired concentration range the dilution can be changed to 1:2, 1:5, 1:10 or
other. The SC is prepared by adding the solvent to differentiation media in the same concentration as the highest
test concentration. 100 uLof the compound dilutions and the SC are added to a 96 well plate. The serial dilution
can also be prepared directly in the 96 well plates. hNPC are added to each well after a 15 to 30 min equilibration
period at 37C and 5 % C02.

Baseline median absolute deviation for the assay (bmad): 1.887
Response cutoff threshold used to determine hit calls: 10
Detection technology used: membrane integrity assay (Fluorescence)

2.6	Response: All endpoints are generated from the same experimental run and from each well/sphere in the 96
well plate. Primary DNT specific endpoints of the test method are: migration distance radial glia at 72 hours
(NPC2a), migration distance radial glia at 120 hours (NPC2a), migration distance neurons at 120 hours (NPC2b),
migration distance oligodendrocytes at 120 hours (NPC2c), neuronal differentiation at 120 hours (NPC3),
neurite length at 120 hours (NPC4), neurite area at 120 hours (NPC4), and oligodendrocyte differentiation 120
hours (NPC5). Secondary endpoints are: cell number at 120 hours (which is used for normalization of neuronal
and oligodendrocyte differentiation), cytotoxicity at 72 hours, cytotoxicity at 120 hours, and viability at 120
hours. Cytotoxicity at 72/120 hours is assessed as membrane integrity by measuring the LDH dependent
reduction of resazurin to resorufin in the supernatant of each well, as fluorescence signal (relative fluorescence
unit) in a multi plate reader after 72/120 h of differentiation and compound treatment. 0.2 % Triton X-100 is
used as positive control for cell viability and cytotoxicity as it causes cell lysis and therefor a maximal response
for both endpoints. This positive control is run on each experimental plate.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.143263374485596 nM
Key positive control:
lysed cells

Target (nominal) number of replicates:

17

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO

-------
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The raw data format is different depending on the endpoints. For all endpoints assessed in a
multiplate reader (viability and cytotoxicity) the raw data format are excel files containing values (one for each
endpoint, timepoint and well) measured as relative fluorescence units. These values are transferred from the
original excel file into the AXES sheet. The original excel output files is saved for traceability of the data. The
migration distance of radial glia after 72 h which is measured manual in ImageJ is directly copied into the AXES
sheet as value in nm. Original brightfield images are archived for 10 years. All other raw data is computed from
the ICC images in the Omnisphero software and is exported and saved as one csv file. From there the values are
again transferred to the AXES sheets. The following data is exported from Omnisphero: number of all cells in
migration area, number of neurons all neurons in migration area, number of all oligodendrocytes in migration
area, migration distance radial glia (nm), mean migration distance all neurons (nm), mean migration distance
all oligodendrocytes (nm), neurite length (nm), and neurite area (pixel). All original ICC images are archived for
10 years. If not otherwise stated, all data processing steps are performed in a R based evaluation tool that was
designed for data processing, curve fitting and point of departure evaluation of in vitro concentration response
toxicity data. Data processing describes all processing steps of raw data that are necessary to obtain the final
response values including the normalization, curve fitting and benchmark concentration calculation.
Cytotoxicity assessment involves no pre-processing. The following normalization is used to normalize each
response value is normalized using the median lysis control and median solvent control. Normalized response =
lysis control - response / lysis control - solvent control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.), 6: resp.multnegl (Multiply the
normalized response value (resp) by -1; -l*resp.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

21: bmad2 (Add a cutoff value of 2 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.), 22: pclO (Add a cutoff
value of 10. Typically for percent of control data.), 27: ow_bidirectional_loss (Multiply winning model
hitcall (hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where
only negative responses are biologically relevant.)

-------
Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 238	Number of chemicals tested: 223

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
21

Inactive hit count: 0
-------
exponentials (exp5) model:

18

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

NA%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

100

0

-------
Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - rimed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	Inf

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 18.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.

5.1

Potential Regulatory Applications

Context of Use: Examples of end use scenarios could include, but are not limited to:

-------
•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Baumann J, Gassmann K, Masjosthusmann S, DeBoer D, Bendt F, Giersiefer S, Fritsche E.
Comparative human and rat neurospheres reveal species differences in chemical effects on
neurodevelopmental key events. Arch Toxicol. 2016 Jun;90(6):1415-27. doi: 10.1007/s00204-015-1568-8. Epub
2015 Jul 28. PMID: 26216354.

7.	Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2956

IU F_N PC2-5_cytotoxicity_120h r

1.	General Information

1.1	Assay Title: Cytotoxicity Assessment at 120 hours in the Leibniz Research Institute for Environmental Medicine
(IUF) Neural Progenitor Cell Migration and Differentiation Assay (NPC2-5)

1.2	Assay Summary: IUF_NPC2-5 is a cell-based, multiplexed assay that uses hNPC, a human primary neural
progenitor cells, with measurements taken at 72 or 120 hours after chemical dosing in a 96-well plate. Human
neural progenitor cells (hNPC) are generated from human fetal brain cortex at gestational week (GW) 16-19. In
the neural progenitor cell migration and differentiation assay (NPC2-5), hNPCa€™s grown as spheres are plated
on an extracellular matrix and migrate and differentiate out of the sphere core. The processes radial glia
migration, neuronal and oligodendrocyte migration as well as neuronal differentiation, neuronal morphology
and oligodendrocyte differentiation are studied using automated phase contrast imaging and automated
fluorescence imaging in combination with high content image analysis. In parallel, the cell viability is assessed
using an alamar blue viability assay and the cytotoxicity using a lactate dehydrogenase dependent membrane
integrity assay. IUF_NPC2-5_cytotoxicity_120hr is one of 12 assay components measured from the IUF_NPC2-5
assay. It is designed to measure cytotoxicity at 120hr due to loss of membrane integrity. LDH measurements are
performed from medium supernatants of each well and are based on the reduction of resazurin to resorufin
measured as fluorescence signal (relative fluorescence unit) in a multiplate reader. Data from the IUF_NPC2-
5_cytotoxicity_120hr component was analyzed at the endpoint IUF_NPC2-5_cytotoxicity_120hr in the positive
analysis fitting direction relative to the dynamic range and baseline of activity. Using a type of viability reporter,
gain-of-signal activity can be used to understand cytotoxicity effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Leibniz Research Institute for Environmental Medicine (IUF) is a German research institution in
Diisseldorf. The IUF research mission is the molecular prevention of environmentally induced health disorders.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: For the source cells, Lonza holds donor consent and legal authorization that provides
permission for all research use.

1.9	Assay Throughput: 96-well plate. The methods described are set up in a 96 well plate format with automated
image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates, compound
dilutions, feeding, cell viability and cytotoxicity assay can be automated using a liquid handling system. In the
fully automated set up 10 plates with 8 conditions and 5 replicates per condition can be run in one week. This
results in the generation of 400 data points for each endpoint within one week (excluding all controls). The
throughput is therefore estimated as medium to high.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in enzymatic activity (increase in lactate dehydrogenase) are indicative of compromised
cell health. Reductions in the total LDH (in cells), indicates cell loss or death.

-------
The human developing central nervous system is more vulnerable to the adverse effects of chemical agents than
the adult brain. At present, due to the knowledge gap concerning hazard identification for human
neurodevelopmental toxicity (DNT), there is an urgent need fortestingand subsequently regulation of chemicals
for their potential to interfere with the developing nervous system. Primary human neural progenitor cells
(hNPC) cultivated as three-dimensional floating spheres are able to represent several key processes during brain
development. In the NPC1 assay, hNPC are exposed to the test compound for 72 hours. Cell proliferation is
assessed as an increase in sphere area using automated phase contrast imaging and as incorporation of
Bromodeoxyuridine (Brdll) using a luminescence-based cell proliferation ELISA. In parallel, the cell viability is
assessed using an alamar blue viability assay. In the neural progenitor cell migration and differentiation assay
(NPC2-5), hNPC are plated on an extracellular matrix, and migrate and differentiate out of the sphere core.
Thereby the processes radial glia migration, migration of neurons and oligodendrocytes as well as differentiation
into neurons and oligodendrocytes can be studied in combination with general cell viability and cytotoxicity.
Cell migration and differentiation are critical process during brain development that, if disturbed lead to
alterations in brain development and causes cognitive dysfunction. Currently these processes are assessed in
the OECD TG426 by neuropathological evaluation of certain brain regions as well as neurobehavioral tests.

2.2	Scientific Principles: Primary hNPC are isolated from the fetal brain and represent the process cell migration
and differentiation into neurons, oligodendrocytes within this test method. The test method predicts the hazard
to induce developmental neurotoxicity in the form of neurophysiological and functional changes in the
developing nervous system. Differentiating hNPC are a multicellular system consisting of radial glia, neurons
(1.5-16 %), oligodendrocytes (1.5-11%) and astrocytes in the migration area. The measurement of cytotoxicity
and viability therefore always represents all cells within the migration area. Because of the higher percentage
of GFAP positive radial glia and astrocytes, these two cell types are overrepresented in the assessment of
cytotoxicity and viability. The measure of cell viability strongly depends on the number of cells in the migration
area. Therefore, a reduction in cell number either as lower migration distance or as less dense migration area
will lead to a reduction in cell viability (Figure 3 in Nimtz, Klose, Masjosthusmann, Barenys, & Fritsche, 2019).
The cytotoxicity assay and the number of nuclei in the migration area can be analyzed to distinguish an effect
specifically on cell viability from a reduction in cell viability as secondary effect.

2.3	Experimental System: suspension hNPC primary cell used. Primary human neural progenitor cells (hNPC) are
provided as cryopreserved 3D neurospheres from Lonza, Verviers, Belgium. Material originates from the human
brain cortex of different gestational ages (GW16-19). Sex is either specified or determined before the cells are
used. Grown in suspension culture and under proliferative conditions (proliferation media and growth factors)
these cells represent neural progenitor cell proliferation. 1x106 hNPC are obtained from Lonza (#PT-2599) and
expanded. Lonza provides the cells with a viability of at least 20% FACS analysis indicates proliferating
neurospheres positive for nestin and Ki67. Following differentiation on a poly D-lysin/laminin matrix in the
absence of growth factors, the cells test positive for TUBB3, GFAP, Nestin and 04 (Baumann et al., 2015;
Schmuck et al., 2017). Within the first three days after thawing, 100 uL of spheres (at least 20) are plated on
poly Dlysin/laminin matrix in an 8-chamber slide with 500 uL N2 Media. On day one after plating greater than
80% of spheres need to be differentiated for the cell to be used in experiments. Differentiation towards the
final test system: Cells are frozen in liquid nitrogen and after thawing have to be cultivated in proliferation
medium at 37C and 5 % C02. The medium contains Dulbecco's modified Eagle medium and Hams F12 (3:1)
supplemented with 2% B27, 20 ng/mL EGF, 20 ng/mL recombinant human FGF, 1% penicillin and streptomycin.
The thawing is performed by repeated addition and removal of proliferation medium until all cells are
transferred into suspension in a tissue culture flask. The cells are carefully resuspended and distributed to 10
cm petri dishes filled with fresh, prewarmed proliferation medium. The cells are fed by replacing half the
medium with new medium every two to three days. At each feeding day the culture is checked for impurities
which are removed into a new petri dish. From this dish mistakenly, sorted spheres can be rescued and placed
back in the original culture dish. After 3-4 weeks neurospheres reach the acceptable size of 0.2 - 0.5 mm for
passaging by mechanical dissociation. Thereby, neurospheres are cut into small pieces (0.15 - 0.25 mm;
depending on the desired sphere size), which round up again to uniform sized neurospheres within 1 day in
proliferation medium. By using this method neurospheres are expanded every week. Starting at week 2 poly
hema coated dishes are used for the cultivation procedure. Critical consumable: the cultivation medium does

-------
not contain serum or serum replacement. The use of epidermal growth factor (EGF) and recombinant human
fibroblast growth factor (FGF) is critical for sphere growth. FGF contains 1% bovine serum albumin and is thus
prone to batch effects. Critical handling: The thawing media contains DMSO in a concentration that affects cell
health which is why thawed cells should quickly be diluted in proliferation media (30 mL of media for one vial
of cells). It is recommended to add FGF into medium directly before thawing. At the end of week two of the
expansion period (see below), the spheres should be transferred to petri dishes coated with poly hema to
prevent cell attachment, attached cells that are not differentiated can be detached using a 1000 uL pipet. To
avoid repeated attachment, all cells should be transferred to a new poly hema coated petri dish. Medium that
contains FGF should not be stored longer than 1 week at 4C. During the first two weeks, medium should be
removed using a 1000 uL pipet to keep the accidental removal of small spheres to a minimum. The neurospheres
should be well distributed in the petri dish to circumvent aggregation, which is especially important after
mechanical dissociations.

2.4	Metabolic Competence: Primary hNPC under proliferating and differentiating conditions do not express CYP1A1
and CYP1B1 (Gassmann et al., 2010). Primary hNPC during differentiation, have the capacity to up-regulate
glutathione-dependent protective strategies upon reactive oxygen species (ROS) exposure Masjosthusmann et
al, 2019). Gene expression levels of genes involved in the antioxidative defense (glutathione peroxidase 1
(GPX1), superoxide dismutase 1 (SOD1), catalase (CAT)) were comparable between the in vitro system and
developing human brains in vivo and show similar expression levels (Masjosthusmann et al., 2019). Other
metabolic pathways are not characterized.

2.5	Exposure Regime: Culture protocol: After the cell expansion period the cells are cultured for up to four weeks
in which they are passaged every week as described in 3.6. Between one to three days after passaging spheres
at a size of 0.3 mm are picked for the test method. For the assessment of neural progenitor cell migration and
differentiation, the spheres are plated on poly-D-lysine/laminin coated 96-well flat bottom plates in
differentiation medium (N2) to initiate migration and differentiation. Thereby one 0.3 mm big sphere is plated
in the middle of a well. The differentiation medium consists of DMEM and Ham's F12 at a ratio of 3 to 1
supplemented with 1% N2 and 1% penicillin and streptomycin. Within 5 days NPCs radially migrate out of the
sphere core and differentiate into radial glia cells (nestin positive), neurons (TUBB3 positive), oligodendrocytes
(04 positive) and astrocytes (GFAP positive). Cultivation during the test method is performed at 37C and 5%
C02 at a ph of 7.2-7.6. Exposure: 0.3 mm big hNPC are plated in the already prepared test conditions. Exposure
starts at day 0 of differentiation and is continued over five days of differentiation until the experiment is
terminated. Cells are fed with fresh medium at day 3 of differentiation. Therefore, half of the test condition
solution (e.g. solvent control or compound dilution) is replaced by freshly prepared test condition solution. Test
Method: Migration distance and cytotoxicity is determined after 72 hours based on brightfield images of each
well. The assay is terminated by the assessment of cell viability and cytotoxicity as well as cell fixation after 120
h. Immunocytochemistry is performed for Hoechst positive nuclei, TUBB3 positive neurons and 04 positive
oligodendrocytes. On the ICC images, migration after 120 hours, neuronal and oligodendrocyte number,
neuronal morphology and neuron/oligodendrocyte specific migration is assessed, immunocytochemical staining
(ICC) generates an image for each sphere. The plates are scanned using an automated high content imaging
device and all nuclei and their positions are determined automatically based on their intensity and size. Images
are imported to the Omnisphero software to run the image analysis that measures the following endpoints.
The method is set up for 8 test conditions including 7 compound concentrations and one SC. The test conditions
are prepared in a serial dilution from the stock solution. Stock solutions are prepared by diluting the compound
in the solvent (e.g. DMSO) in a concentration that allows the preparation of the highest test concentration
without exceeding the highest acceptable solvent concentration. For DMSO the highest acceptable solvent
concentration is 0.1% which means that the stock concentration needs to be at least lOOOx higher than the
highest test concentration. Stock solutions in non-sterile solvents (e.g. water or PBS) have to be sterile filtrated
using a sterile syringe filter (0.2 um). Stock solutions are aliquoted and stored at -20C. A stock solution is not
thawed more than three times. For the preparation of the test condition the stock solution is diluted to the
highest test concentration (default 1:1000) in differentiation media. All following dilutions are prepared by serial
dilution of the highest concentration in differentiation media with solvent (in the concentration of the highest
test concentration). The default serial dilution is 1:3 which covers a concentration range from e.g. 20 uM to 27

-------
nM (729 fold). Depending on the desired concentration range the dilution can be changed to 1:2, 1:5, 1:10 or
other. The SC is prepared by adding the solvent to differentiation media in the same concentration as the highest
test concentration. 100 uLof the compound dilutions and the SC are added to a 96 well plate. The serial dilution
can also be prepared directly in the 96 well plates. hNPC are added to each well after a 15 to 30 min equilibration
period at 37C and 5 % C02.

Baseline median absolute deviation for the assay (bmad): 2.721
Response cutoff threshold used to determine hit calls: 10
Detection technology used: membrane integrity assay (Fluorescence)

2.6	Response: All endpoints are generated from the same experimental run and from each well/sphere in the 96
well plate. Primary DNT specific endpoints of the test method are: migration distance radial glia at 72 hours
(NPC2a), migration distance radial glia at 120 hours (NPC2a), migration distance neurons at 120 hours (NPC2b),
migration distance oligodendrocytes at 120 hours (NPC2c), neuronal differentiation at 120 hours (NPC3),
neurite length at 120 hours (NPC4), neurite area at 120 hours (NPC4), and oligodendrocyte differentiation 120
hours (NPC5). Secondary endpoints are: cell number at 120 hours (which is used for normalization of neuronal
and oligodendrocyte differentiation), cytotoxicity at 72 hours, cytotoxicity at 120 hours, and viability at 120
hours. Cytotoxicity at 72/120 hours is assessed as membrane integrity by measuring the LDH dependent
reduction of resazurin to resorufin in the supernatant of each well, as fluorescence signal (relative fluorescence
unit) in a multi plate reader after 72/120 h of differentiation and compound treatment. 0.2 % Triton X-100 is
used as positive control for cell viability and cytotoxicity as it causes cell lysis and therefor a maximal response
for both endpoints. This positive control is run on each experimental plate.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.143263374485596 nM
Key positive control:
lysed cells

Target (nominal) number of replicates:

17

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO

-------
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The raw data format is different depending on the endpoints. For all endpoints assessed in a
multiplate reader (viability and cytotoxicity) the raw data format are excel files containing values (one for each
endpoint, timepoint and well) measured as relative fluorescence units. These values are transferred from the
original excel file into the AXES sheet. The original excel output files is saved for traceability of the data. The
migration distance of radial glia after 72 h which is measured manual in ImageJ is directly copied into the AXES
sheet as value in nm. Original brightfield images are archived for 10 years. All other raw data is computed from
the ICC images in the Omnisphero software and is exported and saved as one csv file. From there the values are
again transferred to the AXES sheets. The following data is exported from Omnisphero: number of all cells in
migration area, number of neurons all neurons in migration area, number of all oligodendrocytes in migration
area, migration distance radial glia (nm), mean migration distance all neurons (nm), mean migration distance
all oligodendrocytes (nm), neurite length (nm), and neurite area (pixel). All original ICC images are archived for
10 years. If not otherwise stated, all data processing steps are performed in a R based evaluation tool that was
designed for data processing, curve fitting and point of departure evaluation of in vitro concentration response
toxicity data. Data processing describes all processing steps of raw data that are necessary to obtain the final
response values including the normalization, curve fitting and benchmark concentration calculation.
Cytotoxicity assessment involves no pre-processing. The following normalization is used to normalize each
response value is normalized using the median lysis control and median solvent control. Normalized response =
lysis control - response / lysis control - solvent control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.), 6: resp.multnegl (Multiply the
normalized response value (resp) by -1; -l*resp.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

21: bmad2 (Add a cutoff value of 2 multiplied by the baseline median absolute deviation (bmad). By
default, bmad is calculated using test compound wells (wilt = t) for the endpoint.), 22: pclO (Add a cutoff
value of 10. Typically for percent of control data.), 27: ow_bidirectional_loss (Multiply winning model
hitcall (hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where
only negative responses are biologically relevant.)

-------
Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 237	Number of chemicals tested: 222

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
39

Inactive hit count: 0
-------
exponentials (exp5) model:

16

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

NA%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

100

0

-------
Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - rimed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	Inf

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 16.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.

5.1

Potential Regulatory Applications

Context of Use: Examples of end use scenarios could include, but are not limited to:

-------
•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Baumann J, Gassmann K, Masjosthusmann S, DeBoer D, Bendt F, Giersiefer S, Fritsche E.
Comparative human and rat neurospheres reveal species differences in chemical effects on
neurodevelopmental key events. Arch Toxicol. 2016 Jun;90(6):1415-27. doi: 10.1007/s00204-015-1568-8. Epub
2015 Jul 28. PMID: 26216354.

7.	Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2958

IUF_NPC2-5_cell_number_120hr

1.	General Information

1.1	Assay Title: Cell Number Assessment at 120 hours in the Leibniz Research Institute for Environmental Medicine
(IUF) Neural Progenitor Cell Migration and Differentiation Assay (NPC2-5)

1.2	Assay Summary: IUF_NPC2-5 is a cell-based, multiplexed assay that uses hNPC, a human primary neural
progenitor cells, with measurements taken at 72 or 120 hours after chemical dosing in a 96-well plate. Human
neural progenitor cells (hNPC) are generated from human fetal brain cortex at gestational week (GW) 16-19. In
the neural progenitor cell migration and differentiation assay (NPC2-5), hNPCa€™s grown as spheres are plated
on an extracellular matrix and migrate and differentiate out of the sphere core. The processes radial glia
migration, neuronal and oligodendrocyte migration as well as neuronal differentiation, neuronal morphology
and oligodendrocyte differentiation are studied using automated phase contrast imaging and automated
fluorescence imaging in combination with high content image analysis. In parallel, the cell viability is assessed
using an alamar blue viability assay and the cytotoxicity using a lactate dehydrogenase dependent membrane
integrity assay. IUF_NPC2-5_cell number_120hr is one of 12 assay components measured from the IUF_NPC2-
5 assay. The cell number at 120 hr is assessed as the number of all Hoechst-positive nuclei identified in the
migration area (see IUF_NPC2a_ glia_migration_120hr). Data from the IUF_NPC2-5_cell_number_120hr
component was analyzed at the endpoint IUF_NPC2-5_cell_number_120hr in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of viability reporter,
gain or loss-of-signal activity can be used to understand viability effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Leibniz Research Institute for Environmental Medicine (IUF) is a German research institution in
Diisseldorf. The IUF research mission is the molecular prevention of environmentally induced health disorders.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: For the source cells, Lonza holds donor consent and legal authorization that provides
permission for all research use.

1.9	Assay Throughput: 96-well plate. The methods described are set up in a 96 well plate format with automated
image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates, compound
dilutions, feeding, cell viability and cytotoxicity assay can be automated using a liquid handling system. In the
fully automated set up 10 plates with 8 conditions and 5 replicates per condition can be run in one week. This
results in the generation of 400 data points for each endpoint within one week (excluding all controls). The
throughput is therefore estimated as medium to high.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in the number of Hoechst labelled nuclei are indicative of cell proliferation.

The human developing central nervous system is more vulnerable to the adverse effects of chemical agents than
the adult brain. At present, due to the knowledge gap concerning hazard identification for human

-------
neurodevelopmental toxicity (DNT), there is an urgent need fortestingand subsequently regulation of chemicals
for their potential to interfere with the developing nervous system. Primary human neural progenitor cells
(hNPC) cultivated as three-dimensional floating spheres are able to represent several key processes during brain
development. In the NPC1 assay, hNPC are exposed to the test compound for 72 hours. Cell proliferation is
assessed as an increase in sphere area using automated phase contrast imaging and as incorporation of
Bromodeoxyuridine (Brdll) using a luminescence-based cell proliferation ELISA. In parallel, the cell viability is
assessed using an alamar blue viability assay. In the neural progenitor cell migration and differentiation assay
(NPC2-5), hNPC are plated on an extracellular matrix, and migrate and differentiate out of the sphere core.
Thereby the processes radial glia migration, migration of neurons and oligodendrocytes as well as differentiation
into neurons and oligodendrocytes can be studied in combination with general cell viability and cytotoxicity.
Cell migration and differentiation are critical process during brain development that, if disturbed lead to
alterations in brain development and causes cognitive dysfunction. Currently these processes are assessed in
the OECD TG426 by neuropathological evaluation of certain brain regions as well as neurobehavioral tests.

2.2	Scientific Principles: Primary hNPC are isolated from the fetal brain and represent the process cell migration
and differentiation into neurons, oligodendrocytes within this test method. The test method predicts the hazard
to induce developmental neurotoxicity in the form of neurophysiological and functional changes in the
developing nervous system. Differentiating hNPC are a multicellular system consisting of radial glia, neurons
(1.5-16 %), oligodendrocytes (1.5-11%) and astrocytes in the migration area. The measurement of cytotoxicity
and viability therefore always represents all cells within the migration area. Because of the higher percentage
of GFAP positive radial glia and astrocytes, these two cell types are overrepresented in the assessment of
cytotoxicity and viability. The measure of cell viability strongly depends on the number of cells in the migration
area. Therefore, a reduction in cell number either as lower migration distance or as less dense migration area
will lead to a reduction in cell viability (Figure 3 in Nimtz, Klose, Masjosthusmann, Barenys, & Fritsche, 2019).
The cytotoxicity assay and the number of nuclei in the migration area can be analyzed to distinguish an effect
specifically on cell viability from a reduction in cell viability as secondary effect.

2.3	Experimental System: suspension hNPC primary cell used. Primary human neural progenitor cells (hNPC) are
provided as cryopreserved 3D neurospheres from Lonza, Verviers, Belgium. Material originates from the human
brain cortex of different gestational ages (GW16-19). Sex is either specified or determined before the cells are
used. Grown in suspension culture and under proliferative conditions (proliferation media and growth factors)
these cells represent neural progenitor cell proliferation. 1x106 hNPC are obtained from Lonza (#PT-2599) and
expanded. Lonza provides the cells with a viability of at least 20% FACS analysis indicates proliferating
neurospheres positive for nestin and Ki67. Following differentiation on a poly D-lysin/laminin matrix in the
absence of growth factors, the cells test positive for TUBB3, GFAP, Nestin and 04 (Baumann et al., 2015;
Schmuck et al., 2017). Within the first three days after thawing, 100 uL of spheres (at least 20) are plated on
poly Dlysin/laminin matrix in an 8-chamber slide with 500 uL N2 Media. On day one after plating greater than
80% of spheres need to be differentiated for the cell to be used in experiments. Differentiation towards the
final test system: Cells are frozen in liquid nitrogen and after thawing have to be cultivated in proliferation
medium at 37C and 5 % C02. The medium contains Dulbecco's modified Eagle medium and Hams F12 (3:1)
supplemented with 2% B27, 20 ng/mL EGF, 20 ng/mL recombinant human FGF, 1% penicillin and streptomycin.
The thawing is performed by repeated addition and removal of proliferation medium until all cells are
transferred into suspension in a tissue culture flask. The cells are carefully resuspended and distributed to 10
cm petri dishes filled with fresh, prewarmed proliferation medium. The cells are fed by replacing half the
medium with new medium every two to three days. At each feeding day the culture is checked for impurities
which are removed into a new petri dish. From this dish mistakenly, sorted spheres can be rescued and placed
back in the original culture dish. After 3-4 weeks neurospheres reach the acceptable size of 0.2 - 0.5 mm for
passaging by mechanical dissociation. Thereby, neurospheres are cut into small pieces (0.15 - 0.25 mm;
depending on the desired sphere size), which round up again to uniform sized neurospheres within 1 day in
proliferation medium. By using this method neurospheres are expanded every week. Starting at week 2 poly
hema coated dishes are used for the cultivation procedure. Critical consumable: the cultivation medium does
not contain serum or serum replacement. The use of epidermal growth factor (EGF) and recombinant human
fibroblast growth factor (FGF) is critical for sphere growth. FGF contains 1% bovine serum albumin and is thus

-------
prone to batch effects. Critical handling: The thawing media contains DMSO in a concentration that affects cell
health which is why thawed cells should quickly be diluted in proliferation media (30 mL of media for one vial
of cells). It is recommended to add FGF into medium directly before thawing. At the end of week two of the
expansion period (see below), the spheres should be transferred to petri dishes coated with poly hema to
prevent cell attachment, attached cells that are not differentiated can be detached using a 1000 uL pipet. To
avoid repeated attachment, all cells should be transferred to a new poly hema coated petri dish. Medium that
contains FGF should not be stored longer than 1 week at 4C. During the first two weeks, medium should be
removed using a 1000 uL pipet to keep the accidental removal of small spheres to a minimum. The neurospheres
should be well distributed in the petri dish to circumvent aggregation, which is especially important after
mechanical dissociations.

2.4	Metabolic Competence: Primary hNPC under proliferating and differentiating conditions do not express CYP1A1
and CYP1B1 (Gassmann et al., 2010). Primary hNPC during differentiation, have the capacity to up-regulate
glutathione-dependent protective strategies upon reactive oxygen species (ROS) exposure Masjosthusmann et
al, 2019). Gene expression levels of genes involved in the antioxidative defense (glutathione peroxidase 1
(GPX1), superoxide dismutase 1 (SOD1), catalase (CAT)) were comparable between the in vitro system and
developing human brains in vivo and show similar expression levels (Masjosthusmann et al., 2019). Other
metabolic pathways are not characterized.

2.5	Exposure Regime: Culture protocol: After the cell expansion period the cells are cultured for up to four weeks
in which they are passaged every week as described in 3.6. Between one to three days after passaging spheres
at a size of 0.3 mm are picked for the test method. For the assessment of neural progenitor cell migration and
differentiation, the spheres are plated on poly-D-lysine/laminin coated 96-well flat bottom plates in
differentiation medium (N2) to initiate migration and differentiation. Thereby one 0.3 mm big sphere is plated
in the middle of a well. The differentiation medium consists of DMEM and Ham's F12 at a ratio of 3 to 1
supplemented with 1% N2 and 1% penicillin and streptomycin. Within 5 days NPCs radially migrate out of the
sphere core and differentiate into radial glia cells (nestin positive), neurons (TUBB3 positive), oligodendrocytes
(04 positive) and astrocytes (GFAP positive). Cultivation during the test method is performed at 37C and 5%
C02 at a ph of 7.2-7.6. Exposure: 0.3 mm big hNPC are plated in the already prepared test conditions. Exposure
starts at day 0 of differentiation and is continued over five days of differentiation until the experiment is
terminated. Cells are fed with fresh medium at day 3 of differentiation. Therefore, half of the test condition
solution (e.g. solvent control or compound dilution) is replaced by freshly prepared test condition solution. Test
Method: Migration distance and cytotoxicity is determined after 72 hours based on brightfield images of each
well. The assay is terminated by the assessment of cell viability and cytotoxicity as well as cell fixation after 120
h. Immunocytochemistry is performed for Hoechst positive nuclei, TUBB3 positive neurons and 04 positive
oligodendrocytes. On the ICC images, migration after 120 hours, neuronal and oligodendrocyte number,
neuronal morphology and neuron/oligodendrocyte specific migration is assessed, immunocytochemical staining
(ICC) generates an image for each sphere. The plates are scanned using an automated high content imaging
device and all nuclei and their positions are determined automatically based on their intensity and size. Images
are imported to the Omnisphero software to run the image analysis that measures the following endpoints.
The method is set up for 8 test conditions including 7 compound concentrations and one SC. The test conditions
are prepared in a serial dilution from the stock solution. Stock solutions are prepared by diluting the compound
in the solvent (e.g. DMSO) in a concentration that allows the preparation of the highest test concentration
without exceeding the highest acceptable solvent concentration. For DMSO the highest acceptable solvent
concentration is 0.1% which means that the stock concentration needs to be at least lOOOx higher than the
highest test concentration. Stock solutions in non-sterile solvents (e.g. water or PBS) have to be sterile filtrated
using a sterile syringe filter (0.2 um). Stock solutions are aliquoted and stored at -20C. A stock solution is not
thawed more than three times. For the preparation of the test condition the stock solution is diluted to the
highest test concentration (default 1:1000) in differentiation media. All following dilutions are prepared by serial
dilution of the highest concentration in differentiation media with solvent (in the concentration of the highest
test concentration). The default serial dilution is 1:3 which covers a concentration range from e.g. 20 uM to 27
nM (729 fold). Depending on the desired concentration range the dilution can be changed to 1:2, 1:5, 1:10 or
other. The SC is prepared by adding the solvent to differentiation media in the same concentration as the highest

-------
test concentration. 100 uLof the compound dilutions and the SC are added to a 96 well plate. The serial dilution
can also be prepared directly in the 96 well plates. hNPC are added to each well after a 15 to 30 min equilibration
period at 37C and 5 % C02.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.143263374485596 nM
Key positive control:

EGF

Target (nominal) number of replicates:

17

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 11.804

Response cutoff threshold used to determine hit calls: 30

Detection technology used: membrane integrity assay (Fluorescence microscopy)

2.6	Response: All endpoints are generated from the same experimental run and from each well/sphere in the 96
well plate. Primary DNT specific endpoints of the test method are: migration distance radial glia at 72 hours
(NPC2a), migration distance radial glia at 120 hours (NPC2a), migration distance neurons at 120 hours (NPC2b),
migration distance oligodendrocytes at 120 hours (NPC2c), neuronal differentiation at 120 hours (NPC3),
neurite length at 120 hours (NPC4), neurite area at 120 hours (NPC4), and oligodendrocyte differentiation 120
hours (NPC5). Secondary endpoints are: cell number at 120 hours (which is used for normalization of neuronal
and oligodendrocyte differentiation), cytotoxicity at 72 hours, cytotoxicity at 120 hours, and viability at 120
hours. Migration distance radial glia 72 hours is assessed as migration distance in um from the edge of the
sphere core and the edge of the migration area based on brightfield images of each well. Each plate is scanned
after 72 hours in culture in an automated high content imaging device and the sphere size in four directions is
measured manually using ImageJ. All other primary endpoints are assessed based on an immunocytochemical
staining (ICC) image for each sphere. Cells are fixated after 120 hours in culture and an ICC staining with Hoechst
for nuclei, TUBB3 for neurons and 04 for oligodendrocytes is performed. The plates are scanned using an
automated high content imaging device and all nuclei and their positions are determined automatically based
on their intensity and size. Images are imported to the Omnisphero software to run the image analysis for the
following endpoints: Migration distance radial glia 120h is assessed as the migration distance in um between
the sphere core and the edge of the migration area based on ICC images of Hoechst positives nuclei. With the
identification of each nuclei positions around the sphere core, the migration distance can be calculated.
Therefore, a density distribution mask is calculated. By scanning the images of the nuclei channel, the algorithm
can determine relatively more and less dense image areas. By identifying the densest area in the image, the
sphere core can be detected. For identification of the migration, it is assumed, that the nuclei density decreases
with increasing distance to the sphere core. Once the density hits a pre-defined threshold, the outer boundaries
can be determined and the sphere itself can be mapped out in a polynomial bounding box. Derived from this
box, the actual size and migration distance can be calculated for each well. Cell number is the number of all
Hoechst positive nuclei detected in the area between the sphere core and the outer boundaries of the migration
area. 0.2 % Triton X-100 is used as positive control for cell viability and cytotoxicity as it causes cell lysis and
therefor a maximal response for both endpoints. This positive control is run on each experimental plate.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in

-------
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The raw data format is different depending on the endpoints. For all endpoints assessed in a
multiplate reader (viability and cytotoxicity) the raw data format are excel files containing values (one for each
endpoint, timepoint and well) measured as relative fluorescence units. These values are transferred from the
original excel file into the AXES sheet. The original excel output files is saved for traceability of the data. The
migration distance of radial glia after 72 h which is measured manual in ImageJ is directly copied into the AXES
sheet as value in nm. Original brightfield images are archived for 10 years. All other raw data is computed from
the ICC images in the Omnisphero software and is exported and saved as one csv file. From there the values are
again transferred to the AXES sheets. The following data is exported from Omnisphero: number of all cells in
migration area, number of neurons all neurons in migration area, number of all oligodendrocytes in migration
area, migration distance radial glia (nm), mean migration distance all neurons (nm), mean migration distance
all oligodendrocytes (nm), neurite length (nm), and neurite area (pixel). All original ICC images are archived for
10 years. If not otherwise stated, all data processing steps are performed in a R based evaluation tool that was
designed for data processing, curve fitting and point of departure evaluation of in vitro concentration response
toxicity data. Data processing describes all processing steps of raw data that are necessary to obtain the final
response values including the normalization, curve fitting and benchmark concentration calculation. Cell
number assessment involves no pre-processing. The data is normalized to the solvent control. For the
normalization to the solvent control, each replicate data point is normalized to the median of the solvent control
in the respective experiment.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.), 6: resp.multnegl (Multiply the
normalized response value (resp) by -1; -l*resp.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 21: bmad2 (Add a cutoff value
of 2 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using
test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 238	Number of chemicals tested: 223

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
44

194

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

21
16

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

28

50

quadratic-polynomialfpoly2) model: 33

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

6

2

54

28

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 28.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Baumann J, Gassmann K, Masjosthusmann S, DeBoer D, Bendt F, Giersiefer S, Fritsche E.
Comparative human and rat neurospheres reveal species differences in chemical effects on
neurodevelopmental key events. Arch Toxicol. 2016 Jun;90(6):1415-27. doi: 10.1007/s00204-015-1568-8. Epub
2015 Jul 28. PMID: 26216354.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2960

IU F-N PC2-5_viability_120hr

1.	General Information

1.1	Assay Title: Viability Assessment at 120 hours in the Leibniz Research Institute for Environmental Medicine (IUF)
Neural Progenitor Cell Migration and Differentiation Assay (NPC2-5)

1.2	Assay Summary: IUF_NPC2-5 is a cell-based, multiplexed assay that uses hNPC, a human primary neural
progenitor cells, with measurements taken at 72 or 120 hours after chemical dosing in a 96-well plate. Human
neural progenitor cells (hNPC) are generated from human fetal brain cortex at gestational week (GW) 16-19. In
the neural progenitor cell migration and differentiation assay (NPC2-5), hNPCa€™s grown as spheres are plated
on an extracellular matrix and migrate and differentiate out of the sphere core. The processes radial glia
migration, neuronal and oligodendrocyte migration as well as neuronal differentiation, neuronal morphology
and oligodendrocyte differentiation are studied using automated phase contrast imaging and automated
fluorescence imaging in combination with high content image analysis. In parallel, the cell viability is assessed
using an alamar blue viability assay and the cytotoxicity using a lactate dehydrogenase dependent membrane
integrity assay. IUF_NPC2-5_viability_120hr is one of 12 assay components measured from the IUF_NPC2-5
assay. It is designed to measure cell viability assessed as mitochondria-dependent reduction of resazurin to
resorufin using the alamar blue viability assay. The viability is measured as a fluorescence signal (relative
fluorescence unit) in a multiplate reader. Data from the IUF_NPC2-5_viability_120hr component was analyzed
at the endpoint IUF-NPC2-5_viability_120hr in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of viability reporter, gain or loss-of-signal activity can be
used to understand viability effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Leibniz Research Institute for Environmental Medicine (IUF) is a German research institution in
Diisseldorf. The IUF research mission is the molecular prevention of environmentally induced health disorders.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: For the source cells, Lonza holds donor consent and legal authorization that provides
permission for all research use.

1.9	Assay Throughput: 96-well plate. The methods described are set up in a 96 well plate format with automated
image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates, compound
dilutions, feeding, cell viability and cytotoxicity assay can be automated using a liquid handling system. In the
fully automated set up 10 plates with 8 conditions and 5 replicates per condition can be run in one week. This
results in the generation of 400 data points for each endpoint within one week (excluding all controls). The
throughput is therefore estimated as medium to high.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in enzymatic activity (mitochondrial activity) are correlated to cellular viability.

-------
The human developing central nervous system is more vulnerable to the adverse effects of chemical agents than
the adult brain. At present, due to the knowledge gap concerning hazard identification for human
neurodevelopmental toxicity (DNT), there is an urgent need fortestingand subsequently regulation of chemicals
for their potential to interfere with the developing nervous system. Primary human neural progenitor cells
(hNPC) cultivated as three dimensional floating spheres are able to represent several key processes during brain
development. In the NPC1 assay, hNPC are exposed to the test compound for 72 hours. Cell proliferation is
assessed as an increase in sphere area using automated phase contrast imaging and as incorporation of
Bromodeoxyuridine (Brdll) using a luminescence-based cell proliferation ELISA. In parallel, the cell viability is
assessed using an alamar blue viability assay. In the neural progenitor cell migration and differentiation assay
(NPC2-5), hNPC are plated on an extracellular matrix, and migrate and differentiate out of the sphere core.
Thereby the processes radial glia migration, migration of neurons and oligodendrocytes as well as differentiation
into neurons and oligodendrocytes can be studied in combination with general cell viability and cytotoxicity.
Cell migration and differentiation are critical process during brain development that, if disturbed lead to
alterations in brain development and causes cognitive dysfunction. Currently these processes are assessed in
the OECD TG426 by neuropathological evaluation of certain brain regions as well as neurobehavioral tests.

2.2	Scientific Principles: Primary hNPC are isolated from the fetal brain and represent the process cell migration
and differentiation into neurons, oligodendrocytes within this test method. The test method predicts the hazard
to induce developmental neurotoxicity in the form of neurophysiological and functional changes in the
developing nervous system. Differentiating hNPC are a multicellular system consisting of radial glia, neurons
(1.5-16 %), oligodendrocytes (1.5-11%) and astrocytes in the migration area. The measurement of cytotoxicity
and viability therefore always represents all cells within the migration area. Because of the higher percentage
of GFAP positive radial glia and astrocytes, these two cell types are overrepresented in the assessment of
cytotoxicity and viability. The measure of cell viability strongly depends on the number of cells in the migration
area. Therefore, a reduction in cell number either as lower migration distance or as less dense migration area
will lead to a reduction in cell viability (Figure 3 in Nimtz, Klose, Masjosthusmann, Barenys, & Fritsche, 2019).
The cytotoxicity assay and the number of nuclei in the migration area can be analyzed to distinguish an effect
specifically on cell viability from a reduction in cell viability as secondary effect.

2.3	Experimental System: suspension hNPC primary cell used. Primary human neural progenitor cells (hNPC) are
provided as cryopreserved 3D neurospheres from Lonza, Verviers, Belgium. Material originates from the human
brain cortex of different gestational ages (GW16-19). Sex is either specified or determined before the cells are
used. Grown in suspension culture and under proliferative conditions (proliferation media and growth factors)
these cells represent neural progenitor cell proliferation. 1x106 hNPC are obtained from Lonza (#PT-2599) and
expanded. Lonza provides the cells with a viability of at least 20% FACS analysis indicates proliferating
neurospheres positive for nestin and Ki67. Following differentiation on a poly D-lysin/laminin matrix in the
absence of growth factors, the cells test positive for TUBB3, GFAP, Nestin and 04 (Baumann et al., 2015;
Schmuck et al., 2017). Within the first three days after thawing, 100 uL of spheres (at least 20) are plated on
poly Dlysin/laminin matrix in an 8-chamber slide with 500 uL N2 Media. On day one after plating greater than
80% of spheres need to be differentiated for the cell to be used in experiments. Differentiation towards the
final test system: Cells are frozen in liquid nitrogen and after thawing have to be cultivated in proliferation
medium at 37C and 5 % C02. The medium contains Dulbecco's modified Eagle medium and Hams F12 (3:1)
supplemented with 2% B27, 20 ng/mL EGF, 20 ng/mL recombinant human FGF, 1% penicillin and streptomycin.
The thawing is performed by repeated addition and removal of proliferation medium until all cells are
transferred into suspension in a tissue culture flask. The cells are carefully resuspended and distributed to 10
cm petri dishes filled with fresh, prewarmed proliferation medium. The cells are fed by replacing half the
medium with new medium every two to three days. At each feeding day the culture is checked for impurities
which are removed into a new petri dish. From this dish mistakenly, sorted spheres can be rescued and placed
back in the original culture dish. After 3-4 weeks neurospheres reach the acceptable size of 0.2 - 0.5 mm for
passaging by mechanical dissociation. Thereby, neurospheres are cut into small pieces (0.15 - 0.25 mm;
depending on the desired sphere size), which round up again to uniform sized neurospheres within 1 day in
proliferation medium. By using this method neurospheres are expanded every week. Starting at week 2 poly
hema coated dishes are used for the cultivation procedure. Critical consumable: the cultivation medium does

-------
not contain serum or serum replacement. The use of epidermal growth factor (EGF) and recombinant human
fibroblast growth factor (FGF) is critical for sphere growth. FGF contains 1% bovine serum albumin and is thus
prone to batch effects. Critical handling: The thawing media contains DMSO in a concentration that affects cell
health which is why thawed cells should quickly be diluted in proliferation media (30 mL of media for one vial
of cells). It is recommended to add FGF into medium directly before thawing. At the end of week two of the
expansion period (see below), the spheres should be transferred to petri dishes coated with poly hema to
prevent cell attachment, attached cells that are not differentiated can be detached using a 1000 uL pipet. To
avoid repeated attachment, all cells should be transferred to a new poly hema coated petri dish. Medium that
contains FGF should not be stored longer than 1 week at 4C. During the first two weeks, medium should be
removed using a 1000 uL pipet to keep the accidental removal of small spheres to a minimum. The neurospheres
should be well distributed in the petri dish to circumvent aggregation, which is especially important after
mechanical dissociations.

2.4	Metabolic Competence: Primary hNPC under proliferating and differentiating conditions do not express CYP1A1
and CYP1B1 (Gassmann et al., 2010). Primary hNPC during differentiation, have the capacity to up-regulate
glutathione-dependent protective strategies upon reactive oxygen species (ROS) exposure Masjosthusmann et
al, 2019). Gene expression levels of genes involved in the antioxidative defense (glutathione peroxidase 1
(GPX1), superoxide dismutase 1 (SOD1), catalase (CAT)) were comparable between the in vitro system and
developing human brains in vivo and show similar expression levels (Masjosthusmann et al., 2019). Other
metabolic pathways are not characterized.

2.5	Exposure Regime: Culture protocol: After the cell expansion period the cells are cultured for up to four weeks
in which they are passaged every week as described in 3.6. Between one to three days after passaging spheres
at a size of 0.3 mm are picked for the test method. For the assessment of neural progenitor cell migration and
differentiation, the spheres are plated on poly-D-lysine/laminin coated 96-well flat bottom plates in
differentiation medium (N2) to initiate migration and differentiation. Thereby one 0.3 mm big sphere is plated
in the middle of a well. The differentiation medium consists of DMEM and Ham's F12 at a ratio of 3 to 1
supplemented with 1% N2 and 1% penicillin and streptomycin. Within 5 days NPCs radially migrate out of the
sphere core and differentiate into radial glia cells (nestin positive), neurons (TUBB3 positive), oligodendrocytes
(04 positive) and astrocytes (GFAP positive). Cultivation during the test method is performed at 37C and 5%
C02 at a ph of 7.2-7.6. Exposure: 0.3 mm big hNPC are plated in the already prepared test conditions. Exposure
starts at day 0 of differentiation and is continued over five days of differentiation until the experiment is
terminated. Cells are fed with fresh medium at day 3 of differentiation. Therefore, half of the test condition
solution (e.g. solvent control or compound dilution) is replaced by freshly prepared test condition solution. Test
Method: Migration distance and cytotoxicity is determined after 72 hours based on brightfield images of each
well. The assay is terminated by the assessment of cell viability and cytotoxicity as well as cell fixation after 120
h. Immunocytochemistry is performed for Hoechst positive nuclei, TUBB3 positive neurons and 04 positive
oligodendrocytes. On the ICC images, migration after 120 hours, neuronal and oligodendrocyte number,
neuronal morphology and neuron/oligodendrocyte specific migration is assessed, immunocytochemical staining
(ICC) generates an image for each sphere. The plates are scanned using an automated high content imaging
device and all nuclei and their positions are determined automatically based on their intensity and size. Images
are imported to the Omnisphero software to run the image analysis that measures the following endpoints.
The method is set up for 8 test conditions including 7 compound concentrations and one SC. The test conditions
are prepared in a serial dilution from the stock solution. Stock solutions are prepared by diluting the compound
in the solvent (e.g. DMSO) in a concentration that allows the preparation of the highest test concentration
without exceeding the highest acceptable solvent concentration. For DMSO the highest acceptable solvent
concentration is 0.1% which means that the stock concentration needs to be at least lOOOx higher than the
highest test concentration. Stock solutions in non-sterile solvents (e.g. water or PBS) have to be sterile filtrated
using a sterile syringe filter (0.2 um). Stock solutions are aliquoted and stored at -20C. A stock solution is not
thawed more than three times. For the preparation of the test condition the stock solution is diluted to the
highest test concentration (default 1:1000) in differentiation media. All following dilutions are prepared by serial
dilution of the highest concentration in differentiation media with solvent (in the concentration of the highest
test concentration). The default serial dilution is 1:3 which covers a concentration range from e.g. 20 uM to 27

-------
nM (729 fold). Depending on the desired concentration range the dilution can be changed to 1:2, 1:5, 1:10 or
other. The SC is prepared by adding the solvent to differentiation media in the same concentration as the highest
test concentration. 100 uLof the compound dilutions and the SC are added to a 96 well plate. The serial dilution
can also be prepared directly in the 96 well plates. hNPC are added to each well after a 15 to 30 min equilibration
period at 37C and 5 % C02.

Baseline median absolute deviation for the assay (bmad): 7.869
Response cutoff threshold used to determine hit calls: 30
Detection technology used: Alamar blue viability assay (Fluorescence)

2.6	Response: All endpoints are generated from the same experimental run and from each well/sphere in the 96
well plate. Primary DNT specific endpoints of the test method are: migration distance radial glia at 72 hours
(NPC2a), migration distance radial glia at 120 hours (NPC2a), migration distance neurons at 120 hours (NPC2b),
migration distance oligodendrocytes at 120 hours (NPC2c), neuronal differentiation at 120 hours (NPC3),
neurite length at 120 hours (NPC4), neurite area at 120 hours (NPC4), and oligodendrocyte differentiation 120
hours (NPC5). Secondary endpoints are: cell number at 120 hours (which is used for normalization of neuronal
and oligodendrocyte differentiation), cytotoxicity at 72 hours, cytotoxicity at 120 hours, and viability at 120
hours. Viability at 120 hours is assessed as mitochondrial activity by measuring the amount of resazurin reduced
to resorufin as fluorescence signal (relative fluorescence unit) in a multi plate reader in the last two hours of a
120 h differentiation and compound treatment period. 0.2 % Triton X-100 is used as positive control for cell
viability and cytotoxicity as it causes cell lysis and therefor a maximal response for both endpoints. This positive
control is run on each experimental plate.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.143263374485596 nM
Key positive control:
lysed cells

Target (nominal) number of replicates:

17

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO

-------
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The raw data format is different depending on the endpoints. For all endpoints assessed in a
multiplate reader (viability and cytotoxicity) the raw data format are excel files containing values (one for each
endpoint, timepoint and well) measured as relative fluorescence units. These values are transferred from the
original excel file into the AXES sheet. The original excel output files is saved for traceability of the data. The
migration distance of radial glia after 72 h which is measured manual in ImageJ is directly copied into the AXES
sheet as value in nm. Original brightfield images are archived for 10 years. All other raw data is computed from
the ICC images in the Omnisphero software and is exported and saved as one csv file. From there the values are
again transferred to the AXES sheets. The following data is exported from Omnisphero: number of all cells in
migration area, number of neurons all neurons in migration area, number of all oligodendrocytes in migration
area, migration distance radial glia (nm), mean migration distance all neurons (nm), mean migration distance
all oligodendrocytes (nm), neurite length (nm), and neurite area (pixel). All original ICC images are archived for
10 years. If not otherwise stated, all data processing steps are performed in a R based evaluation tool that was
designed for data processing, curve fitting and point of departure evaluation of in vitro concentration response
toxicity data. Data processing describes all processing steps of raw data that are necessary to obtain the final
response values including the normalization, curve fitting and benchmark concentration calculation. Raw data
processing to summary data for viability endpoint involves subtraction of mean background from each response
value. Background corrected response (RFU) = raw response (RFU) - Background (RFU). The data is normalized
to the solvent control. For the normalization to the solvent control, each replicate data point is normalized to
the median of the solvent control in the respective experiment.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.), 6: resp.multnegl (Multiply the
normalized response value (resp) by -1; -l*resp.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

19: pc30 (Add a cutoff value of 30. Typically for percent of control data.), 21: bmad2 (Add a cutoff value
of 2 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using
test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

-------
5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 238	Number of chemicals tested: 223

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
29

Inaclive hit count: 0
-------
For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

NA%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

66.847

0

Z Prime Factor for median positive and neutral control across all plates:

NA

-------
(1 - ((3 * (pmad + nmad)) / abs(pmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	Inf

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 34.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

• Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

-------
•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Baumann J, Gassmann K, Masjosthusmann S, DeBoer D, Bendt F, Giersiefer S, Fritsche E.
Comparative human and rat neurospheres reveal species differences in chemical effects on
neurodevelopmental key events. Arch Toxicol. 2016 Jun;90(6):1415-27. doi: 10.1007/s00204-015-1568-8. Epub
2015 Jul 28. PMID: 26216354.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3087

IU F_N PCl_cytotoxicity_72 h r

1.	General Information

1.1	Assay Title: Cytotoxicity Assessment at 72 hours in the Leibniz Research Institute for Environmental Medicine
(IUF) Neural Progenitor Cell Proliferation Assay (NPC1)

1.2	Assay Summary: IUF_NPC1 is a cell-based, multiplexed assay that uses hNPC, a human primary neural progenitor
cells, with measurements taken at 72 hours after chemical dosing in a 96-well plate. Human neural progenitor
cells (hNPC) are generated from human fetal brain cortex at gestational week (GW) 16-19. Grown in suspension
culture and under proliferative conditions (proliferation media and growth factors), these cells represent neural
progenitor cell proliferation. In the NPC1 assay, hNPC's are exposed to the test compound for 72h. Cell
proliferation is assessed as an increase in sphere area using automated phase contrast imaging and as
incorporation of Bromodeoxyuridine (Brdll) during DNA synthesis using a luminescence-based cell proliferation
ELISA. In parallel, the cell viability is assessed using an alamar blue viability assay and the cytotoxicity using a
lactate dehydrogenase dependent membrane integrity assay. IUF_NPCl_cytotoxicity_72hr is one of 4 assay
components measured from the IUF_NPC1 assay. It is designed to measure cytotoxicity at 72 hrs due to loss of
membrane integrity. LDH measurements are performed from medium supernatants of each well and are based
on the reduction of resazurin to resorufin measured as fluorescence signal (relative fluorescence unit) in a
multiplate reader. Data from the IUF_NPCl_cytotoxicity_72hr component was analyzed at the endpoint
IUF_NPCl_cytotoxicity_72hr in the positive analysis fitting direction relative to the dynamic range and baseline
of activity. Using a type of viability reporter, gain-of-signal activity can be used to understand cytotoxicity
effects.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Leibniz Research Institute for Environmental Medicine (IUF) is a German research institution in
Diisseldorf. The IUF research mission is the molecular prevention of environmentally induced health disorders.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: For the source cells, Lonza holds donor consent and legal authorization that provides
permission for all research use.

1.9	Assay Throughput: 96-well plate. The methods described are set up in a 96 well plate format with automated
image acquisition and analysis and data evaluation. Pipetting steps such as coating of 96 well plates, compound
dilutions, feeding, cell viability and cytotoxicity assay can be automated using a liquid handling system. In the
fully automated set up 10 plates with 8 conditions and 5 replicates per condition can be run in one week. This
results in the generation of 400 data points for each endpoint within one week (excluding all controls). The
throughput is therefore estimated as medium to high.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in enzymatic activity (increase in lactate dehydrogenase) are indicative of compromised
cell health. Reductions in the total LDH (in cells), indicates cell loss or death.

-------
The human developing central nervous system is more vulnerable to the adverse effects of chemical agents than
the adult brain. At present, due to the knowledge gap concerning hazard identification for human
neurodevelopmental toxicity (DNT), there is an urgent need fortestingand subsequently regulation of chemicals
for their potential to interfere with the developing nervous system. Primary human neural progenitor cells
(hNPC) cultivated as three-dimensional floating spheres are able to represent several key processes during brain
development. In the NPC1 assay, hNPC are exposed to the test compound for 72 hours. Cell proliferation is
assessed as an increase in sphere area using automated phase contrast imaging and as incorporation of
Bromodeoxyuridine (Brdll) using a luminescence-based cell proliferation ELISA. In parallel, the cell viability is
assessed using an alamar blue viability assay. In the neural progenitor cell migration and differentiation assay
(NPC2-5), hNPC are plated on an extracellular matrix, and migrate and differentiate out of the sphere core.
Thereby the processes radial glia migration, migration of neurons and oligodendrocytes as well as differentiation
into neurons and oligodendrocytes can be studied in combination with general cell viability and cytotoxicity.
Cell migration and differentiation are critical process during brain development that, if disturbed lead to
alterations in brain development and causes cognitive dysfunction. Currently these processes are assessed in
the OECD TG426 by neuropathological evaluation of certain brain regions as well as neurobehavioral tests.

2.2	Scientific Principles: Primary hNPCs are isolated from the fetal brain cortices and can be used to measure
proliferation, a process of brain growth during the fetal phase of prenatal development. The test system
therefore measures adverse events in the young (fetal) developing brain. Different types of NPC exist in the
developing brain. Besides ventricular zone NPC, radial glia cells serve as cortical progenitor cells responsible for
cortical expansion and folding. As whole cortices were used for cell preparation, this is not a specific NPC type
but rather a mix of NPCs found in fetal human cortex during development. The toxicological events that are
modeled concern events that influence proliferation of NPCs found in human cortex during the fetal phase.

2.3	Experimental System: suspension hNPC primary cell used. Primary human neural progenitor cells (hNPC) are
provided as cryopreserved 3D neurospheres from Lonza, Verviers, Belgium. Material originates from the human
brain cortex of different gestational ages (GW16-19). Sex is either specified or determined before the cells are
used. Grown in suspension culture and under proliferative conditions (proliferation media and growth factors)
these cells represent neural progenitor cell proliferation. 1x106 hNPC are obtained from Lonza (#PT-2599) and
expanded. Lonza provides the cells with a viability of at least 20% FACS analysis indicates proliferating
neurospheres positive for nestin and Ki67. Following differentiation on a poly D-lysin/laminin matrix in the
absence of growth factors, the cells test positive for TUBB3, GFAP, Nestin and 04 (Baumann et al., 2015;
Schmuck et al., 2017). Within the first three days after thawing, 100 uL of spheres (at least 20) are plated on
poly Dlysin/laminin matrix in an 8-chamber slide with 500 uL N2 Media. On day one after plating greater than
80% of spheres need to be differentiated for the cell to be used in experiments. Differentiation towards the
final test system: Cells are frozen in liquid nitrogen and after thawing have to be cultivated in proliferation
medium at 37C and 5 % C02. The medium contains Dulbecco's modified Eagle medium and Hams F12 (3:1)
supplemented with 2% B27, 20 ng/mL EGF, 20 ng/mL recombinant human FGF, 1% penicillin and streptomycin.
The thawing is performed by repeated addition and removal of proliferation medium until all cells are
transferred into suspension in a tissue culture flask. The cells are carefully resuspended and distributed to 10
cm petri dishes filled with fresh, prewarmed proliferation medium. The cells are fed by replacing half the
medium with new medium every two to three days. At each feeding day the culture is checked for impurities
which are removed into a new petri dish. From this dish mistakenly, sorted spheres can be rescued and placed
back in the original culture dish. After 3-4 weeks neurospheres reach the acceptable size of 0.2 - 0.5 mm for
passaging by mechanical dissociation. Thereby, neurospheres are cut into small pieces (0.15 - 0.25 mm;
depending on the desired sphere size), which round up again to uniform sized neurospheres within 1 day in
proliferation medium. By using this method neurospheres are expanded every week. Starting at week 2 poly
hema coated dishes are used for the cultivation procedure. Critical consumable: the cultivation medium does
not contain serum or serum replacement. The use of epidermal growth factor (EGF) and recombinant human
fibroblast growth factor (FGF) is critical for sphere growth. FGF contains 1% bovine serum albumin and is thus
prone to batch effects. Critical handling: The thawing media contains DMSO in a concentration that affects cell
health which is why thawed cells should quickly be diluted in proliferation media (30 mL of media for one vial
of cells). It is recommended to add FGF into medium directly before thawing. At the end of week two of the

-------
expansion period (see below), the spheres should be transferred to petri dishes coated with poly hema to
prevent cell attachment, attached cells that are not differentiated can be detached using a 1000 uL pipet. To
avoid repeated attachment, all cells should be transferred to a new poly hema coated petri dish. Medium that
contains FGF should not be stored longer than 1 week at 4C. During the first two weeks, medium should be
removed using a 1000 uL pipet to keep the accidental removal of small spheres to a minimum. The neurospheres
should be well distributed in the petri dish to circumvent aggregation, which is especially important after
mechanical dissociations.

2.4	Metabolic Competence: Primary hNPC under proliferating and differentiating conditions do not express CYP1A1
and CYP1B1 (Gassmann et al., 2010). Primary hNPC during differentiation, have the capacity to up-regulate
glutathione-dependent protective strategies upon reactive oxygen species (ROS) exposure Masjosthusmann et
al, 2019). Gene expression levels of genes involved in the antioxidative defense (glutathione peroxidase 1
(GPX1), superoxide dismutase 1 (SOD1), catalase (CAT)) were comparable between the in vitro system and
developing human brains in vivo and show similar expression levels (Masjosthusmann et al., 2019). Other
metabolic pathways are not characterized.

2.5	Exposure Regime: After the cell expansion period, the cells are cultured for up to four weeks in which they are
passaged every week. Between one to three days after passaging, depending on the size chosen for passaging,
spheres at a size of 0.3 mm are used in the assay. For the assessment of neural progenitor cell proliferation, the
spheres are plated in poly-Hema coated 96-well U-bottom plates filled with proliferation medium containing
growth factors (EGF and FGF). One 0.25 - 0.35 mm big sphere is plated in the middle of each well. Within 3 days
NPCs proliferate and grow in size. Cultivation during the test method is performed at 37C and 5% C02 at a pH
of 7.2-7.6. As a positive control, spheres are cultivated in the absence of growth factors (EGF and FGF), which
dramatically reduces proliferation. Exposure starts on the plating day (day 0) and is continued over three days,
without chemical renewal, until the experiment is terminated. The assay is terminated by the assessment of cell
viability, cytotoxicity, and proliferation by Brdll. All endpoints are generated from the same experimental run
and from each well/sphere in the 96-well plate.

Baseline median absolute deviation for the assay (bmad): 0.238
Response cutoff threshold used to determine hit calls: 10
Detection technology used: membrane integrity assay (Fluorescence)

2.6 Response: This assay uses primary human neural progenitor cells (hNPCs) from human cortex (gestation week
16 - 19) to measure changes in hNPC proliferation. Biological responses that are measured include fetal hNPC
proliferation, viability and cytotoxicity as quantified by sphere size, DNA synthesis as chemiluminescence
measurement, viability and cytotoxicity as fluorescence intensity. Primary endpoints: 1) Proliferation by area
(72hours; NPCla) is assessed as the slope of the increase in sphere size (amount of pixels in the bright-field
image, sphere area) over 72 hours measured by brightfield microscopy using high content imaging at 0 hours,
24 hours, 48 hours, and 72 hours. 2) Proliferation by Brdll (72hours; NPClb) is assessed as Brdll incorporation
(as an indirect measure of DNA synthesis) over the last 16 hours of compound exposure. It is measured as a
luminescence signal (relative luminescence unit) in a multi-plate reader after 72 hours. Secondary endpoints: 1)
Cytotoxicity at 72 hours is assessed as membrane integrity by measuring the amount of LDH leaked from cells
with damaged plasma membranes. LDH-dependent reduction of resazurin to resorufin is measured in the
supernatant of each well as fluorescence of the reaction product resorufin (relative fluorescence unit) in a multi-
plate reader after 72 hours of compound exposure. 2) Viability at 72 hours is assessed as mitochondrial activity

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
7

Standard minimum concentration tested:

0.445832647462276 nM
Key positive control:
lysed cells

Target (nominal) number of replicates:

14

Standard maximum concentration tested:

325.011999999998 nM
Neutral vehicle control:

DMSO

-------
by measuring the amount of resazurin reduced to fluorescent resorufin (relative fluorescence unit) in a multi-
plate reader in the last two hours of the 72 hours proliferation and compound exposure period.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: The raw data format is different depending on the endpoints. For all endpoints assessed in a
multi-plate reader (viability, cytotoxicity, Brdll incorporation) the raw data formats are excel files containing
values (one for each endpoint, timepoint and well) measured as relative fluorescence/luminescence units. These
values are transferred from the original excel file into the AXES sheet. The original excel output file is saved for
traceability of the data. The sphere size is automatically measured in the Cellomics scan software (Version 6.6.0;
Thermo Scientific) and copied into the AXES sheet. Original brightfield images are archived for 10 years. If not
otherwise stated, all data processing steps are performed in an R based evaluation tool that was designed for
data processing, curve fitting and point of departure evaluation of in vitro concentration response toxicity data.
Data processing describes all processing steps of raw data that are necessary to obtain the final response values
including the normalization, curve fitting and benchmark concentration calculation. The cytotoxicity endpoint
has no pre-processing. The data is normalized to the solvent control by plate. For the normalization to the
solvent control, each replicate data point is normalized to the median of the solvent control in the respective
experiment. Mathematical procedures to define outliers are not applied. Data points from wells where
technical problems are known or obvious are excluded from the analysis. Possible technical problems include:
pipetting errors, spillover from lysis, or problems in fixation of singularized cells. All wells with technical
problems are marked in the AXES sheet.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must

-------
occur through the database and should come from the available list of methods for each processing level.

Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 13: pval.apid.pwlls.med
(Calculate the positive control value (pval) as the plate-wise median, by assay plate ID (apid), of the
corrected values (cval) for single-concentration gain-of-signal positive control wells (wilt = p).), 17:
bval.apid.nwllslowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate
ID (apid), of the corrected values (cval) of test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2 or neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

22: pclO (Add a cutoff value of 10. Typically for percent of control data.), 27: ow_bidirectional_loss
(Multiply winning model hitcall (hitc) by -1 for models fit in the positive analysis direction. Typically used
for endpoints where only negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series

-------
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 230

Active hit count: hitc>0.9
12

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1160.25

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	2022.5

Positive control well median absolute deviation, by plate: pmad	0

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	Inf

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 42.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Baumann J, Gassmann K, Masjosthusmann S, DeBoer D, Bendt F, Giersiefer S, Fritsche E.
Comparative human and rat neurospheres reveal species differences in chemical effects on
neurodevelopmental key events. Arch Toxicol. 2016 Jun;90(6):1415-27. doi: 10.1007/s00204-015-1568-8. Epub
2015 Jul 28. PMID: 26216354.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 739

OT_AR_ARELUC_AG_1440

1.	General Information

1.1	Assay Title: Odyssey Thera 24-hour CHO-K1 Luciferase Assay for Agonist Androgen Response Element (ARE)

1.2	Assay Summary: OT_AR_ARELUC_AG_1440 is a cell-based, single-readout assay that uses CHO-K1, a Chinese
hamster ovary cell line, with measurements taken at 24 hours after chemical dosing in a 384-well plate.
OT_AR_ARELUC_AG_1440 is one of one assay component(s) measured or calculated from the
OT_AR_ARELUC_AG_1440 assay. It is designed to make measurements of luciferase induction, a form of
inducible reporter, as detected with bioluminescence signals by Luciferase technology. Data from the assay
component OT_AR_ARELUC_AG_1440 was analyzed into 1 assay endpoint. This assay endpoint,
OT_AR_ARELUC_AG_1440, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, measures of receptor for gain-of-
signal activity can be used to understand the reporter gene at the pathway-level as they relate to the gene AR.
Furthermore, this assay endpoint can be referred to as a primary readout, because the performed assay has
only produced 1 assay endpoint. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the nuclear receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Odyssey Thera is a drug discovery company focusing on high-throughput screening with a
particular focus on developing novel protein:protein interaction assays using protein-fragment
complementation technology.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary.

1.9	Assay Throughput: 384-well plate. Stably transfected CHO-K1 cells are aliquoted into 384-well microtiter plates
and incubated with test compounds for 24 hours prior to monitoring luminescence increases resulting from AR
transactivation by test compounds.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to bioluminescence signals produced from an enzymatic reaction involving the key
substrate [D-luciferin] are indicative of changes in transcriptional gene expression due to agonist activity
regulated by the human androgen receptor [GeneSymbokAR | GenelD:367 |
Uniprot_SwissProt_Accession:P10275],

This assay was developed to measure long-term transcriptional changes induced by ligand-binding of androgen
receptor alpha (AR) detected in a mammalian (Chinese hamster ovary; CHO-K1) cell line stably expressing both
full-length human AR and an ARE reporter constructs driving expression of luciferase.

2.2	Scientific Principles: Endocrine disrupting chemicals (EDCs) are chemicals found in the environment or
introduced in one's diet that perturb normal hormone biosynthesis, metabolism and downstream gene
transcription. A significant subset of EDCs including industrial chemicals, organochlorinated pesticides, and

-------
plasticizers have the capacity to bind to the androgen receptor (AR), a member of the nuclear receptor
superfamily that is activated by androgens (Luccio-Camelo and Prins 2011, Sultan et al. 2001). Due to the
androgen-dependence of male sexual differentiation, exposure to EDCs can result in reduced sperm counts,
increased infertility, and elevated testicular and prostate cancer risks (Luccio-Camelo and Prins 2011). AR is a
ligand-inducible nuclear hormone receptor that mediates transcription through a series of events including
ligand binding, DNA binding to androgen response elements, and interaction with various co-activators.

2.3	Experimental System: adherent CHO-K1 cell line used. CHO-K1 is an immortal mammal ovary fibroblast cell line
derived from Chinese hamster cells isolated in 1957 (Puck et al. 1958). CHO-K1 is a widely used cell line with
well characterized cell transfection methods frequently utilized for large-scale production of numerous
pharmaceutical proteins (including hormones, antibodies, and blood factors) since these cells are capable of
folding, assembling and post-translationally modifying proteins in a manner that is more comparable to humans
(Kildegaard et al. 2013). This assay uses CHOK1 cells stably transfected with full-length AR to monitor ARE-driven
expression of luciferase reporter activity in response to chemical exposures over 24 hours.

2.4	Metabolic Competence: CHO-K1 cells have the capacity to metabolize the anti-androgenic fungicide vinclozolin
(Jacobs et al. 2008), however the intrinsic production of Phase l/ll enzymes has not been well characterized in
this cell line and metabolic activation/detoxification of test compounds is potentially limited.

2.5	Exposure Regime: To identify AR agonists, CHO-K1 cells are seeded into a white-walled/white-bottom 384-well
plate, followed by treatment with compounds of interest or controls for 24 hours. A luciferase assay mix
containing D-luciferin and ATP in PBS is then added to the cells and luminescence quantified on a Luminoskan
(Thermo Scientific) luminometer. Modulation of this assay is quantified as an increase in mean luminescence
intensity relative to vehicle controls. To evaluate specificity of the AR / ARE-luc assay, several known AR agonists,
AR antagonists, and ER-selective compounds were also tested in 5-pt concentration-response format in a
minimum of 7 assay plates. OT determined the reproducibility of DHT-induced AR/ARE-luc transcriptional
activation with a total of seven 384-well plates run on 2 different days, for 24-h time frames. A minimum of 3
replicate wells were analyzed for each sample with 14 replicate wells for vehicle controls. Luminescence data
were captured on the Luminoskan luminometer. The range of EC50 values for DHT was determined in the
AR/ARE-luc transcriptional assay in seven 384-well plates run on 2 different days, which did not vary by more
than 4-fold for the 24-hour endpoints

Baseline median absolute deviation for the assay (bmad): 2.582
Response cutoff threshold used to determine hit calls: 20
Detection technology used: Luciferase (Luminescence)

2.6	Response: Androgen receptor-mediated signaling pathway stable protein formation in response to AR agonism
and SRC-1 co-activator recruitment is measured by changes in fluorescence intensity relative to DMSO (neutral
control) baseline.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.564 nM
Key positive control:

6-alpha-Fluorotestosterone

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

188 nM
Neutral vehicle control:

DMSO

-------
2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast AR Pathway Model: Androgen receptor assays used in ToxCast AR Pathway model. See
10.1016/j.yrtph.2020.104764 and 10.1021/acs.chemrestox.6b00347

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction over DMSO controls
(baseline), and was reported as a percentage of positive control (6alpha-fluorotestosterone) activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise
maximum, by assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single-
or multiple-concentration negative control wells (wilt = m or o) by apid, well type, and concentration.),
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -

-------
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1977	Number of chemicals tested: 1857

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
183

Inactive hit count: 0
-------
power(pow) model:
linear-polynomial (polyl) model:

801

120

quadratic-polynomialfpoly2) model: 209

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

484

12

1

159

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-------
6.52

Neutral control median absolute deviation, by plate: nmad	1.275

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	20.39%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	92.04

Positive control well median absolute deviation, by plate: pmad	15.745

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	5.074

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 159.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: MacDonald ML, Lamerdin J, Owens S, Keon BH, Bilter GK, Shang Z, Huang Z, Yu H, Dias J, Minami
T, Michnick SW, Westwick JK. Identifying off-target effects and hidden phenotypes of drugs in human cells. Nat
Chem Biol. 2006 Jun;2(6):329-37. Epub 2006 May 7. PubMed PMID: 16680159., Yu H, West M, Keon BH, Bilter
GK, Owens S, Lamerdin J, Westwick JK. Measuring drug action in the cellular context using protein-fragment
complementation assays. Assay Drug Dev Technol. 2003 Dec;l(6):811-22. PubMed PMID: 15090227.,
Kleinstreuer NC, Ceger P, Watt ED, Martin M, Houck K, Browne P, Thomas RS, Casey WM, Dix DJ, Allen D,
Sakamuru S, Xia M, Huang R, Judson R. Development and Validation of a Computational Model for Androgen
Receptor Activity. Chem Res Toxicol. 2017 Apr 17;30(4):946-964. doi: 10.1021/acs.chemrestox.6b00347. Epub
2016 Dec 9. PMID: 27933809; PMCID: PMC5396026.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 740

OT_AR_ARSRC1_0480

1.	General Information

1.1	Assay Title: Odyssey Thera 8-hour HEK293T Protein-Complementation Assay for Androgen Receptor/SRC-1 Co-
activator

1.2	Assay Summary: OT_AR_ARSRC1_0480 is a cell-based, single-readout assay that uses HEK293T, a human kidney
cell line, with measurements taken at 8 hours after chemical dosing in a 384-well plate. OT_AR_ARSRC1_0480
is one of one assay component(s) measured or calculated from the OT_AR_ARSRC1_0480 assay. It is designed
to make measurements of protein fragment complementation, a form of binding reporter, as detected with
fluorescence intensity signals by Protein-fragment Complementation technology. Data from the assay
component OT_AR_ARSRC1_0480 was analyzed into 1 assay endpoint. This assay endpoint,
OT_AR_ARSRC1_0480, was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of binding reporter, measures of receptor for gain-of-signal activity
can be used to understand the binding at the pathway-level as they relate to the gene AR. Furthermore, this
assay endpoint can be referred to as a primary readout, because the performed assay has only produced 1 assay
endpoint. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
nuclear receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Odyssey Thera is a drug discovery company focusing on high-throughput screening with a
particular focus on developing novel protein:protein interaction assays using protein-fragment
complementation technology.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The protein-fragment complementation assay (PCA) methods are trademarked and
patented by Odyssey Thera, Inc.

1.9	Assay Throughput: 384-well plate. Stably transfected HEK293T cells are aliquoted into 384-well microtiter plates
and incubated with test compounds for 8 hours prior to monitoring fluorescence emission resulting from
xenobiotic AR activation and co-factor recruitment.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals produced from the protein fragment complementation
are indicative of a change in the receptor function and kinetics for the human androgen receptor and SRC proto-
oncogene, non-receptor tyrosine kinase [GeneSymbokAR & SRC | GenelD:367 & 6714 |
Uniprot_SwissProt_Accession:P10275 & P12931],

The Odyssey Thera AR/SRC-1 assay is a protein-complementation assay (PCA) comprised of the full length
human AR and the nuclear receptor interacting domain of SRC-1, each fused to an inactive fragment of YFP.
Unliganded AR is bound by heat shock/co-chaperone proteins in an inactive state in the cytoplasm (Pratt and
Toft 1997) therefore fluorescent signal in the basal or unstimulated state of the assay is predominately present

-------
in the cytoplasm. In response to ligand binding, the AR/SRC-1YFP complex translocates from the cytoplasm to
the nucleus, and this assay records compound-AR interactions by measurement of nucleus : cytoplasm (N/C)
signal ratios. Each AR protein and its associated coactivator (SRC-1) contain a rationally dissected fragment of a
yellow-fluorescent protein (YFP) reporter enzyme. When the androgen responsive signaling pathway is
impacted by chemical activation or interference, the resulting YFP signal production can be measured using
fluorescence microscopy to screen a diverse chemical library for potential xenobioticARIigand-binding. Changes
in protein complex interactions can be impacted by a variety of biochemical events within a pathway, and this
assay is designed to track changes at the level of cell functioning which may occur at a number of points along
the androgen signaling pathway following an 8 hour incubation of cells with test compound in 384-well plate,
using DMSO as a negative control and baseline signal and DHT (5alpha-Dihydrotestosterone) as a positive
control and measure of 100 percent ligand-binding activity in AR. Concentration-response models are based on
6-point concentration series (0.3 a€" 100 uM) run in triplicate. Preliminary experiments examined the temporal
nature of the AR/SRC-1 translocation and determined that maximum S/B was achieved after 8 hours, and while
EC50s did not vary considerably over time, a more robust estimation of lower concentrations was achieved in
longer duration assays. The OT AR/SRC-1 assay was also run for 16 hours (see description for
OT_AR_ARSRC1_0960). OT initially treated the cells with the AR agonist 4-5, dihydrotestosterone (DHT) in 10-
point concentration- response format for 8 hours to assess sensitivity of the AR/SRC-1 assay to ligand.

2.2	Scientific Principles: Endocrine disrupting chemicals (EDCs) are chemicals found in the environment or
introduced in one's diet that perturb normal hormone biosynthesis, metabolism and downstream gene
transcription. A significant subset of EDCs including industrial chemicals, organochlorinated pesticides, and
plasticizers have the capacity to bind to the androgen receptor (AR), a member of the nuclear receptor
superfamily that is activated by androgens (Luccio-Camelo and Prins 2011, Sultan et al. 2001). Due to the
androgen-dependence of male sexual differentiation, exposure to EDCs can result in reduced sperm counts,
increased infertility, and elevated testicular and prostate cancer risks (Luccio-Camelo and Prins 2011). AR is a
ligand-inducible nuclear hormone receptor that mediates transcription through a series of events including
ligand binding, DNA binding to androgen response elements, and interaction with various co-activators. These
co-activators are components required for androgen-dependent transcription, and either physically link the AR
to the basal transcriptional machinery or modulate chromatin via methylation or acetylation (McKenna et al.
1999). Over 169 proteins have been reported as potential AR co-regulators (Heemers and Tindall 2007) including
the prototypical nuclear receptor coactivator, SRC-1. While numerous assays have been described in the
literature that assess AR function using transcriptional readouts (Vinggaard et al. 1999), ligand competition
binding (Fau et al. 2011) or cellular dynamics of GFP-tagged AR (Sultan et al. 2001, Szafran et al. 2008), the OT
AR/SRC-1 assay evaluated EDC-induced AR activity in the context of the receptor's interaction with the steroid
receptor co-activator protein, SRC-1. The advantage of this approach is that compounds that favor interaction
of AR with SRC-1 (such as ligands), indicating the activated state of the receptor, can be readily detected. In
addition, compounds that perturb this interaction by acting upstream in the pathway (e.g. through non-genomic
effects) may also be identified. Therefore, this assay represents a novel tool for evaluating endocrine disrupting
agents which have potential to interfere with endogenous androgen signaling in a high throughput screening
mode. This assay is intended for use as a part of an integrated testing strategy, to screen a large structurally
diverse chemical library for compounds with the potential to interact with androgen receptor mediated
pathways and potentially affect endocrine systems in exposed populations. There is strong evidence that
androgen receptor agonism is a molecular initiating event (MIE) in an Adverse Outcome Pathway (AOP) leading
to reproductive dysfunction in fish populations (EAGMST Approved AOP), and there is some evidence that
androgen receptor activation is the MIE for a putative pathway leading to hepatocellular adenomas and
carcinomas (in mouse and rat models) (AOP currently under development). Chemical-activity profiles derived
from this assay can inform prioritization decisions for compound selection in more resource intensive in vivo
studies to further investigate the involvement of AR agonism in pathways leading to hazardous outcomes in
biological systems.

2.3	Experimental System: adherent HEK293T cell line used. This assay monitors the ligand binding domain (LBD) of
the human AR (stably expressed in HEK293T) for xenobiotic androgen receptor activation. The HEK-293 cell line
are human embryonic epithelial kidney cells (of unknown parentage) transformed with sheared adenovirus 5

-------
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK-293T cells are derived
from the HEK293 cell line by the addition of the SV40 large T antigen that has been shown to increase vector
production of some viral vectors. HEK293T are reported to have relatively high transfection efficiencies when
compared to other cell lines (COS-7 and HepG2) (Dai et al. 2015) and it is among the most frequently utilized
cell lines for in small-scale protein production and in viral vector propagation using the transient transfection
method (Lin et al. 2014).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Constitutive
expression of Phase l/ll enzymes has not been well characterized in this HEK293T cell line and metabolic
activation/detoxification of test compounds is potentially limited.

2.5	Exposure Regime: Protein-fragment complementation assay, or PCA, is a method for the identification and
quantification of protein-protein interactions. In the PCA, the proteins of interest ("bait" and "prey") are each
covalently linked to fragments of a third protein (e.g. DHFR, which acts as a "reporter"). Interaction between
the bait and the prey proteins brings the fragments of the reporter protein in close proximity to allow them to
form a functional reporter protein whose activity can be measured. The OT AR / SRC-1 assay assessed androgen
receptor chemical interactions using a rapidly maturing, intensely fluorescent mutant of YFP known as Venus,
rationally dissected into two separate fragments. The fragments were obtained as follows: first, fragments
coding for YFP1 and YFP2 (corresponding to amino acid residues 1-158 and 159-239 of the full length YFP,
respectively) were generated by oligonucleotide synthesis (Blue Heron Biotechnology), and then PCR
mutagenesis was used to generate the mutant fragments IFP1 and IFP2. Fusion constructs were transfected into
HEK293T cells with a (Gly4Ser)2 linker between the AR/SRC-1 and YFP fragment genes to facilitate
complementation when interacting proteins bring fragments into close proximity. The construct is stably
expressed in HEK293T cells. Cells are seeded into optical quality 384-well poly-D-lysine coated plates in phenol
red-free DMEM medium supplemented with 10 percent dextran-treated FBS and allowed to adhere for 24 hours
prior to treatment with compounds of interest or controls for 8 hours. Cells are fixed in 4 percent formaldehyde
and stained with Draq5 (BioStatus) to identify cells and subcellular compartment boundaries prior to signal
detection. Images were acquired on an Evotec Opera at 2 wavelengths (488 and 635nm), and the ratio of
fluorescence in the nucleus relative to fluorescence in the cytoplasm (N/C Ratio) in the 488nm channel was
calculated for a minimum of 400 cells per image. Both agonists and antagonists of the AR receptor induce
nuclear translocation to varying degrees. Preliminary experiments examined the temporal nature of the AR/SRC-
1 translocation and determined that maximum S/B was achieved after 8 hours, and while EC50s did not vary
considerably over time, a more robust estimation of lower concentrations was achieved in longer duration
assays. The OT AR/SRC-1 assay was also run for 16 hours (see description for OT_AR_ARSRC1_0960).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.564 nM
Key positive control:

4,5-alpha-Dihydrotestosterone

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

188 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 2.456

Response cutoff threshold used to determine hit calls: 20

Detection technology used: Protein-fragment Complementation (Fluorescence)

2.6 Response: Androgen receptor-mediated signaling pathway stable protein formation in response to AR agonism
and SRC-1 co-activator recruitment is measured by changes in fluorescence intensity relative to DMSO (neutral
control) baseline.

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast AR Pathway Model: Androgen receptor assays used in ToxCast AR Pathway model. See
10.1016/j.yrtph.2020.104764 and 10.1021/acs.chemrestox.6b00347

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Each data point was formed by taking the log of the ratio of the sample signal to the control
signal. A minimum of 8 replicate wells were analyzed each for sample and vehicle controls. Wells located in the
outer ring of the plate were omitted due to the potential for edge effects. Data were captured on a Perkin Elmer
Opera confocal microscope: 8 images per well in two wavelengths with a minimum of 400 cells per image. Each
data point represents the average of 32 images acquired in four wells, normalized to the N/C Ratio calculated
for 8 vehicle control wells (64 images). Gain-of-signal data are plotted as percent of activity where max activity
corresponds to luM DHT, and relative to DMSO, negative control and baseline activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise
maximum, by assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single-
or multiple-concentration negative control wells (wilt = m or o) by apid, well type, and concentration.),

-------
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2053

Number of chemicals tested: 1857

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
379

Inactive hit count: Oihitc 0.9
1449

WINING MODEL SELECTION

NA hit count: hitc^O
225

Number of sample-assay endpoints with winning hill model:

79
72

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

757

140

quadratic-poly nomialfpoly 2) model: 394

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

49

3

397

162

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.786

Neutral control median absolute deviation, by plate: nmad	0.017

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	2.21%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	1.508

Positive control well median absolute deviation, by plate: pmad	0.036

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	17.023

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 162.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: MacDonald ML, Lamerdin J, Owens S, Keon BH, Bilter GK, Shang Z, Huang Z, Yu H, Dias J, Minami
T, Michnick SW, Westwick JK. Identifying off-target effects and hidden phenotypes of drugs in human cells. Nat
Chem Biol. 2006 Jun;2(6):329-37. Epub 2006 May 7. PubMed PMID: 16680159., Yu H, West M, Keon BH, Bilter
GK, Owens S, Lamerdin J, Westwick JK. Measuring drug action in the cellular context using protein-fragment
complementation assays. Assay Drug Dev Technol. 2003 Dec;l(6):811-22. PubMed PMID: 15090227.,
Kleinstreuer NC, Ceger P, Watt ED, Martin M, Houck K, Browne P, Thomas RS, Casey WM, Dix DJ, Allen D,
Sakamuru S, Xia M, Huang R, Judson R. Development and Validation of a Computational Model for Androgen
Receptor Activity. Chem Res Toxicol. 2017 Apr 17;30(4):946-964. doi: 10.1021/acs.chemrestox.6b00347. Epub
2016 Dec 9. PMID: 27933809; PMCID: PMC5396026.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 741

OT_AR_ARSRC1_0960

1.	General Information

1.1	Assay Title: Odyssey Thera 16-hour HEK293T Protein-Complementation Assay for Androgen Receptor/SRC-1 Co-
activator

1.2	Assay Summary: OT_AR_ARSRC1_0960 is a cell-based, single-readout assay that uses HEK293T, a human kidney
cell line, with measurements taken at 16 hours after chemical dosing in a 384-well plate. OT_AR_ARSRC1_0960
is one of one assay component(s) measured or calculated from the OT_AR_ARSRC1_0960 assay. It is designed
to make measurements of protein fragment complementation, a form of binding reporter, as detected with
fluorescence intensity signals by Protein-fragment Complementation technology. Data from the assay
component OT_AR_ARSRC1_0960 was analyzed into 1 assay endpoint. This assay endpoint,
OT_AR_ARSRC1_0960, was analyzed in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of binding reporter, measures of receptor for gain-of-signal activity
can be used to understand the binding at the pathway-level as they relate to the gene AR. Furthermore, this
assay endpoint can be referred to as a primary readout, because the performed assay has only produced 1 assay
endpoint. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
nuclear receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Odyssey Thera is a drug discovery company focusing on high-throughput screening with a
particular focus on developing novel protein:protein interaction assays using protein-fragment
complementation technology.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The protein-fragment complementation assay (PCA) methods are trademarked and
patented by Odyssey Thera, Inc.

1.9	Assay Throughput: 384-well plate. Stably transfected HEK293T cells are aliquoted into 384-well microtiter plates
and incubated with test compounds for 8 hours prior to monitoring fluorescence emission resulting from
xenobiotic AR activation and co-factor recruitment.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals produced from the protein fragment complementation
are indicative of a change in the receptor function and kinetics for the human androgen receptor and SRC proto-
oncogene, non-receptor tyrosine kinase [GeneSymbokAR & SRC | GenelD:367 & 6714 |
Uniprot_SwissProt_Accession:P10275 & P12931],

The Odyssey Thera AR/SRC-1 assay is a protein-complementation assay (PCA) comprised of the full length
human AR and the nuclear receptor interacting domain of SRC-1, each fused to an inactive fragment of YFP.
Unliganded AR is bound by heat shock/co-chaperone proteins in an inactive state in the cytoplasm (Pratt and
Toft 1997) therefore fluorescent signal in the basal or unstimulated state of the assay is predominately present

-------
in the cytoplasm. In response to ligand binding, the AR/SRC-1YFP complex translocates from the cytoplasm to
the nucleus, and this assay records compound-AR interactions by measurement of nucleus : cytoplasm (N/C)
signal ratios. Each AR protein and its associated coactivator (SRC-1) contain a rationally dissected fragment of a
yellow-fluorescent protein (YFP) reporter enzyme. When the androgen responsive signaling pathway is
impacted by chemical activation or interference, the resulting YFP signal production can be measured using
fluorescence microscopy to screen a diverse chemical library for potential xenobioticARIigand-binding. Changes
in protein complex interactions can be impacted by a variety of biochemical events within a pathway, and this
assay is designed to track changes at the level of cell functioning which may occur at a number of points along
the androgen signaling pathway following an 8 hour incubation of cells with test compound in 384-well plate,
using DMSO as a negative control and baseline signal and DHT (5alpha-Dihydrotestosterone) as a positive
control and measure of 100 percent ligand-binding activity in AR. Concentration-response models are based on
6-point concentration series (0.3 a€" 100 uM) run in triplicate. Preliminary experiments examined the temporal
nature of the AR/SRC-1 translocation and determined that maximum S/B was achieved after 8 hours, and while
EC50s did not vary considerably over time, a more robust estimation of lower concentrations was achieved in
longer duration assays. The OT AR/SRC-1 assay was also run for 16 hours (see description for
OT_AR_ARSRC1_0960). OT initially treated the cells with the AR agonist 4-5, dihydrotestosterone (DHT) in 10-
point concentration- response format for 8 hours to assess sensitivity of the AR/SRC-1 assay to ligand.

2.2	Scientific Principles: Endocrine disrupting chemicals (EDCs) are chemicals found in the environment or
introduced in one's diet that perturb normal hormone biosynthesis, metabolism and downstream gene
transcription. A significant subset of EDCs including industrial chemicals, organochlorinated pesticides, and
plasticizers have the capacity to bind to the androgen receptor (AR), a member of the nuclear receptor
superfamily that is activated by androgens (Luccio-Camelo and Prins 2011, Sultan et al. 2001). Due to the
androgen-dependence of male sexual differentiation, exposure to EDCs can result in reduced sperm counts,
increased infertility, and elevated testicular and prostate cancer risks (Luccio-Camelo and Prins 2011). AR is a
ligand-inducible nuclear hormone receptor that mediates transcription through a series of events including
ligand binding, DNA binding to androgen response elements, and interaction with various co-activators. These
co-activators are components required for androgen-dependent transcription, and either physically link the AR
to the basal transcriptional machinery or modulate chromatin via methylation or acetylation (McKenna et al.
1999). Over 169 proteins have been reported as potential AR co-regulators (Heemers and Tindall 2007) including
the prototypical nuclear receptor coactivator, SRC-1. While numerous assays have been described in the
literature that assess AR function using transcriptional readouts (Vinggaard et al. 1999), ligand competition
binding (Fau et al. 2011) or cellular dynamics of GFP-tagged AR (Sultan et al. 2001, Szafran et al. 2008), the OT
AR/SRC-1 assay evaluated EDC-induced AR activity in the context of the receptor's interaction with the steroid
receptor co-activator protein, SRC-1. The advantage of this approach is that compounds that favor interaction
of AR with SRC-1 (such as ligands), indicating the activated state of the receptor, can be readily detected. In
addition, compounds that perturb this interaction by acting upstream in the pathway (e.g. through non-genomic
effects) may also be identified. Therefore, this assay represents a novel tool for evaluating endocrine disrupting
agents which have potential to interfere with endogenous androgen signaling in a high throughput screening
mode. This assay is intended for use as a part of an integrated testing strategy, to screen a large structurally
diverse chemical library for compounds with the potential to interact with androgen receptor mediated
pathways and potentially affect endocrine systems in exposed populations. There is strong evidence that
androgen receptor agonism is a molecular initiating event (MIE) in an Adverse Outcome Pathway (AOP) leading
to reproductive dysfunction in fish populations (EAGMST Approved AOP), and there is some evidence that
androgen receptor activation is the MIE for a putative pathway leading to hepatocellular adenomas and
carcinomas (in mouse and rat models) (AOP currently under development). Chemical-activity profiles derived
from this assay can inform prioritization decisions for compound selection in more resource intensive in vivo
studies to further investigate the involvement of AR agonism in pathways leading to hazardous outcomes in
biological systems.

2.3	Experimental System: adherent HEK293T cell line used. This assay monitors the ligand binding domain (LBD) of
the human AR (stably expressed in HEK293T) for xenobiotic androgen receptor activation. The HEK-293 cell line
are human embryonic epithelial kidney cells (of unknown parentage) transformed with sheared adenovirus 5

-------
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK-293T cells are derived
from the HEK293 cell line by the addition of the SV40 large T antigen that has been shown to increase vector
production of some viral vectors. HEK293T are reported to have relatively high transfection efficiencies when
compared to other cell lines (COS-7 and HepG2) (Dai et al. 2015) and it is among the most frequently utilized
cell lines for in small-scale protein production and in viral vector propagation using the transient transfection
method (Lin et al. 2014).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Constitutive
expression of Phase l/ll enzymes has not been well characterized in this HEK293T cell line and metabolic
activation/detoxification of test compounds is potentially limited.

2.5	Exposure Regime: Protein-fragment complementation assay, or PCA, is a method for the identification and
quantification of protein-protein interactions. In the PCA, the proteins of interest ("bait" and "prey") are each
covalently linked to fragments of a third protein (e.g. DHFR, which acts as a "reporter"). Interaction between
the bait and the prey proteins brings the fragments of the reporter protein in close proximity to allow them to
form a functional reporter protein whose activity can be measured. The OT AR / SRC-1 assay assessed androgen
receptor chemical interactions using a rapidly maturing, intensely fluorescent mutant of YFP known as Venus,
rationally dissected into two separate fragments. The fragments were obtained as follows: first, fragments
coding for YFP1 and YFP2 (corresponding to amino acid residues 1-158 and 159-239 of the full length YFP,
respectively) were generated by oligonucleotide synthesis (Blue Heron Biotechnology), and then PCR
mutagenesis was used to generate the mutant fragments IFP1 and IFP2. Fusion constructs were transfected into
HEK293T cells with a (Gly4Ser)2 linker between the AR/SRC-1 and YFP fragment genes to facilitate
complementation when interacting proteins bring fragments into close proximity. The construct is stably
expressed in HEK293T cells. Cells are seeded into optical quality 384-well poly-D-lysine coated plates in phenol
red-free DMEM medium supplemented with 10 percent dextran-treated FBS and allowed to adhere for 24 hours
prior to treatment with compounds of interest or controls for 8 hours. Cells are fixed in 4 percent formaldehyde
and stained with Draq5 (BioStatus) to identify cells and subcellular compartment boundaries prior to signal
detection. Images were acquired on an Evotec Opera at 2 wavelengths (488 and 635nm), and the ratio of
fluorescence in the nucleus relative to fluorescence in the cytoplasm (N/C Ratio) in the 488nm channel was
calculated for a minimum of 400 cells per image. Both agonists and antagonists of the AR receptor induce
nuclear translocation to varying degrees. Preliminary experiments examined the temporal nature of the AR/SRC-
1 translocation and determined that maximum S/B was achieved after 8 hours, and while EC50s did not vary
considerably over time, a more robust estimation of lower concentrations was achieved in longer duration
assays. The OT AR/SRC-1 assay was also run for 16 hours (see description for OT_AR_ARSRC1_0960).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.564 nM
Key positive control:

4,5-alpha-Dihydrotestosterone

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

188 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 1.638

Response cutoff threshold used to determine hit calls: 20

Detection technology used: Protein-fragment Complementation (Fluorescence)

2.6 Response: Androgen receptor-mediated signaling pathway stable protein formation in response to AR agonism
and SRC-1 co-activator recruitment is measured by changes in fluorescence intensity relative to DMSO (neutral
control) baseline.

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast AR Pathway Model: Androgen receptor assays used in ToxCast AR Pathway model. See
10.1016/j.yrtph.2020.104764 and 10.1021/acs.chemrestox.6b00347

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Each data point was formed by taking the log of the ratio of the sample signal to the control
signal. A minimum of 8 replicate wells were analyzed each for sample and vehicle controls. Wells located in the
outer ring of the plate were omitted due to the potential for edge effects. Data were captured on a Perkin Elmer
Opera confocal microscope: 8 images per well in two wavelengths with a minimum of 400 cells per image. Each
data point represents the average of 32 images acquired in four wells, normalized to the N/C Ratio calculated
for 8 vehicle control wells (64 images). Gain-of-signal data are plotted as percent of activity where max activity
corresponds to luM DHT, and relative to DMSO, negative control and baseline activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise
maximum, by assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single-
or multiple-concentration negative control wells (wilt = m or o) by apid, well type, and concentration.),

-------
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2053

Number of chemicals tested: 1857

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
504

Inactive hit count: Oihitc 0.9
1512

WINING MODEL SELECTION

NA hit count: hitc^O
37

Number of sample-assay endpoints with winning hill model:

123
70

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

513

149

quadratic-polynomialfpoly2) model: 469

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

440

55

5

229

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.764

Neutral control median absolute deviation, by plate: nmad	0.013

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	1.76%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	1.576

Positive control well median absolute deviation, by plate: pmad	0.034

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	21.331

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 229.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: MacDonald ML, Lamerdin J, Owens S, Keon BH, Bilter GK, Shang Z, Huang Z, Yu H, Dias J, Minami
T, Michnick SW, Westwick JK. Identifying off-target effects and hidden phenotypes of drugs in human cells. Nat
Chem Biol. 2006 Jun;2(6):329-37. Epub 2006 May 7. PubMed PMID: 16680159., Yu H, West M, Keon BH, Bilter
GK, Owens S, Lamerdin J, Westwick JK. Measuring drug action in the cellular context using protein-fragment
complementation assays. Assay Drug Dev Technol. 2003 Dec;l(6):811-22. PubMed PMID: 15090227.,
Kleinstreuer NC, Ceger P, Watt ED, Martin M, Houck K, Browne P, Thomas RS, Casey WM, Dix DJ, Allen D,
Sakamuru S, Xia M, Huang R, Judson R. Development and Validation of a Computational Model for Androgen
Receptor Activity. Chem Res Toxicol. 2017 Apr 17;30(4):946-964. doi: 10.1021/acs.chemrestox.6b00347. Epub
2016 Dec 9. PMID: 27933809; PMCID: PMC5396026.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 742

OT_ER_ERa ERa_0480

1.	General Information

1.1	Assay Title: Odyssey Thera 8-hour HEK293T Protein-Complementation Assay for Estrogen Receptor-alpha/-
alpha Homodimer

1.2	Assay Summary: OT_ER_ERaERa_0480 is a cell-based assay that uses HEK293T, a human kidney cell line, with
measurements taken at 8 hours after chemical dosing in a 384-well plate. OT_ER_ERaERa_0480 is one of one
assay component(s) measured or calculated from the OT_ER_ERaERa_0480 assay. It is designed to make
measurements of protein fragment complementation, a form of binding reporter, as detected with fluorescence
intensity signals by Protein-fragment Complementation technology. Data from the assay component
OT_ER_ERaERa_0480 was analyzed into 1 assay endpoint. This assay endpoint, OT_ER_ERaERa_0480, was
analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of binding reporter, measures of receptor for gain-of-signal activity can be used to
understand the binding at the pathway-level as they relate to the gene ESR1. Furthermore, this assay endpoint
can be referred to as a primary readout, because the performed assay has only produced 1 assay endpoint. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the nuclear
receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Odyssey Thera is a drug discovery company focusing on high-throughput screening with a
particular focus on developing novel protein:protein interaction assays using protein-fragment
complementation technology.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The protein-fragment complementation assay (PCA) methods are trademarked and
patented by Odyssey Thera, Inc.

1.9	Assay Throughput: 384-well plate. HEK-293T cells are seeded into 384-well microtiter plates and allowed to
adhere for 24 hours prior to 8 hour chemical exposures.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals produced from the protein fragment complementation
are indicative of a change in the receptor function and kinetics for the human estrogen receptor 1
[GeneSymbokESRl | GenelD:2099 | Uniprot_SwissProt_Accession:P03372],

The Odyssey Thera ER-alpha/-alpha ligand binding domain assay utilized the ability of the ER-alpha to
homodimerize upon ligand-binding with estrogenic compounds (Katzenellenbogen et al. 1993). This activity is
monitored via Protein-Fragment Complementation Assays (PCAs) which investigate the biochemical pathways
capable of bringing separate protein fragments into close proximity. Each ER-alpha protein contains a rationally
dissected fragment of a yellow-fluorescent protein (YFP) reporter enzyme. When the estrogenic pathway is
unimpeded, separate ER-alpha proteins form homodimers and the resulting YFP signal can be measured using

-------
fluorescence microscopy to screen a diverse chemical library for potential xenobiotic ligand-binding and ER-
alpha activation. Changes in protein complex interactions can be impacted by a variety of biochemical events
within a pathway, and this assay is designed to track xenobiotic changes at the level of cell functioning which
may occur at a number of points along the estrogen signaling pathway following an 8-hour incubation of cells
with test compound in 384-well plate, using DMSO as a negative control and baseline signal and 17-beta-
estradiol as a positive control and measure of 100 percent ligand-binding activity in ER-alpha.

2.2	Scientific Principles: Endocrine disrupting compounds bind to nuclear hormone receptors, leading to a diverse
array of transcriptional and signaling pathway alterations. These cell- and tissue-type specific changes affect
many aspects of human physiology, including those involved in inflammation, neonatal development and
oncogenesis. Among the human NRs, the estrogen receptor family is particularly susceptible to perturbation by
EDCs because of the ER LBD proclivity to bind a disparate set of small hydrophobic molecules commonly found
in nature and in man-made materials (Shanle and Xu 2010). Cell-based and in vivo experiments suggest that the
ER-alpha isoform is the primary mediator of estrogenic effects by EDCs on reproductive development and on
cell proliferation (Helguero et al. 2005). Thus, a highly sensitive assay that can detect ER-alpha LBD binding in
the context of a whole cell would serve as a powerful predictor of human-relevant estrogenic effects. The
Odyssey Thera Ligand Binding assays used protein-fragment complementation (PCA) to express a dose-
dependent homodimer which binds to estrogen receptor (ER) alpha expressed in human embryonic kidney cell
line HEK293T. This dimizeration and concurrent conformational changes brings into close proximity the fused
fragments of split-yellow fluorescent protein (YFP), leading to a dramatic increase in YFP intensity. This intensity
change is dose-dependent and shows saturation-binding that plateaus differently for agonists versus
antagonists. This assay is intended for use as a part of an integrated testing strategy, to screen a large structurally
diverse chemical library for compounds with the potential to interact with estrogen receptor alpha mediated
pathways and potentially affect endocrine systems in exposed populations. There is strong evidence that
estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse Outcome Pathway (AOP) leading
to reproductive dysfunction in oviparous vertebrates, and there is some evidence that estrogen receptor
activation is the MIE for putative AOPs leading to reduced survival due to renal failure and leading to skewed
sex ratios due to altered sexual differentiation in males (all AOPs currently under development). Chemical-
activity profiles derived from this assay can inform prioritization decisions for compound selection in more
resource intensive in vivo studies to further investigate the involvement of ER agonism in pathways leading to
hazardous outcomes in biological systems.

2.3	Experimental System: adherent HEK293T cell line used. This assay monitors the ligand binding domain (LBD) of
the wild type human ER-alpha (stably expressed in HEK293T) for xenoestrogenic activation. The HEK-293 cell
line are human embryonic epithelial kidney cells (of unknown parentage) transformed with sheared adenovirus
5 DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK-293T cells are derived
from the HEK293 cell line by the addition of the SV40 large T antigen that has been shown to increase vector
production of some viral vectors. HEK293T are reported to have relatively high transfection efficiencies when
compared to other cell lines (COS-7 and HepG2) (Dai et al. 2015) and it is among the most frequently utilized
cell lines for in small-scale protein production and in viral vector propagation using the transient transfection
method (Lin et al. 2014).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Constitutive
expression of Phase l/ll enzymes has not been well characterized in this HEK293T cell line and metabolic
activation/detoxification of test compounds is potentially limited.

2.5	Exposure Regime: Protein-fragment complementation assay, or PCA, is a method for the identification and
quantification of protein-protein interactions. In the PCA, the proteins of interest ("bait" and "prey") are each
covalently linked to fragments of a third protein (e.g. DHFR, which acts as a "reporter"). Interaction between
the bait and the prey proteins brings the fragments of the reporter protein in close proximity to allow them to
form a functional reporter protein whose activity can be measured. The OT ER alpha / ER alpha assay is a

-------
homodimer PCA of the ligand binding domain (amino acids 310-547) of human ER alpha stably expressed in
HEK293T cells. Cells are seeded into optical quality 384-well poly-D-lysine coated plates in phenol red-free
medium supplemented with 10 percent dextran-treated FBS and allowed to adhere for 24 hours prior to
treatment with compounds of interest or controls for 8 hours. Cells are fixed in 4 percent formaldehyde and
stained with Draq5 (BioStatus) to identify cells prior to signal detection. Fluorescent signal in the basal state of
the assay is very low and is primarily punctuate and cytoplasmic. Modulation of this assay is quantified as an
increase in mean fluorescence intensity in the nucleus (for agonists) or in the nucleus and cytoplasm
(antagonists) relative to vehicle controls, and can be quantified on a high content imaging device or a laser
scanning cytometer. The latter instrument is favored for this assay because of the rapid mode of data
acquisition, the ability to acquire data from the entire cell population in a well (as opposed to a fraction of the
well on the high content devices) and the larger dynamic range typically achieved with this instrument. To assess
sensitivity of the ER alpha/ER alpha LBD assay to ligand, cells were treated with the ER agonist 17 beta-estradiol
(E2) in 10-point concentration- response format for 8 hours and monitored response on a laser scanning plate
cytometer (Acumen eX3; TTP Lab Tech).

Baseline median absolute deviation for the assay (bmad): 1.665

Response cutoff threshold used to determine hit calls: 20

Detection technology used: Protein-fragment Complementation (Fluorescence)

2.6	Response: Estrogen signaling pathway protein fragment dimerization and enzyme formation in response to
estrogen receptor alpha ligand-binding is measured by changes in fluorescence intensity relative to DMSO
(neutral control) baseline.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast ER Pathway Model: Estrogen receptor assays used in ToxCast ER Pathway model

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.564 nM
Key positive control:

17b-Estradiol

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

188 nM
Neutral vehicle control:

DMSO

-------
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed into in the positive fitting direction (receptor gain-of-signal activity)
as a percent of 17b-Estradiol (positive control, 100 percent activation) and relative to DMSO, negative control
and signal baseline for activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise
maximum, by assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single-
or multiple-concentration negative control wells (wilt = m or o) by apid, well type, and concentration.),
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where

-------
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2053	Number of chemicals tested: 1857

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.859

Neutral control median absolute deviation, by plate: nmad	0.256

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	29.86%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	15.558

Positive control well median absolute deviation, by plate: pmad	1.1

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	12.146

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

-------
Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 90.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

-------
6.	Bibliography: MacDonald ML, Lamerdin J, Owens S, Keon BH, Bilter GK, Shang Z, Huang Z, Yu H, Dias J, Minami
T, Michnick SW, Westwick JK. Identifying off-target effects and hidden phenotypes of drugs in human cells. Nat
Chem Biol. 2006 Jun;2(6):329-37. Epub 2006 May 7. PubMed PMID: 16680159., Yu H, West M, Keon BH, Bilter
GK, Owens S, Lamerdin J, Westwick JK. Measuring drug action in the cellular context using protein-fragment
complementation assays. Assay Drug DevTechnol. 2003 Dec;l(6):811-22. PubMed PMID: 15090227., Bolt MJ,
Stossi F, Callison AM, Mancini MG, Dandekar R, Mancini MA. Systems level-based RNAi screening by high
content analysis identifies UBR5 as a regulator of estrogen receptor-? protein levels and activity. Oncogene.
2015 Jan 8;34(2):154-64. doi: 10.1038/onc.2013.550. Epub 2014 Jan 20. PMID: 24441042; PMCID:
PMC4871123., Stossi F, Bolt MJ, Ashcroft FJ, Lamerdin JE, Melnick JS, Powell RT, Dandekar RD, Mancini MG,
Walker CL, Westwick JK, Mancini MA. Defining estrogenic mechanisms of bisphenol A analogs through high
throughput microscopy-based contextual assays. Chem Biol. 2014 Jun 19;21(6):743-53. doi:
10.1016/j.chembiol.2014.03.013. Epub 2014 May 22. PMID: 24856822; PMCID: PMC4301571.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 743

OT_ER_ERa ERa_1440

1.	General Information

1.1	Assay Title: Odyssey Thera 24-hour HEK293T Protein-Complementation Assay for Estrogen Receptor-alpha/-
alpha Homodimer

1.2	Assay Summary: OT_ER_ERaERa_1440 is a cell-based assay that uses HEK293T, a human kidney cell line, with
measurements taken at 24 hours after chemical dosing in a 384-well plate. OT_ER_ERaERa_1440 is one of one
assay component(s) measured or calculated from the OT_ER_ERaERa_1440 assay. It is designed to make
measurements of protein fragment complementation, a form of binding reporter, as detected with fluorescence
intensity signals by Protein-fragment Complementation technology. Data from the assay component
OT_ER_ERaERa_1440 was analyzed into 1 assay endpoint. This assay endpoint, OT_ER_ERaERa_1440, was
analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of binding reporter, measures of receptor for gain-of-signal activity can be used to
understand the binding at the pathway-level as they relate to the gene ESR1. Furthermore, this assay endpoint
can be referred to as a primary readout, because the performed assay has only produced 1 assay endpoint. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the nuclear
receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Odyssey Thera is a drug discovery company focusing on high-throughput screening with a
particular focus on developing novel protein:protein interaction assays using protein-fragment
complementation technology.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The protein-fragment complementation assay (PCA) methods are trademarked and
patented by Odyssey Thera, Inc.

1.9	Assay Throughput: 384-well plate. HEK-293T cells are seeded into 384-well microtiter plates and allowed to
adhere for 24 hours prior to 24 hour chemical exposures.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals produced from the protein fragment complementation
are indicative of a change in the receptor function and kinetics for the human estrogen receptor 1
[GeneSymbokESRl | GenelD:2099 | Uniprot_SwissProt_Accession:P03372],

The Odyssey Thera ER-alpha/-alpha ligand binding domain assay utilized the ability of the ER-alpha to
homodimerize upon ligand-binding with estrogenic compounds (Katzenellenbogen et al. 1993). This activity is
monitored via Protein-Fragment Complementation Assays (PCAs) which investigate the biochemical pathways
capable of bringing separate protein fragments into close proximity. Each ER-alpha protein contains a rationally
dissected fragment of a yellow-fluorescent protein (YFP) reporter enzyme. When the estrogenic pathway is
unimpeded, separate ER-alpha proteins form homodimers and the resulting YFP signal can be measured using

-------
fluorescence microscopy to screen a diverse chemical library for potential xenobiotic ligand-binding and ER-
alpha activation. Changes in protein complex interactions can be impacted by a variety of biochemical events
within a pathway, and this assay is designed to track xenobiotic changes at the level of cell functioning which
may occur at a number of points along the estrogen signaling pathway following an 8-hour incubation of cells
with test compound in 384-well plate, using DMSO as a negative control and baseline signal and 17-beta-
estradiol as a positive control and measure of 100 percent ligand-binding activity in ER-alpha.

2.2	Scientific Principles: Endocrine disrupting compounds bind to nuclear hormone receptors, leading to a diverse
array of transcriptional and signaling pathway alterations. These cell- and tissue-type specific changes affect
many aspects of human physiology, including those involved in inflammation, neonatal development and
oncogenesis. Among the human NRs, the estrogen receptor family is particularly susceptible to perturbation by
EDCs because of the ER LBD proclivity to bind a disparate set of small hydrophobic molecules commonly found
in nature and in man-made materials (Shanle and Xu 2010). Cell-based and in vivo experiments suggest that the
ER-alpha isoform is the primary mediator of estrogenic effects by EDCs on reproductive development and on
cell proliferation (Helguero et al. 2005). Thus, a highly sensitive assay that can detect ER-alpha LBD binding in
the context of a whole cell would serve as a powerful predictor of human-relevant estrogenic effects. The
Odyssey Thera Ligand Binding assays used protein-fragment complementation (PCA) to express a dose-
dependent homodimer which binds to estrogen receptor (ER) alpha expressed in human embryonic kidney cell
line HEK293T. This dimizeration and concurrent conformational changes brings into close proximity the fused
fragments of split-yellow fluorescent protein (YFP), leading to a dramatic increase in YFP intensity. This intensity
change is dose-dependent and shows saturation-binding that plateaus differently for agonists versus
antagonists. This assay is intended for use as a part of an integrated testing strategy, to screen a large structurally
diverse chemical library for compounds with the potential to interact with estrogen receptor alpha mediated
pathways and potentially affect endocrine systems in exposed populations. There is strong evidence that
estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse Outcome Pathway (AOP) leading
to reproductive dysfunction in oviparous vertebrates, and there is some evidence that estrogen receptor
activation is the MIE for putative AOPs leading to reduced survival due to renal failure and leading to skewed
sex ratios due to altered sexual differentiation in males (all AOPs currently under development). Chemical-
activity profiles derived from this assay can inform prioritization decisions for compound selection in more
resource intensive in vivo studies to further investigate the involvement of ER agonism in pathways leading to
hazardous outcomes in biological systems.

2.3	Experimental System: adherent HEK293T cell line used. This assay monitors the ligand binding domain (LBD) of
the wild type human ER-alpha (stably expressed in HEK293T) for xenoestrogenic activation. The HEK-293 cell
line are human embryonic epithelial kidney cells (of unknown parentage) transformed with sheared adenovirus
5 DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK-293T cells are derived
from the HEK293 cell line by the addition of the SV40 large T antigen that has been shown to increase vector
production of some viral vectors. HEK293T are reported to have relatively high transfection efficiencies when
compared to other cell lines (COS-7 and HepG2) (Dai et al. 2015) and it is among the most frequently utilized
cell lines for in small-scale protein production and in viral vector propagation using the transient transfection
method (Lin et al. 2014).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Constitutive
expression of Phase l/ll enzymes has not been well characterized in this HEK293T cell line and metabolic
activation/detoxification of test compounds is potentially limited.

2.5	Exposure Regime: Protein-fragment complementation assay, or PCA, is a method for the identification and
quantification of protein-protein interactions. In the PCA, the proteins of interest ("bait" and "prey") are each
covalently linked to fragments of a third protein (e.g. DHFR, which acts as a "reporter"). Interaction between
the bait and the prey proteins brings the fragments of the reporter protein in close proximity to allow them to
form a functional reporter protein whose activity can be measured. The OT ER alpha / ER alpha assay is a

-------
homodimer PCA of the ligand binding domain (amino acids 310-547) of human ER alpha stably expressed in
HEK293T cells. Cells are seeded into optical quality 384-well poly-D-lysine coated plates in phenol red-free
medium supplemented with 10 percent dextran-treated FBS and allowed to adhere for 24 hours prior to
treatment with compounds of interest or controls for 24 hours. Cells are fixed in 4 percent formaldehyde and
stained with Draq5 (BioStatus) to identify cells prior to signal detection. Fluorescent signal in the basal state of
the assay is very low and is primarily punctuate and cytoplasmic. Modulation of this assay is quantified as an
increase in mean fluorescence intensity in the nucleus (for agonists) or in the nucleus and cytoplasm
(antagonists) relative to vehicle controls, and can be quantified on a high content imaging device or a laser
scanning cytometer. The latter instrument is favored for this assay because of the rapid mode of data
acquisition, the ability to acquire data from the entire cell population in a well (as opposed to a fraction of the
well on the high content devices) and the larger dynamic range typically achieved with this instrument. To assess
sensitivity of the ER alpha/ER alpha LBD assay to ligand, cells were treated with the ER agonist 17 beta-estradiol
(E2) in 10-point concentration- response format for 24 hours and monitored response on a laser scanning plate
cytometer (Acumen eX3; TTP Lab Tech).

Baseline median absolute deviation for the assay (bmad): 0.64

Response cutoff threshold used to determine hit calls: 20

Detection technology used: Protein-fragment Complementation (Fluorescence)

2.6	Response: Estrogen signaling pathway protein fragment dimerization and enzyme formation in response to
estrogen receptor alpha ligand-binding is measured by changes in fluorescence intensity relative to DMSO
(neutral control) baseline.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast ER Pathway Model: Estrogen receptor assays used in ToxCast ER Pathway model

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.564 nM
Key positive control:

17b-Estradiol

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

188 nM
Neutral vehicle control:

DMSO

-------
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed into in the positive fitting direction (receptor gain-of-signal activity)
as a percent of 17b-Estradiol (positive control, 100 percent activation) and relative to DMSO, negative control
and signal baseline for activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise
maximum, by assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single-
or multiple-concentration negative control wells (wilt = m or o) by apid, well type, and concentration.),
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where

-------
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2053	Number of chemicals tested: 1857

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.943

Neutral control median absolute deviation, by plate: nmad	0.25

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	25.46%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	30.361

Positive control well median absolute deviation, by plate: pmad	1.813

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	17.447

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

-------
Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 107.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

-------
6.	Bibliography: MacDonald ML, Lamerdin J, Owens S, Keon BH, Bilter GK, Shang Z, Huang Z, Yu H, Dias J, Minami
T, Michnick SW, Westwick JK. Identifying off-target effects and hidden phenotypes of drugs in human cells. Nat
Chem Biol. 2006 Jun;2(6):329-37. Epub 2006 May 7. PubMed PMID: 16680159., Yu H, West M, Keon BH, Bilter
GK, Owens S, Lamerdin J, Westwick JK. Measuring drug action in the cellular context using protein-fragment
complementation assays. Assay Drug DevTechnol. 2003 Dec;l(6):811-22. PubMed PMID: 15090227., Bolt MJ,
Stossi F, Callison AM, Mancini MG, Dandekar R, Mancini MA. Systems level-based RNAi screening by high
content analysis identifies UBR5 as a regulator of estrogen receptor-? protein levels and activity. Oncogene.
2015 Jan 8;34(2):154-64. doi: 10.1038/onc.2013.550. Epub 2014 Jan 20. PMID: 24441042; PMCID:
PMC4871123., Stossi F, Bolt MJ, Ashcroft FJ, Lamerdin JE, Melnick JS, Powell RT, Dandekar RD, Mancini MG,
Walker CL, Westwick JK, Mancini MA. Defining estrogenic mechanisms of bisphenol A analogs through high
throughput microscopy-based contextual assays. Chem Biol. 2014 Jun 19;21(6):743-53. doi:
10.1016/j.chembiol.2014.03.013. Epub 2014 May 22. PMID: 24856822; PMCID: PMC4301571.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 744

OT_ER_ERaERb_0480

1.	General Information

1.1	Assay Title: Odyssey Thera 8-hour HEK293T Protein-Complementation Assay for Estrogen Receptor-alpha/-beta
Homodimer

1.2	Assay Summary: OT_ER_ERaERb_0480 is a cell-based assay that uses HEK293T, a human kidney cell line, with
measurements taken at 8 hours after chemical dosing in a 384-well plate. OT_ER_ERaERb_0480 is one of one
assay component(s) measured or calculated from the OT_ER_ERaERb_0480 assay. It is designed to make
measurements of protein fragment complementation, a form of binding reporter, as detected with fluorescence
intensity signals by Protein-fragment Complementation technology. Data from the assay component
OT_ER_ERaERb_0480 was analyzed into 1 assay endpoint. This assay endpoint, OT_ER_ERaERb_0480, was
analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of binding reporter, measures of receptor for gain-of-signal activity can be used to
understand the binding at the pathway-level as they relate to the gene ESR1 and ESR2. Furthermore, this assay
endpoint can be referred to as a primary readout, because the performed assay has only produced 1 assay
endpoint. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
nuclear receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Odyssey Thera is a drug discovery company focusing on high-throughput screening with a
particular focus on developing novel protein:protein interaction assays using protein-fragment
complementation technology.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The protein-fragment complementation assay (PCA) methods are trademarked and
patented by Odyssey Thera, Inc.

1.9	Assay Throughput: 384-well plate. HEK-293T cells are seeded into 384-well microtiter plates and allowed to
adhere for 24 hours prior to 8 hour chemical exposures.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals produced from the protein fragment complementation
are indicative of a change in the receptor function and kinetics for the human estrogen receptor 1 and estrogen
receptor 2 (ER beta) [GeneSymbol:ESRl & ESR2 | GenelD:2099 & 2100 | Uniprot_SwissProt_Accession:P03372
&Q92731],

The Odyssey Thera ER-alpha/-beta ligand binding domain assay utilized the ability of the ER-alpha to form
heterodimers following ligand-binding with estrogenic compounds. This activity is monitored via Protein-
Fragment Complementation Assays (PCAs) which investigate the biochemical pathways capable of bringing
separate protein fragments into close proximity. Each ER-alpha and ER-beta protein contains a fragment of a
reporter enzyme (YFP) and when both proteins come in contact to form homo- or heterodimers, the resulting

-------
YFP signal can be measured using fluorescence microscopy and used to screen a diverse chemical library for
potential xenobiotic ligand-binding and ER activation. Changes in protein complex interactions can be impacted
by a variety of biochemical events within a pathway, and this assay is designed to track xenobiotic changes at
the level of cell functioning which may occur at a number of points along the estrogen signaling pathway
following an 8 hour incubation of cells with test compound in 384-well plate, using DMSO as a negative control
and baseline signal and 17-beta-estradiol as a positive control and measure of 100 percent ligand-binding
activity in ER alpha/-beta.

2.2	Scientific Principles: Endocrine disrupting compounds bind to nuclear hormone receptors, leading to a diverse
array of transcriptional and signaling pathway alterations. These cell- and tissue-type specific changes affect
many aspects of human physiology, including those involved in inflammation, neonatal development and
oncogenesis. Among the human NRs, the estrogen receptor family is particularly susceptible to perturbation by
EDCs because of the ER LBD proclivity to bind a disparate set of small hydrophobic molecules commonly found
in nature and in man-made materials (Shanle and Xu 2010). The estrogen receptor is expressed in two forms,
ER-alpha and ER-beta which play different roles in mediating the actions of estrogenic compounds. Multiple
studies have determined that the two isoforms can form functional homo- and hetero-dimers in vitro and in
vivo which are capable of binding DNA (Papoutsi et al. 2009) and initiating transcription of target genes (Cowley
et al. 1997). Furthermore, ER homo- and heterodimers display ligand-selective activity (Powell and Xu 2008)
leading in turn to a unique but overlapping set of dimer-mediated transcriptional changes (Li et al. 2004, Monroe
et al. 2005). Thus, a complete understanding of the potential estrogenic effects of EDCs requires the
comprehensive profiling of the three physiological dimers. To assess the activity of the ER-alpha/-beta
heterodimer, this assay was designed to utilize the ability of the coexpressed ER-alpha and ER-beta LBDs to
heterodimerize upon ligand-binding with estrogenic compounds. This dimerization and concurrent
conformational changes brings into close proximity the fused fragments of split-YFP, leading to a dramatic
increase in YFP intensity. This intensity change is dose-dependent and shows saturation-binding that plateaus
differently for agonists versus antagonists. This assay is intended for use as a part of an integrated testing
strategy, to screen a large structurally diverse chemical library for compounds with the potential to interact with
estrogen receptor alpha mediated pathways and potentially affect endocrine systems in exposed populations.
There is strong evidence that estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse
Outcome Pathway (AOP) leading to reproductive dysfunction in oviparous vertebrates, and there is some
evidence that estrogen receptor activation is the MIE for putative AOPs leading to reduced survival due to renal
failure and leading to skewed sex ratios due to altered sexual differentiation in males (all AOPs currently under
development). Chemical-activity profiles derived from this assay can inform prioritization decisions for
compound selection in more resource intensive in vivo studies to further investigate the involvement of ER
agonism in pathways leading to hazardous outcomes in biological systems.

2.3	Experimental System: adherent HEK293T cell line used. This assay monitors the ligand binding domain (LBD) of
the wild type human ER (isoforms alpha and -beta, stably expressed in HEK293T cells) for xenoestrogenic
activation. The HEK-293 cell line are human embryonic epithelial kidney cells (of unknown parentage)
transformed with sheared adenovirus 5 DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation
incorporated approximately 4.5 kilobases from the viral genome into human chromosome 19 of the HEK cells,
and subsequent cytogenetic characterization established that the 293 line is pseudotriploid (Bylund et al. 2004).
HEK-293T cells are derived from the HEK293 cell line by the addition of the SV40 large T antigen that has been
shown to increase vector production of some viral vectors. HEK293T are reported to have relatively high
transfection efficiencies when compared to other cell lines COS-7 and HepG2 cell lines (Dai et al. 2015) and it is
among the most frequently utilized cell lines for in small-scale protein production and in viral vector propagation
using the transient transfection method (Lin et al. 2014).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Constitutive
expression of Phase l/ll enzymes has not been well characterized in this HEK293T cell line and metabolic
activation/detoxification of test compounds is potentially limited.

-------
2.5 Exposure Regime: Protein-fragment complementation assay, or PCA, is a method for the identification and
quantification of protein-protein interactions. In the PCA, the proteins of interest ("bait" and "prey") are each
covalently linked to fragments of a third protein (e.g. DHFR, which acts as a "reporter"). Interaction between
the bait and the prey proteins brings the fragments of the reporter protein in close proximity to allow them to
form a functional reporter protein whose activity can be measured. The OT ER alpha / ER beta assay is a
heterodimer PCA of the ligand binding domain (LBD) of ER alpha (amino acids 310-547) and human ER beta
(amino acids 263-489) stably expressed in HEK293T cells. ER alpha/ER beta LBD cells are seeded into optical
quality 384-well poly-D-lysine coated plates in phenol red-free medium supplemented with 10 percent dextran-
treated FBS and allowed to adhere for 24 hours prior to treatment with compounds of interest or controls for 8
hours. Cells are fixed in 4 percent formaldehyde and stained with Draq5 (BioStatus) to identify cells prior to
signal detection. Fluorescent signal in the basal state of the assay is very low and is primarily punctuate and
cytoplasmic. Modulation of this assay is quantified as an increase in mean fluorescence intensity in the nucleus
(for agonists) or in the nucleus and cytoplasm (antagonists) relative to vehicle controls, and can be quantified
on a high content imaging device or a laser scanning cytometer. The latter instrument is favored for this assay
because of the rapid mode of data acquisition, the ability to acquire data from the entire cell population in a
well (as opposed to a fraction of the well on the high content devices) and the larger dynamic range typically
achieved with this instrument. To assess sensitivity of the ER alpha/ER beta LBD assay to ligand, cells were
treated with the ER agonist 17- beta-estradiol (E2) in 10-point concentration- response format for 8 hours and
monitored response on a laser scanning plate cytometer (Acumen eX3; TTP Lab Tech).

Baseline median absolute deviation for the assay (bmad): 4.66

Response cutoff threshold used to determine hit calls: 23.3

Detection technology used: Protein-fragment Complementation (Fluorescence)

2.6	Response: Estrogen signaling pathway protein fragment dimerization and enzyme formation in response to
estrogen receptor (-alpha /-beta) ligand-binding is measured by changes in fluorescence intensity relative to
DMSO (neutral control) baseline.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast ER Pathway Model: Estrogen receptor assays used in ToxCast ER Pathway model

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.564 nM
Key positive control:

17b-Estradiol

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

188 nM
Neutral vehicle control:

DMSO

3.

Additionally, this assay was annotated to the intended target family of nuclear receptor.
Data Interpretation

-------
The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed into in the positive fitting direction (receptor gain-of-signal activity)
as a percent of 17b-Estradiol (positive control, 100 percent activation) and relative to DMSO, negative control
and signal baseline for activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise
maximum, by assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single-
or multiple-concentration negative control wells (wilt = m or o) by apid, well type, and concentration.),
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with

-------
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2053	Number of chemicals tested: 1857

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
330

Inactive hit count: 0
-------
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	8.786

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

0.875
0.126
14.95%

3.22
0.213

NA

-------
Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 116.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

-------
• Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: MacDonald ML, Lamerdin J, Owens S, Keon BH, Bilter GK, Shang Z, Huang Z, Yu H, Dias J, Minami
T, Michnick SW, Westwick JK. Identifying off-target effects and hidden phenotypes of drugs in human cells. Nat
Chem Biol. 2006 Jun;2(6):329-37. Epub 2006 May 7. PubMed PMID: 16680159., Yu H, West M, Keon BH, Bilter
GK, Owens S, Lamerdin J, Westwick JK. Measuring drug action in the cellular context using protein-fragment
complementation assays. Assay Drug DevTechnol. 2003 Dec;l(6):811-22. PubMed PMID: 15090227., Bolt MJ,
Stossi F, Callison AM, Mancini MG, Dandekar R, Mancini MA. Systems level-based RNAi screening by high
content analysis identifies UBR5 as a regulator of estrogen receptor-? protein levels and activity. Oncogene.
2015 Jan 8;34(2):154-64. doi: 10.1038/onc.2013.550. Epub 2014 Jan 20. PMID: 24441042; PMCID:
PMC4871123., Stossi F, Bolt MJ, Ashcroft FJ, Lamerdin JE, Melnick JS, Powell RT, Dandekar RD, Mancini MG,
Walker CL, Westwick JK, Mancini MA. Defining estrogenic mechanisms of bisphenol A analogs through high
throughput microscopy-based contextual assays. Chem Biol. 2014 Jun 19;21(6):743-53. doi:
10.1016/j.chembiol.2014.03.013. Epub 2014 May 22. PMID: 24856822; PMCID: PMC4301571.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 745

OT_ER_ERaERb_1440

1.	General Information

1.1	Assay Title: Odyssey Thera 24-hour HEK293T Protein-Complementation Assay for Estrogen Receptor-alpha/-
beta Homodimer

1.2	Assay Summary: OT_ER_ERaERb_1440 is a cell-based assay that uses HEK293T, a human kidney cell line, with
measurements taken at 24 hours after chemical dosing in a 384-well plate. OT_ER_ERaERb_1440 is one of one
assay component(s) measured or calculated from the OT_ER_ERaERb_1440 assay. It is designed to make
measurements of protein fragment complementation, a form of binding reporter, as detected with fluorescence
intensity signals by Protein-fragment Complementation technology. Data from the assay component
OT_ER_ERaERb_1440 was analyzed into 1 assay endpoint. This assay endpoint, OT_ER_ERaERb_1440, was
analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of binding reporter, measures of receptor for gain-of-signal activity can be used to
understand the binding at the pathway-level as they relate to the gene ESR1 and ESR2. Furthermore, this assay
endpoint can be referred to as a primary readout, because the performed assay has only produced 1 assay
endpoint. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
nuclear receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Odyssey Thera is a drug discovery company focusing on high-throughput screening with a
particular focus on developing novel protein:protein interaction assays using protein-fragment
complementation technology.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The protein-fragment complementation assay (PCA) methods are trademarked and
patented by Odyssey Thera, Inc.

1.9	Assay Throughput: 384-well plate. HEK-293T cells are seeded into 384-well microtiter plates and allowed to
adhere for 24 hours prior to 24 hour chemical exposures.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals produced from the protein fragment complementation
are indicative of a change in the receptor function and kinetics for the human estrogen receptor 1 and estrogen
receptor 2 (ER beta) [GeneSymbol:ESRl & ESR2 | GenelD:2099 & 2100 | Uniprot_SwissProt_Accession:P03372
&Q92731],

The Odyssey Thera ER-alpha/-beta ligand binding domain assay utilized the ability of the ER-alpha to form
heterodimers following ligand-binding with estrogenic compounds. This activity is monitored via Protein-
Fragment Complementation Assays (PCAs) which investigate the biochemical pathways capable of bringing
separate protein fragments into close proximity. Each ER-alpha and ER-beta protein contains a fragment of a
reporter enzyme (YFP) and when both proteins come in contact to form homo- or heterodimers, the resulting

-------
YFP signal can be measured using fluorescence microscopy and used to screen a diverse chemical library for
potential xenobiotic ligand-binding and ER activation. Changes in protein complex interactions can be impacted
by a variety of biochemical events within a pathway, and this assay is designed to track xenobiotic changes at
the level of cell functioning which may occur at a number of points along the estrogen signaling pathway
following an 8 hour incubation of cells with test compound in 384-well plate, using DMSO as a negative control
and baseline signal and 17-beta-estradiol as a positive control and measure of 100 percent ligand-binding
activity in ER alpha/-beta.

2.2	Scientific Principles: Endocrine disrupting compounds bind to nuclear hormone receptors, leading to a diverse
array of transcriptional and signaling pathway alterations. These cell- and tissue-type specific changes affect
many aspects of human physiology, including those involved in inflammation, neonatal development and
oncogenesis. Among the human NRs, the estrogen receptor family is particularly susceptible to perturbation by
EDCs because of the ER LBD proclivity to bind a disparate set of small hydrophobic molecules commonly found
in nature and in man-made materials (Shanle and Xu 2010). The estrogen receptor is expressed in two forms,
ER-alpha and ER-beta which play different roles in mediating the actions of estrogenic compounds. Multiple
studies have determined that the two isoforms can form functional homo- and hetero-dimers in vitro and in
vivo which are capable of binding DNA (Papoutsi et al. 2009) and initiating transcription of target genes (Cowley
et al. 1997). Furthermore, ER homo- and heterodimers display ligand-selective activity (Powell and Xu 2008)
leading in turn to a unique but overlapping set of dimer-mediated transcriptional changes (Li et al. 2004, Monroe
et al. 2005). Thus, a complete understanding of the potential estrogenic effects of EDCs requires the
comprehensive profiling of the three physiological dimers. To assess the activity of the ER-alpha/-beta
heterodimer, this assay was designed to utilize the ability of the coexpressed ER-alpha and ER-beta LBDs to
heterodimerize upon ligand-binding with estrogenic compounds. This dimerization and concurrent
conformational changes brings into close proximity the fused fragments of split-YFP, leading to a dramatic
increase in YFP intensity. This intensity change is dose-dependent and shows saturation-binding that plateaus
differently for agonists versus antagonists. This assay is intended for use as a part of an integrated testing
strategy, to screen a large structurally diverse chemical library for compounds with the potential to interact with
estrogen receptor alpha mediated pathways and potentially affect endocrine systems in exposed populations.
There is strong evidence that estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse
Outcome Pathway (AOP) leading to reproductive dysfunction in oviparous vertebrates, and there is some
evidence that estrogen receptor activation is the MIE for putative AOPs leading to reduced survival due to renal
failure and leading to skewed sex ratios due to altered sexual differentiation in males (all AOPs currently under
development). Chemical-activity profiles derived from this assay can inform prioritization decisions for
compound selection in more resource intensive in vivo studies to further investigate the involvement of ER
agonism in pathways leading to hazardous outcomes in biological systems.

2.3	Experimental System: adherent HEK293T cell line used. This assay monitors the ligand binding domain (LBD) of
the wild type human ER (isoforms alpha and -beta, stably expressed in HEK293T cells) for xenoestrogenic
activation. The HEK-293 cell line are human embryonic epithelial kidney cells (of unknown parentage)
transformed with sheared adenovirus 5 DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation
incorporated approximately 4.5 kilobases from the viral genome into human chromosome 19 of the HEK cells,
and subsequent cytogenetic characterization established that the 293 line is pseudotriploid (Bylund et al. 2004).
HEK-293T cells are derived from the HEK293 cell line by the addition of the SV40 large T antigen that has been
shown to increase vector production of some viral vectors. HEK293T are reported to have relatively high
transfection efficiencies when compared to other cell lines COS-7 and HepG2 cell lines (Dai et al. 2015) and it is
among the most frequently utilized cell lines for in small-scale protein production and in viral vector propagation
using the transient transfection method (Lin et al. 2014).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Constitutive
expression of Phase l/ll enzymes has not been well characterized in this HEK293T cell line and metabolic
activation/detoxification of test compounds is potentially limited.

-------
2.5 Exposure Regime: Protein-fragment complementation assay, or PCA, is a method for the identification and
quantification of protein-protein interactions. In the PCA, the proteins of interest ("bait" and "prey") are each
covalently linked to fragments of a third protein (e.g. DHFR, which acts as a "reporter"). Interaction between
the bait and the prey proteins brings the fragments of the reporter protein in close proximity to allow them to
form a functional reporter protein whose activity can be measured. The OT ER alpha / ER beta assay is a
heterodimer PCA of the ligand binding domain (LBD) of ER alpha (amino acids 310-547) and human ER beta
(amino acids 263-489) stably expressed in HEK293T cells. ER alpha/ER beta LBD cells are seeded into optical
quality 384-well poly-D-lysine coated plates in phenol red-free medium supplemented with 10 percent dextran-
treated FBS and allowed to adhere for 24 hours prior to treatment with compounds of interest or controls for
24 hours. Cells are fixed in 4 percent formaldehyde and stained with Draq5 (BioStatus) to identify cells prior to
signal detection. Fluorescent signal in the basal state of the assay is very low and is primarily punctuate and
cytoplasmic. Modulation of this assay is quantified as an increase in mean fluorescence intensity in the nucleus
(for agonists) or in the nucleus and cytoplasm (antagonists) relative to vehicle controls, and can be quantified
on a high content imaging device or a laser scanning cytometer. The latter instrument is favored for this assay
because of the rapid mode of data acquisition, the ability to acquire data from the entire cell population in a
well (as opposed to a fraction of the well on the high content devices) and the larger dynamic range typically
achieved with this instrument. To assess sensitivity of the ER alpha/ER beta LBD assay to ligand, cells were
treated with the ER agonist 17- beta-estradiol (E2) in 10-point concentration- response format for 24 hours and
monitored response on a laser scanning plate cytometer (Acumen eX3; TTP Lab Tech).

Baseline median absolute deviation for the assay (bmad): 1.913

Response cutoff threshold used to determine hit calls: 20

Detection technology used: Protein-fragment Complementation (Fluorescence)

2.6	Response: Estrogen signaling pathway protein fragment dimerization and enzyme formation in response to
estrogen receptor (-alpha /-beta) ligand-binding is measured by changes in fluorescence intensity relative to
DMSO (neutral control) baseline.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast ER Pathway Model: Estrogen receptor assays used in ToxCast ER Pathway model

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.564 nM
Key positive control:

17b-Estradiol

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

188 nM
Neutral vehicle control:

DMSO

3.

Additionally, this assay was annotated to the intended target family of nuclear receptor.
Data Interpretation

-------
The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed into in the positive fitting direction (receptor gain-of-signal activity)
as a percent of 17b-Estradiol (positive control, 100 percent activation) and relative to DMSO, negative control
and signal baseline for activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise
maximum, by assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single-
or multiple-concentration negative control wells (wilt = m or o) by apid, well type, and concentration.),
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with

-------
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2053	Number of chemicals tested: 1857

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
396

Inactive hit count: 0
-------
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	12.916

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

0.995
0.14
14.17%

6.786
0.428

NA

-------
Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 155.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

-------
• Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: MacDonald ML, Lamerdin J, Owens S, Keon BH, Bilter GK, Shang Z, Huang Z, Yu H, Dias J, Minami
T, Michnick SW, Westwick JK. Identifying off-target effects and hidden phenotypes of drugs in human cells. Nat
Chem Biol. 2006 Jun;2(6):329-37. Epub 2006 May 7. PubMed PMID: 16680159., Yu H, West M, Keon BH, Bilter
GK, Owens S, Lamerdin J, Westwick JK. Measuring drug action in the cellular context using protein-fragment
complementation assays. Assay Drug DevTechnol. 2003 Dec;l(6):811-22. PubMed PMID: 15090227., Bolt MJ,
Stossi F, Callison AM, Mancini MG, Dandekar R, Mancini MA. Systems level-based RNAi screening by high
content analysis identifies UBR5 as a regulator of estrogen receptor-? protein levels and activity. Oncogene.
2015 Jan 8;34(2):154-64. doi: 10.1038/onc.2013.550. Epub 2014 Jan 20. PMID: 24441042; PMCID:
PMC4871123., Stossi F, Bolt MJ, Ashcroft FJ, Lamerdin JE, Melnick JS, Powell RT, Dandekar RD, Mancini MG,
Walker CL, Westwick JK, Mancini MA. Defining estrogenic mechanisms of bisphenol A analogs through high
throughput microscopy-based contextual assays. Chem Biol. 2014 Jun 19;21(6):743-53. doi:
10.1016/j.chembiol.2014.03.013. Epub 2014 May 22. PMID: 24856822; PMCID: PMC4301571.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 746

OT_ER_ERbERb_0480

1.	General Information

1.1	Assay Title: Odyssey Thera 8-hour HEK293T Protein-Complementation Assay for Estrogen Receptor-beta/-beta
Homodimer

1.2	Assay Summary: OT_ER_ERbERb_0480 is a cell-based assay that uses HEK293T, a human kidney cell line, with
measurements taken at 8 hours after chemical dosing in a 384-well plate. OT_ER_ERbERb_0480 is one of one
assay component(s) measured or calculated from the OT_ER_ERbERb_0480 assay. It is designed to make
measurements of protein fragment complementation, a form of binding reporter, as detected with fluorescence
intensity signals by Protein-fragment Complementation technology. Data from the assay component
OT_ER_ERbERb_0480 was analyzed into 1 assay endpoint. This assay endpoint, OT_ER_ERbERb_0480, was
analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of binding reporter, measures of receptor for gain-of-signal activity can be used to
understand the binding at the pathway-level as they relate to the gene ESR2. Furthermore, this assay endpoint
can be referred to as a primary readout, because the performed assay has only produced 1 assay endpoint. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the nuclear
receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Odyssey Thera is a drug discovery company focusing on high-throughput screening with a
particular focus on developing novel protein:protein interaction assays using protein-fragment
complementation technology.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The protein-fragment complementation assay (PCA) methods are trademarked and
patented by Odyssey Thera, Inc.

1.9	Assay Throughput: 384-well plate. HEK-293T cells are seeded into 384-well microtiter plates and allowed to
adhere for 24 hours prior to 8 hour chemical exposures.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals produced from the protein fragment complementation
are indicative of a change in the receptor function and kinetics for the human estrogen receptor 2 (ER beta)
[GeneSymbol:ESR2 | GenelD:2100 | Uniprot_SwissProt_Accession:Q92731],

The Odyssey Thera ER-beta/-beta ligand binding domain assay utilized the ability of the ER-beta to
homodimerize upon ligand-binding with estrogenic compounds. This activity is monitored via Protein-Fragment
Complementation Assays (PCAs) which investigate the biochemical pathways capable of bringing separate
protein fragments into close proximity. Each ER-beta protein contains a rationally dissected fragment of a
yellow-fluorescent protein (YFP) reporter enzyme. When the estrogenic pathway is unimpeded, separate ER-
beta proteins form homodimers and the resulting YFP signal can be measured using fluorescence microscopy to

-------
screen a diverse chemical library for potential xenobiotic ligand-binding and ER-beta interactions. Changes in
protein complex interactions can be impacted by a variety of biochemical events within a pathway, and this
assay is designed to track xenobiotic changes at the level of cell functioning which may occur at a number of
points along the estrogen signaling pathway following a 24-hour incubation of cells with test compound in 384-
well plate, using DMSO as a negative control and baseline signal and 17-beta-estradiol as a positive control and
measure of 100 percent ligand-binding activity in ER-beta.

2.2	Scientific Principles: Endocrine disrupting compounds bind to nuclear hormone receptors, leading to a diverse
array of transcriptional and signaling pathway alterations. These cell- and tissue-type specific changes affect
many aspects of human physiology, including those involved in inflammation, neonatal development and
oncogenesis. Among the human NRs, the estrogen receptor family is particularly susceptible to perturbation by
EDCs because of the ER LBD proclivity to bind a disparate set of small hydrophobic molecules commonly found
in nature and in man-made materials (Shanle and Xu 2010). The estrogen receptor is expressed in two forms,
ER-alpha and ER-beta which play different roles in mediating the actions of estrogenic compounds. Cell-based
and in vivo experiments suggest that the ER-alpha isoform is the primary mediator of estrogenic effects by EDCs
on reproductive development and on cell proliferation, while ER-beta is thought to inhibit estrogen dependent
cell growth, especially in breast cancer cells (Helguero et al. 2005, Hewitt et al. 2005, Jurvanen et al. 2000).
Several studies have associated a loss of ER-beta or a decreased ratio of ER-beta/ER-alpha with other cancer
types, including ovarian and colorectal cancers (Zhao et al. 2008), suggesting a tumor suppressor role for ER-
beta in several cell types. Several isotype-selective ligands that bind one receptor with higher affinity than the
other have been identified (Kraichely et al. 2000, Sun et al. 1999) that produce distinct physiological effects
when tested in animal models (Zhao et al. 2008). Therefore, a highly sensitive assay that can detect ER-beta
transcriptional changes in the context of a whole cell would be a useful tool to differentiate EDCs that
preferentially activate ER-beta-specific pathways. The OT ER-beta/ER-beta LBD PCA utilizes the ability of the ER-
beta LBD to homodimerize upon binding to estrogenic compounds. This dimizeration and concurrent
conformational changes brings into close proximity the fused fragments of split-YFP, leading to a dramatic
increase in YFP intensity. This intensity change is dose-dependent and shows saturation-binding that plateaus
differently for agonists versus antagonists. This assay is intended for use as a part of an integrated testing
strategy, to screen a large structurally diverse chemical library for compounds with the potential to interact with
estrogen receptor alpha mediated pathways and potentially affect endocrine systems in exposed populations.
There is strong evidence that estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse
Outcome Pathway (AOP) leading to reproductive dysfunction in oviparous vertebrates, and there is some
evidence that estrogen receptor activation is the MIE for putative AOPs leading to reduced survival due to renal
failure and leading to skewed sex ratios due to altered sexual differentiation in males (all AOPs currently under
development). Chemical-activity profiles derived from this assay can inform prioritization decisions for
compound selection in more resource intensive in vivo studies to further investigate the involvement of ER
agonism in pathways leading to hazardous outcomes in biological systems.

2.3	Experimental System: adherent HEK293T cell line used. This assay monitors the ligand binding domain (LBD) of
the wild type human ER-beta (stably expressed in HEK293T cells) for xenoestrogenic activation. The HEK-293
cell line are human embryonic epithelial kidney cells (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated
approximately 4.5 kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent
cytogenetic characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK-293T cells
are derived from the HEK293 cell line by the addition of theSV40 large T antigen that has been shown to increase
vector production of some viral vectors. HEK293T are reported to have relatively high transfection efficiencies
when compared to other cell lines (COS-7 and HepG2) (Dai et al. 2015) and it is among the most frequently
utilized cell lines for in small-scale protein production and in viral vector propagation using the transient
transfection method (Lin et al. 2014).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Constitutive
expression of Phase l/ll enzymes has not been well characterized in this HEK293T cell line and metabolic
activation/detoxification of test compounds is potentially limited.

-------
2.5 Exposure Regime: Protein-fragment complementation assay, or PCA, is a method for the identification and
quantification of protein-protein interactions. In the PCA, the proteins of interest ("bait" and "prey") are each
covalently linked to fragments of a third protein (e.g. DHFR, which acts as a "reporter"). Interaction between
the bait and the prey proteins brings the fragments of the reporter protein in close proximity to allow them to
form a functional reporter protein whose activity can be measured. The OT ER beta / ER beta assay is a
homodimer PCA of the ligand binding domain (LBD; amino acids 310-547) of human ER beta stably expressed in
HEK293T cells. Cells are seeded into optical quality 384-well poly-D-lysine coated plates in phenol red-free
medium supplemented with 10 percent dextran-treated FBS and allowed to adhere for 8 hours prior to
treatment with compounds of interest or controls for 8 hours. Cells are fixed in 4 percent formaldehyde and
stained with Draq5 (BioStatus) to identify cells prior to signal detection. Fluorescent signal in the basal state of
the assay is very low and is primarily punctuate and cytoplasmic. Modulation of this assay is quantified as an
increase in mean fluorescence intensity in the nucleus (for agonists) or in the nucleus and cytoplasm
(antagonists) relative to vehicle controls, and can be quantified on a high content imaging device or a laser
scanning cytometer. The latter instrument is favored for this assay because of the rapid mode of data
acquisition, the ability to acquire data from the entire cell population in a well (as opposed to a fraction of the
well on the high content devices) and the larger dynamic range typically achieved with this instrument. To assess
sensitivity of the ER beta/ER beta LBD assay to ligand, cells were treated with the ER agonist 17- beta-estradiol
(E2) in 10-point concentration- response format for 8 hours and monitored response on a laser scanning plate
cytometer (Acumen eX3; TTP Lab Tech).

Baseline median absolute deviation for the assay (bmad): 3.038

Response cutoff threshold used to determine hit calls: 20

Detection technology used: Protein-fragment Complementation (Fluorescence)

2.6	Response: Estrogen signaling pathway protein fragment dimerization and enzyme formation in response to
estrogen receptor beta ligand-binding is measured by changes in fluorescence intensity relative to DMSO
(neutral control) baseline.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast ER Pathway Model: Estrogen receptor assays used in ToxCast ER Pathway model

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.564 nM
Key positive control:

17b-Estradiol

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

188 nM
Neutral vehicle control:

DMSO

Additionally, this assay was annotated to the intended target family of nuclear receptor.

-------
3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed into in the positive fitting direction (receptor gain-of-signal activity)
as a percent of 17b-Estradiol (positive control, 100 percent activation) and relative to DMSO, negative control
and signal baseline for activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise
maximum, by assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single-
or multiple-concentration negative control wells (wilt = m or o) by apid, well type, and concentration.),
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag

-------
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2053	Number of chemicals tested: 1857

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
327

Inactive hit count: 0
-------
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

1.113

Neutral control median absolute deviation, by plate: nmad

0.192

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

18.21%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

0.374

6.751

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

((pmed - nmed) /sqrtfpmad2 + nmad2)

12.809

NA

-------
Positive control signal-to-noise: ((pmed-nmed)/nmad)

NA

Positive control signal-to-background: (pmed/nmed)

NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed

NA

Negative control well median absolute deviation value, by plate: mmad

NA

Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

NA

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrtfmmad2 + nmad2)

NA

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

NA

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 119.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

-------
•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: MacDonald ML, Lamerdin J, Owens S, Keon BH, Bilter GK, Shang Z, Huang Z, Yu H, Dias J, Minami
T, Michnick SW, Westwick JK. Identifying off-target effects and hidden phenotypes of drugs in human cells. Nat
Chem Biol. 2006 Jun;2(6):329-37. Epub 2006 May 7. PubMed PMID: 16680159., Yu H, West M, Keon BH, Bilter
GK, Owens S, Lamerdin J, Westwick JK. Measuring drug action in the cellular context using protein-fragment
complementation assays. Assay Drug DevTechnol. 2003 Dec;l(6):811-22. PubMed PMID: 15090227., Bolt MJ,
Stossi F, Callison AM, Mancini MG, Dandekar R, Mancini MA. Systems level-based RNAi screening by high
content analysis identifies UBR5 as a regulator of estrogen receptor-? protein levels and activity. Oncogene.
2015 Jan 8;34(2):154-64. doi: 10.1038/onc.2013.550. Epub 2014 Jan 20. PMID: 24441042; PMCID:
PMC4871123., Stossi F, Bolt MJ, Ashcroft FJ, Lamerdin JE, Melnick JS, Powell RT, Dandekar RD, Mancini MG,
Walker CL, Westwick JK, Mancini MA. Defining estrogenic mechanisms of bisphenol A analogs through high
throughput microscopy-based contextual assays. Chem Biol. 2014 Jun 19;21(6):743-53. doi:
10.1016/j.chembiol.2014.03.013. Epub 2014 May 22. PMID: 24856822; PMCID: PMC4301571.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 747

OT_ER_ERbERb_1440

1.	General Information

1.1	Assay Title: Odyssey Thera 24-hour HEK293T Protein-Complementation Assay for Estrogen Receptor-beta/-beta
Homodimer

1.2	Assay Summary: OT_ER_ERbERb_1440 is a cell-based assay that uses HEK293T, a human kidney cell line, with
measurements taken at 24 hours after chemical dosing in a 384-well plate. OT_ER_ERbERb_1440 is one of one
assay component(s) measured or calculated from the OT_ER_ERbERb_1440 assay. It is designed to make
measurements of protein fragment complementation, a form of binding reporter, as detected with fluorescence
intensity signals by Protein-fragment Complementation technology. Data from the assay component
OT_ER_ERbERb_1440 was analyzed into 1 assay endpoint. This assay endpoint, OT_ER_ERbERb_1440, was
analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of binding reporter, measures of receptor for gain-of-signal activity can be used to
understand the binding at the pathway-level as they relate to the gene ESR2. Furthermore, this assay endpoint
can be referred to as a primary readout, because the performed assay has only produced 1 assay endpoint. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the nuclear
receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Odyssey Thera is a drug discovery company focusing on high-throughput screening with a
particular focus on developing novel protein:protein interaction assays using protein-fragment
complementation technology.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The protein-fragment complementation assay (PCA) methods are trademarked and
patented by Odyssey Thera, Inc.

1.9	Assay Throughput: 384-well plate. HEK-293T cells are seeded into 384-well microtiter plates and allowed to
adhere for 24 hours prior to 24 hour chemical exposures.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals produced from the protein fragment complementation
are indicative of a change in the receptor function and kinetics for the human estrogen receptor 2 (ER beta)
[GeneSymbol:ESR2 | GenelD:2100 | Uniprot_SwissProt_Accession:Q92731],

The Odyssey Thera ER-beta/-beta ligand binding domain assay utilized the ability of the ER-beta to
homodimerize upon ligand-binding with estrogenic compounds. This activity is monitored via Protein-Fragment
Complementation Assays (PCAs) which investigate the biochemical pathways capable of bringing separate
protein fragments into close proximity. Each ER-beta protein contains a rationally dissected fragment of a
yellow-fluorescent protein (YFP) reporter enzyme. When the estrogenic pathway is unimpeded, separate ER-
beta proteins form homodimers and the resulting YFP signal can be measured using fluorescence microscopy to

-------
screen a diverse chemical library for potential xenobiotic ligand-binding and ER-beta interactions. Changes in
protein complex interactions can be impacted by a variety of biochemical events within a pathway, and this
assay is designed to track xenobiotic changes at the level of cell functioning which may occur at a number of
points along the estrogen signaling pathway following a 24-hour incubation of cells with test compound in 384-
well plate, using DMSO as a negative control and baseline signal and 17-beta-estradiol as a positive control and
measure of 100 percent ligand-binding activity in ER-beta.

2.2	Scientific Principles: Endocrine disrupting compounds bind to nuclear hormone receptors, leading to a diverse
array of transcriptional and signaling pathway alterations. These cell- and tissue-type specific changes affect
many aspects of human physiology, including those involved in inflammation, neonatal development and
oncogenesis. Among the human NRs, the estrogen receptor family is particularly susceptible to perturbation by
EDCs because of the ER LBD proclivity to bind a disparate set of small hydrophobic molecules commonly found
in nature and in man-made materials (Shanle and Xu 2010). The estrogen receptor is expressed in two forms,
ER-alpha and ER-beta which play different roles in mediating the actions of estrogenic compounds. Cell-based
and in vivo experiments suggest that the ER-alpha isoform is the primary mediator of estrogenic effects by EDCs
on reproductive development and on cell proliferation, while ER-beta is thought to inhibit estrogen dependent
cell growth, especially in breast cancer cells (Helguero et al. 2005, Hewitt et al. 2005, Jurvanen et al. 2000).
Several studies have associated a loss of ER-beta or a decreased ratio of ER-beta/ER-alpha with other cancer
types, including ovarian and colorectal cancers (Zhao et al. 2008), suggesting a tumor suppressor role for ER-
beta in several cell types. Several isotype-selective ligands that bind one receptor with higher affinity than the
other have been identified (Kraichely et al. 2000, Sun et al. 1999) that produce distinct physiological effects
when tested in animal models (Zhao et al. 2008). Therefore, a highly sensitive assay that can detect ER-beta
transcriptional changes in the context of a whole cell would be a useful tool to differentiate EDCs that
preferentially activate ER-beta-specific pathways. The OT ER-beta/ER-beta LBD PCA utilizes the ability of the ER-
beta LBD to homodimerize upon binding to estrogenic compounds. This dimizeration and concurrent
conformational changes brings into close proximity the fused fragments of split-YFP, leading to a dramatic
increase in YFP intensity. This intensity change is dose-dependent and shows saturation-binding that plateaus
differently for agonists versus antagonists. This assay is intended for use as a part of an integrated testing
strategy, to screen a large structurally diverse chemical library for compounds with the potential to interact with
estrogen receptor alpha mediated pathways and potentially affect endocrine systems in exposed populations.
There is strong evidence that estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse
Outcome Pathway (AOP) leading to reproductive dysfunction in oviparous vertebrates, and there is some
evidence that estrogen receptor activation is the MIE for putative AOPs leading to reduced survival due to renal
failure and leading to skewed sex ratios due to altered sexual differentiation in males (all AOPs currently under
development). Chemical-activity profiles derived from this assay can inform prioritization decisions for
compound selection in more resource intensive in vivo studies to further investigate the involvement of ER
agonism in pathways leading to hazardous outcomes in biological systems.

2.3	Experimental System: adherent HEK293T cell line used. This assay monitors the ligand binding domain (LBD) of
the wild type human ER-beta (stably expressed in HEK293T cells) for xenoestrogenic activation. The HEK-293
cell line are human embryonic epithelial kidney cells (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated
approximately 4.5 kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent
cytogenetic characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK-293T cells
are derived from the HEK293 cell line by the addition of theSV40 large T antigen that has been shown to increase
vector production of some viral vectors. HEK293T are reported to have relatively high transfection efficiencies
when compared to other cell lines (COS-7 and HepG2) (Dai et al. 2015) and it is among the most frequently
utilized cell lines for in small-scale protein production and in viral vector propagation using the transient
transfection method (Lin et al. 2014).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Constitutive
expression of Phase l/ll enzymes has not been well characterized in this HEK293T cell line and metabolic
activation/detoxification of test compounds is potentially limited.

-------
2.5 Exposure Regime: Protein-fragment complementation assay, or PCA, is a method for the identification and
quantification of protein-protein interactions. In the PCA, the proteins of interest ("bait" and "prey") are each
covalently linked to fragments of a third protein (e.g. DHFR, which acts as a "reporter"). Interaction between
the bait and the prey proteins brings the fragments of the reporter protein in close proximity to allow them to
form a functional reporter protein whose activity can be measured. The OT ER beta / ER beta assay is a
homodimer PCA of the ligand binding domain (LBD; amino acids 310-547) of human ER beta stably expressed in
HEK293T cells. Cells are seeded into optical quality 384-well poly-D-lysine coated plates in phenol red-free
medium supplemented with 10 percent dextran-treated FBS and allowed to adhere for 24 hours prior to
treatment with compounds of interest or controls for 24 hours. Cells are fixed in 4 percent formaldehyde and
stained with Draq5 (BioStatus) to identify cells prior to signal detection. Fluorescent signal in the basal state of
the assay is very low and is primarily punctuate and cytoplasmic. Modulation of this assay is quantified as an
increase in mean fluorescence intensity in the nucleus (for agonists) or in the nucleus and cytoplasm
(antagonists) relative to vehicle controls, and can be quantified on a high content imaging device or a laser
scanning cytometer. The latter instrument is favored for this assay because of the rapid mode of data
acquisition, the ability to acquire data from the entire cell population in a well (as opposed to a fraction of the
well on the high content devices) and the larger dynamic range typically achieved with this instrument. To assess
sensitivity of the ER beta/ER beta LBD assay to ligand, cells were treated with the ER agonist 17- beta-estradiol
(E2) in 10-point concentration- response format for 24 hours and monitored response on a laser scanning plate
cytometer (Acumen eX3; TTP Lab Tech).

Baseline median absolute deviation for the assay (bmad): 1.484

Response cutoff threshold used to determine hit calls: 20

Detection technology used: Protein-fragment Complementation (Fluorescence)

2.6	Response: Estrogen signaling pathway protein fragment dimerization and enzyme formation in response to
estrogen receptor beta ligand-binding is measured by changes in fluorescence intensity relative to DMSO
(neutral control) baseline.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast ER Pathway Model: Estrogen receptor assays used in ToxCast ER Pathway model

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.564 nM
Key positive control:

17b-Estradiol

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

188 nM
Neutral vehicle control:

DMSO

Additionally, this assay was annotated to the intended target family of nuclear receptor.

-------
3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed into in the positive fitting direction (receptor gain-of-signal activity)
as a percent of 17b-Estradiol (positive control, 100 percent activation) and relative to DMSO, negative control
and signal baseline for activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise
maximum, by assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single-
or multiple-concentration negative control wells (wilt = m or o) by apid, well type, and concentration.),
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag

-------
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2053	Number of chemicals tested: 1857

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
346

Inactive hit count: 0
-------
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

1.251

Neutral control median absolute deviation, by plate: nmad

0.186

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

14.72%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

14.006

0.648

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

((pmed - nmed) /sqrtfpmad2 + nmad2)

18.301

NA

-------
Positive control signal-to-noise: ((pmed-nmed)/nmad)

NA

Positive control signal-to-background: (pmed/nmed)

NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed

NA

Negative control well median absolute deviation value, by plate: mmad

NA

Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

NA

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrtfmmad2 + nmad2)

NA

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

NA

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 158.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

-------
•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: MacDonald ML, Lamerdin J, Owens S, Keon BH, Bilter GK, Shang Z, Huang Z, Yu H, Dias J, Minami
T, Michnick SW, Westwick JK. Identifying off-target effects and hidden phenotypes of drugs in human cells. Nat
Chem Biol. 2006 Jun;2(6):329-37. Epub 2006 May 7. PubMed PMID: 16680159., Yu H, West M, Keon BH, Bilter
GK, Owens S, Lamerdin J, Westwick JK. Measuring drug action in the cellular context using protein-fragment
complementation assays. Assay Drug DevTechnol. 2003 Dec;l(6):811-22. PubMed PMID: 15090227., Bolt MJ,
Stossi F, Callison AM, Mancini MG, Dandekar R, Mancini MA. Systems level-based RNAi screening by high
content analysis identifies UBR5 as a regulator of estrogen receptor-? protein levels and activity. Oncogene.
2015 Jan 8;34(2):154-64. doi: 10.1038/onc.2013.550. Epub 2014 Jan 20. PMID: 24441042; PMCID:
PMC4871123., Stossi F, Bolt MJ, Ashcroft FJ, Lamerdin JE, Melnick JS, Powell RT, Dandekar RD, Mancini MG,
Walker CL, Westwick JK, Mancini MA. Defining estrogenic mechanisms of bisphenol A analogs through high
throughput microscopy-based contextual assays. Chem Biol. 2014 Jun 19;21(6):743-53. doi:
10.1016/j.chembiol.2014.03.013. Epub 2014 May 22. PMID: 24856822; PMCID: PMC4301571.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 750

OT_ERa_GFPERaERE_0120

1.	General Information

1.1	Assay Title: Odyssey Thera 2-hour HeLa Transcription Induction Assay for Estrogen Receptor

1.2	Assay Summary: OT_ERa_GFPERaERE_0120 is a cell-based, single-readout assay that uses HeLa, a human cervix
cell line, with measurements taken at 2 hours after chemical dosing in a 384-well plate.
OT_ERa_GFPERaERE_0120 is one of one assay component(s) measured or calculated from the
OT_ERa_GFPERaERE_0120 assay. It is designed to make measurements of fluorescent protein induction, a form
of inducible reporter, as detected with optical microscopy: fluorescence microscopy signals by Microscopy
technology. Data from the assay component OT_ERa_GFPERaERE_0120 was analyzed into 1 assay endpoint.
This assay endpoint, OT_ERa_EREGFP_0120, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of inducible reporter, measures of receptor
for gain-of-signal activity can be used to understand the reporter gene at the pathway-level as they relate to
the gene ESR1. Furthermore, this assay endpoint can be referred to as a primary readout, because the
performed assay has only produced 1 assay endpoint. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Odyssey Thera is a drug discovery company focusing on high-throughput screening with a
particular focus on developing novel protein:protein interaction assays using protein-fragment
complementation technology.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. ER:PRL-HeLa cell line was developed and provided to
Odyssey Thera, Inc. by the Mancini lab at Baylor University.

1.9	Assay Throughput: 384-well plate. HeLa cells are seeded into 384-well microtiter plates and allowed to adhere
for 24 hours prior to treatment with test compounds or controls for 2 hours.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to optical microscopy: fluorescence microscopy signals produced from the fluorescent
protein induction are indicative of a change in the receptor function and kinetics for the human estrogen
receptor 1 [GeneSymbol:ESRl | GenelD:2099 | Uniprot_SwissProt_Accession:P03372],

This Odyssey Thera estrogen receptor alpha green fluorescent protein (GFP) estrogen response element (ERE)
assay was developed to measure long-term transcriptional changes induced by ligand-binding as detected in a
cervical adenocarcinoma cell line stably expressing both full-length human ER-alpha and multiple estrogen
responsive prolactin promoter sequences. ER-alpha interacts with estrogenic ligands and following 2-hour
incubation of test compound with cells in a 384-well plate, xenoestrogenic activation of a microscopically visible
reporter gene is detected as an increase in mean signal relative to baseline activity (DMSO control) using a 10-

-------
point concentration-response assay format. The ER:PRL-HeLa line constitutively expresses physiologically-
relevant levels of fluorescently-tagged, full-length human ER-alpha, and contains multi-copy genomic insertions
of the prolactin promoter. When stimulated by agonists, tagged ER-alpha accumulates on the prolactin array in
an open (transcriptionally-active) binding mode, leading to a bright, micron-sized spot. In contrast, antagonist-
treated cells lead to tagged-ER-alpha binding in its closed (transcriptionally-repressive) binding-mode, leading
to a condensed array that appears as a sub-micron-sized point. A robust algorithm determines both array
occupancy and diffuse/condensed status, allowing for the simultaneous readout of ER-alpha-induced ERE
binding as well as binding mode (agonist versus antagonist).

2.2	Scientific Principles: Endocrine disrupting compounds bind to nuclear hormone receptors, leading to a diverse
array of transcriptional and signaling pathway alterations. These cell- and tissue-type specific changes affect
many aspects of human physiology, including those involved in inflammation, neonatal development and
oncogenesis. Among the human NRs, the estrogen receptor family is particularly susceptible to perturbation by
EDCs because of the ER LBD's proclivity to bind a disparate set of small hydrophobic molecules commonly found
in nature and in man-made materials (Shanle and Xu 2010). Cell-based and in vivo experiments suggest that the
ER-alpha isoform is the primary mediator of estrogenic effects by EDCs. This assay is intended for use as a part
of an integrated testing strategy, to screen a large structurally diverse chemical library for compounds with the
potential to interact with estrogen receptor mediated pathways and potentially affect endocrine systems in
exposed populations. There is strong evidence that estrogen receptor agonism is the MIE leading to
reproductive dysfunction in oviparous vertebrates, and there is some evidence that estrogen receptor activation
is the MIE for putative adverse outcome pathways leading to reduced survival due to renal failure and leading
to skewed sex ratios due to altered sexual differentiation in males. Chemical-activity profiles derived from this
assay can inform prioritization decisions for compound selection in more resource intensive in vivo studies to
further investigate the involvement of ER agonism in pathways leading to hazardous outcomes in biological
systems.

2.3	Experimental System: adherent HeLa cell line used. GFP_ER-alpha:PRL-HeLa cells are cervical adenocarcinoma
epithelial cells, isolated from a 31 year old African-American female in February, 1951 (Jones et al. 1971), which
constitutively express fluorescently-tagged full length human ER-alpha and multiple integrated prolactin
promotor sequences. The stable ER:PRL-HeLa cell line was developed by the Mancini lab at Baylor College of
Medicine (Ashcroft et al. 2011).

2.4	Metabolic Competence: Constitutive expression of CYP1A1 and CYP1B1 mRNA; CYP1A2 expression was
examined but not detected in HeLa cells (Iwanari et al. 2002, Nakajima et al. 2003). Expression of tumor-
suppressing p53 and pRB proteins has been reported to be low (Scheffner et al. 1991).

2.5	Exposure Regime: The stable ER alpha: PRL-HeLa line is seeded into optical quality 384-well poly-D-lysine coated
plates in phenol red-free medium with 10 percent dextran-treated FBS and allowed to adhere for 24 hours prior
to treatment with test compounds or controls for 1 or 8 hours. Cells are fixed in 4 percent formaldehyde and
stained with Draq5 (BioStatus) to identify cells prior to signal detection. All assay endpoints are quantified using
high content image analysis algorithms.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.564 nM
Key positive control:

(Z)-4-Hydroxytamoxifen
Baseline median absolute deviation for the assay (bmad): 0.488
Response cutoff threshold used to determine hit calls: 20
Detection technology used: Microscopy (Microscopy)

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

188 nM
Neutral vehicle control:

DMSO

-------
2.6

Response: Ligand binding of estrogen receptor alpha and xenoestrogenic effects on transcriptional regulation
is measured by is measured by changes in fluorescent reporter gene expression.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast ER Pathway Model: Estrogen receptor assays used in ToxCast ER Pathway model

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using percent activity
4-Hydroxytamoxifen as positive control (100 percent activity) over DMSO controls.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise
maximum, by assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single-
or multiple-concentration negative control wells (wilt = m or o) by apid, well type, and concentration.),
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.)

-------
Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2053	Number of chemicals tested: 1857

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
289

1759

5

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

72
147

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

840

114

quadratic-polynomialfpoly2) model: 309

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

4

72

337

158

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	8.75

Neutral control median absolute deviation, by plate: nmad	1.483

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	17.17%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	321.5

Positive control well median absolute deviation, by plate: pmad	16.494

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	18.683

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 158.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Bolt MJ, Stossi F, Callison AM, Mancini MG, Dandekar R, Mancini MA. Systems level-based RNAi
screening by high content analysis identifies UBR5 as a regulator of estrogen receptor-? protein levels and
activity. Oncogene. 2015 Jan 8;34(2):154-64. doi: 10.1038/onc.2013.550. Epub 2014 Jan 20. PMID: 24441042;
PMCID: PMC4871123., Stossi F, Bolt MJ, Ashcroft FJ, Lamerdin JE, Melnick JS, Powell RT, Dandekar RD, Mancini
MG, Walker CL, Westwick JK, Mancini MA. Defining estrogenic mechanisms of bisphenol A analogs through high
throughput microscopy-based contextual assays. Chem Biol. 2014 Jun 19;21(6):743-53. doi:
10.1016/j.chembiol.2014.03.013. Epub 2014 May 22. PMID: 24856822; PMCID: PMC4301571.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 751

OT_ERa_GFPERaERE_0480

1.	General Information

1.1	Assay Title: Odyssey Thera 8-hour HeLa Transcription Induction Assay for Estrogen Receptor

1.2	Assay Summary: OT_ERa_GFPERaERE_0480 is a cell-based, single-readout assay that uses HeLa, a human cervix
cell line, with measurements taken at 8 hours after chemical dosing in a 384-well plate.
OT_ERa_GFPERaERE_0480 is one of one assay component(s) measured or calculated from the
OT_ERa_GFPERaERE_0480 assay. It is designed to make measurements of fluorescent protein induction, a form
of inducible reporter, as detected with optical microscopy: fluorescence microscopy signals by Microscopy
technology. Data from the assay component OT_ERa_GFPERaERE_0480 was analyzed into 1 assay endpoint.
This assay endpoint, OT_ERa_EREGFP_0480, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of inducible reporter, measures of receptor
for gain-of-signal activity can be used to understand the reporter gene at the pathway-level as they relate to
the gene ESR1. Furthermore, this assay endpoint can be referred to as a primary readout, because the
performed assay has only produced 1 assay endpoint. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Odyssey Thera is a drug discovery company focusing on high-throughput screening with a
particular focus on developing novel protein:protein interaction assays using protein-fragment
complementation technology.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. ER:PRL-HeLa cell line was developed and provided to
Odyssey Thera, Inc. by the Mancini lab at Baylor University.

1.9	Assay Throughput: 384-well plate. HeLa cells are seeded into 384-well microtiter plates and allowed to adhere
for 24 hours prior to treatment with test compounds or controls for 8 hours.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to optical microscopy: fluorescence microscopy signals produced from the fluorescent
protein induction are indicative of a change in the receptor function and kinetics for the human estrogen
receptor 1 [GeneSymbol:ESRl | GenelD:2099 | Uniprot_SwissProt_Accession:P03372],

This Odyssey Thera estrogen receptor alpha green fluorescent protein (GFP) estrogen response element (ERE)
assay was developed to measure long-term transcriptional changes induced by ligand-binding as detected in a
cervical adenocarcinoma cell line stably expressing both full-length human ER-alpha and multiple estrogen
responsive prolactin promoter sequences. ER-alpha interacts with estrogenic ligands and following 2-hour
incubation of test compound with cells in a 384-well plate, xenoestrogenic activation of a microscopically visible
reporter gene is detected as an increase in mean signal relative to baseline activity (DMSO control) using a 10-

-------
point concentration-response assay format. The ER:PRL-HeLa line constitutively expresses physiologically-
relevant levels of fluorescently-tagged, full-length human ER-alpha, and contains multi-copy genomic insertions
of the prolactin promoter. When stimulated by agonists, tagged ER-alpha accumulates on the prolactin array in
an open (transcriptionally-active) binding mode, leading to a bright, micron-sized spot. In contrast, antagonist-
treated cells lead to tagged-ER-alpha binding in its closed (transcriptionally-repressive) binding-mode, leading
to a condensed array that appears as a sub-micron-sized point. A robust algorithm determines both array
occupancy and diffuse/condensed status, allowing for the simultaneous readout of ER-alpha-induced ERE
binding as well as binding mode (agonist versus antagonist).

2.2	Scientific Principles: Endocrine disrupting compounds bind to nuclear hormone receptors, leading to a diverse
array of transcriptional and signaling pathway alterations. These cell- and tissue-type specific changes affect
many aspects of human physiology, including those involved in inflammation, neonatal development and
oncogenesis. Among the human NRs, the estrogen receptor family is particularly susceptible to perturbation by
EDCs because of the ER LBD's proclivity to bind a disparate set of small hydrophobic molecules commonly found
in nature and in man-made materials (Shanle and Xu 2010). Cell-based and in vivo experiments suggest that the
ER-alpha isoform is the primary mediator of estrogenic effects by EDCs. This assay is intended for use as a part
of an integrated testing strategy, to screen a large structurally diverse chemical library for compounds with the
potential to interact with estrogen receptor mediated pathways and potentially affect endocrine systems in
exposed populations. There is strong evidence that estrogen receptor agonism is the MIE leading to
reproductive dysfunction in oviparous vertebrates, and there is some evidence that estrogen receptor activation
is the MIE for putative adverse outcome pathways leading to reduced survival due to renal failure and leading
to skewed sex ratios due to altered sexual differentiation in males. Chemical-activity profiles derived from this
assay can inform prioritization decisions for compound selection in more resource intensive in vivo studies to
further investigate the involvement of ER agonism in pathways leading to hazardous outcomes in biological
systems.

2.3	Experimental System: adherent HeLa cell line used. GFP_ER-alpha:PRL-HeLa cells are cervical adenocarcinoma
epithelial cells, isolated from a 31 year old African-American female in February, 1951 (Jones et al. 1971), which
constitutively express fluorescently-tagged full length human ER-alpha and multiple integrated prolactin
promotor sequences.

2.4	Metabolic Competence: Constitutive expression of CYP1A1 and CYP1B1 mRNA; CYP1A2 expression was
examined but not detected in HeLa cells (Iwanari et al. 2002, Nakajima et al. 2003). Expression of tumor-
suppressing p53 and pRB proteins has been reported to be low (Scheffner et al. 1991).

2.5	Exposure Regime: The stable ER alpha: PRL-HeLa line is seeded into optical quality 384-well poly-D-lysine coated
plates in phenol red-free medium with 10 percent dextran-treated FBS and allowed to adhere for 24 hours prior
to treatment with test compounds or controls for 8 hours. Cells are fixed in 4 percent formaldehyde and stained
with Draq5 (BioStatus) to identify cells prior to signal detection. All assay endpoints are quantified using high
content image analysis algorithms.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.564 nM
Key positive control:

(Z)-4-Hydroxytamoxifen

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

188 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.227
Response cutoff threshold used to determine hit calls: 20
Detection technology used: Microscopy (Microscopy)

-------
2.6	Response: Ligand binding of estrogen receptor alpha and xenoestrogenic effects on transcriptional regulation
is measured by is measured by changes in fluorescent reporter gene expression.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast ER Pathway Model: Estrogen receptor assays used in ToxCast ER Pathway model

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using percent activity
4-Hydroxytamoxifen as positive control (100 percent activity) over DMSO controls.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise
maximum, by assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single-
or multiple-concentration negative control wells (wilt = m or o) by apid, well type, and concentration.),
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.)

-------
Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2053	Number of chemicals tested: 1857

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
293

1760

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

75
126

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

938

125

quadratic-polynomialfpoly2) model: 252

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

94

2

297

144

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	6.5

Neutral control median absolute deviation, by plate: nmad	0.741

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	13.19%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	351.25

Positive control well median absolute deviation, by plate: pmad	16.679

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	20.297

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 144.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Bolt MJ, Stossi F, Callison AM, Mancini MG, Dandekar R, Mancini MA. Systems level-based RNAi
screening by high content analysis identifies UBR5 as a regulator of estrogen receptor-? protein levels and
activity. Oncogene. 2015 Jan 8;34(2):154-64. doi: 10.1038/onc.2013.550. Epub 2014 Jan 20. PMID: 24441042;
PMCID: PMC4871123., Stossi F, Bolt MJ, Ashcroft FJ, Lamerdin JE, Melnick JS, Powell RT, Dandekar RD, Mancini
MG, Walker CL, Westwick JK, Mancini MA. Defining estrogenic mechanisms of bisphenol A analogs through high
throughput microscopy-based contextual assays. Chem Biol. 2014 Jun 19;21(6):743-53. doi:
10.1016/j.chembiol.2014.03.013. Epub 2014 May 22. PMID: 24856822; PMCID: PMC4301571.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 753

OT_FXR_FXRSRC1_0480

1.	General Information

1.1	Assay Title: Odyssey Thera 8-hour HEK293T Protein-Complementation Assay for Farnesoid X receptor/SRC-1 Co-
activator

1.2	Assay Summary: OT_FXR_FXRSRC1_0480 is a cell-based assay that uses HEK293T, a human kidney cell line, with
measurements taken at 8 hours after chemical dosing in a 384-well plate. OT_FXR_FXRSRC1_0480 is one of one
assay component(s) measured or calculated from the OT_FXR_FXRSRC1_0480 assay. It is designed to make
measurements of protein fragment complementation, a form of binding reporter, as detected with fluorescence
intensity signals by Protein-fragment Complementation technology. Data from the assay component
OT_FXR_FXRSRC1_0480 was analyzed into 1 assay endpoint. This assay endpoint, OT_FXR_FXRSRC1_0480, was
analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of binding reporter, measures of receptor for gain-of-signal activity can be used to
understand the binding at the pathway-level as they relate to the gene NR1H4. Furthermore, this assay
endpoint can be referred to as a primary readout, because the performed assay has only produced 1 assay
endpoint. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
nuclear receptor intended target family, where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Odyssey Thera is a drug discovery company focusing on high-throughput screening with a
particular focus on developing novel protein:protein interaction assays using protein-fragment
complementation technology.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The protein-fragment complementation assay (PCA) methods are trademarked and
patented by Odyssey Thera, Inc.

1.9	Assay Throughput: 384-well plate. HEK-293T cells are seeded into 384-well microtiter plates and allowed to
adhere for 24 hours prior to 24 hour chemical exposures.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals produced from the protein fragment complementation
are indicative of a change in the receptor function and kinetics for the human SRC proto-oncogene, non-receptor
tyrosine kinase and nuclear receptor subfamily 1, group H, member 4 [GeneSymbol:SRC& NR1H4 | GenelD:6714
& 9971 | Uniprot_SwissProt_Accession:P12931 & Q96RI1],

Odyssey Thera (OT) developed high-throughput chemical screening assays which utilize diverse technologies to
screen protein expression and interactions in stably transfected cell-lines. The specialized Protein
Complementation Assays (PCAs) monitor key nodes in biochemical pathways.

2.2	Scientific Principles: Interaction of the tagged key proteins of the node brings separate protein fragments into
close proximity and reconstitutes a functional reporter (fluorescent protein) producing a fluorescent signal

-------
when the pathway is unimpeded. Changes in protein complex interactions can be impacted by a variety of
biochemical events within a pathway, and these assays are designed to track xenobiotic induced changes at the
level of cell functioning which may occur at multiple points along signaling pathways following incubation with
test compound in 384-well plates.

2.3	Experimental System: adherent HEK293T cell line used. This assay utilizes HEK293T cells. The HEK-293 cell line
are human embryonic epithelial kidney cells (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK-293T cells are derived
from the HEK293 cell line by the addition of the SV40 large T antigen that has been shown to increase vector
production of some viral vectors. HEK293T are reported to have relatively high transfection efficiencies when
compared to other cell lines (COS-7 and HepG2) (Dai et al. 2015) and it is among the most frequently utilized
cell lines for in small-scale protein production and in viral vector propagation using the transient transfection
method (Lin et al. 2014).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Constitutive
expression of Phase l/ll enzymes has not been well characterized in this HEK293T cell line and metabolic
activation/detoxification of test compounds is potentially limited.

2.5	Exposure Regime: Protein-fragment complementation assay, or PCA, is a method for the identification and
quantification of protein-protein interactions. In the PCA, the proteins of interest ("bait" and "prey") are each
covalently linked to fragments of a third protein (e.g. DHFR, which acts as a "reporter"). Interaction between
the bait and the prey proteins brings the fragments of the reporter protein in close proximity to allow them to
form a functional reporter protein whose activity can be measured.

Baseline median absolute deviation for the assay (bmad): 5.013

Response cutoff threshold used to determine hit calls: 25.064

Detection technology used: Protein-fragment Complementation (Fluorescence)

2.6 Response: Farnesoid X receptor signaling pathway protein formation in response to FXR ligand-binding /SRC-1
co-factor recruitment is measured by changes in fluorescence intensity relative to DMSO (neutral control)

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.564 nM
Key positive control:

GW4064

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

188 nM
Neutral vehicle control:

DMSO

baseline.

-------
2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using percent activity
GW4064 as positive control (100 percent activity) over DMSO controls.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise
maximum, by assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single-
or multiple-concentration negative control wells (wilt = m or o) by apid, well type, and concentration.),
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

-------
Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2053	Number of chemicals tested: 1857

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
405

Inactive hit count: 0
-------
exponentials (exp5) model:

189

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

36.423

Neutral control median absolute deviation, by plate: nmad

3.871

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

10.14%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

104.769

8.521

-------
Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - rimed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	7.318

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 189.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.

5.1

Potential Regulatory Applications

Context of Use: Examples of end use scenarios could include, but are not limited to:

-------
•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: MacDonald ML, Lamerdin J, Owens S, Keon BH, Bilter GK, Shang Z, Huang Z, Yu H, Dias J, Minami
T, Michnick SW, Westwick JK. Identifying off-target effects and hidden phenotypes of drugs in human cells. Nat
Chem Biol. 2006 Jun;2(6):329-37. Epub 2006 May 7. PubMed PMID: 16680159., Yu H, West M, Keon BH, Bilter
GK, Owens S, Lamerdin J, Westwick JK. Measuring drug action in the cellular context using protein-fragment
complementation assays. Assay Drug DevTechnol. 2003 Dec;l(6):811-22. PubMed PMID: 15090227.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 754

OT_FXR_FXRSRC1_1440

1.	General Information

1.1	Assay Title: Odyssey Thera 24-hour HEK293T Protein-Complementation Assay for Farnesoid X receptor/SRC-1
Co-activator

1.2	Assay Summary: OT_FXR_FXRSRC1_1440 is a cell-based assay that uses HEK293T, a human kidney cell line, with
measurements taken at 24 hours after chemical dosing in a 384-well plate. OT_FXR_FXRSRC1_1440 is one of
one assay component(s) measured or calculated from the OT_FXR_FXRSRC1_1440 assay. It is designed to make
measurements of protein fragment complementation, a form of binding reporter, as detected with fluorescence
intensity signals by Protein-fragment Complementation technology. Data from the assay component
OT_FXR_FXRSRC1_1440 was analyzed into 1 assay endpoint. This assay endpoint, OT_FXR_FXRSRC1_1440, was
analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of binding reporter, measures of receptor for gain-of-signal activity can be used to
understand the binding at the pathway-level as they relate to the gene NR1H4. Furthermore, this assay
endpoint can be referred to as a primary readout, because the performed assay has only produced 1 assay
endpoint. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
nuclear receptor intended target family, where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Odyssey Thera is a drug discovery company focusing on high-throughput screening with a
particular focus on developing novel protein:protein interaction assays using protein-fragment
complementation technology.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The protein-fragment complementation assay (PCA) methods are trademarked and
patented by Odyssey Thera, Inc.

1.9	Assay Throughput: 384-well plate. HEK-293T cells are seeded into 384-well microtiter plates and allowed to
adhere for 24 hours prior to 8 hour chemical exposures.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes to fluorescence intensity signals produced from the protein fragment complementation
are indicative of a change in the receptor function and kinetics for the human SRC proto-oncogene, non-receptor
tyrosine kinase and nuclear receptor subfamily 1, group H, member 4 [GeneSymbol:SRC& NR1H4 | GenelD:6714
& 9971 | Uniprot_SwissProt_Accession:P12931 & Q96RI1],

Odyssey Thera (OT) developed high-throughput chemical screening assays which utilize diverse technologies to
screen protein expression and interactions in stably transfected cell-lines. The specialized Protein
Complementation Assays (PCAs) monitor key nodes in biochemical pathways.

2.2	Scientific Principles: Interaction of the tagged key proteins of the node brings separate protein fragments into
close proximity and reconstitutes a functional reporter (fluorescent protein) producing a fluorescent signal

-------
when the pathway is unimpeded. Changes in protein complex interactions can be impacted by a variety of
biochemical events within a pathway, and these assays are designed to track xenobiotic induced changes at the
level of cell functioning which may occur at multiple points along signaling pathways following incubation with
test compound in 384-well plates.

2.3	Experimental System: adherent HEK293T cell line used. This assay utilizes HEK293T cells. The HEK-293 cell line
are human embryonic epithelial kidney cells (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK-293T cells are derived
from the HEK293 cell line by the addition of the SV40 large T antigen that has been shown to increase vector
production of some viral vectors. HEK293T are reported to have relatively high transfection efficiencies when
compared to other cell lines (COS-7 and HepG2) (Dai et al. 2015) and it is among the most frequently utilized
cell lines for in small-scale protein production and in viral vector propagation using the transient transfection
method (Lin et al. 2014).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Constitutive
expression of Phase l/ll enzymes has not been well characterized in this HEK293T cell line and metabolic
activation/detoxification of test compounds is potentially limited.

2.5	Exposure Regime: Protein-fragment complementation assay, or PCA, is a method for the identification and
quantification of protein-protein interactions. In the PCA, the proteins of interest ("bait" and "prey") are each
covalently linked to fragments of a third protein (e.g. DHFR, which acts as a "reporter"). Interaction between
the bait and the prey proteins brings the fragments of the reporter protein in close proximity to allow them to
form a functional reporter protein whose activity can be measured.

Baseline median absolute deviation for the assay (bmad): 8.324

Response cutoff threshold used to determine hit calls: 41.622

Detection technology used: Protein-fragment Complementation (Fluorescence)

2.6 Response: Farnesoid X receptor signaling pathway protein formation in response to FXR ligand-binding /SRC-1
co-factor recruitment is measured by changes in fluorescence intensity relative to DMSO (neutral control)

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.564 nM
Key positive control:

GW4064

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

188 nM
Neutral vehicle control:

DMSO

baseline.

-------
2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using percent activity
GW4064 as positive control (100 percent activity) over DMSO controls.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise
maximum, by assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single-
or multiple-concentration negative control wells (wilt = m or o) by apid, well type, and concentration.),
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

-------
Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 2053	Number of chemicals tested: 1857

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
356

Inactive hit count: 0
-------
exponentials (exp5) model:

271

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

72.868

Neutral control median absolute deviation, by plate: nmad

8.851

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

12.62%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

179.36

12.127

-------
Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - rimed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	6.04

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 271.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.

5.1

Potential Regulatory Applications

Context of Use: Examples of end use scenarios could include, but are not limited to:

-------
•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: MacDonald ML, Lamerdin J, Owens S, Keon BH, Bilter GK, Shang Z, Huang Z, Yu H, Dias J, Minami
T, Michnick SW, Westwick JK. Identifying off-target effects and hidden phenotypes of drugs in human cells. Nat
Chem Biol. 2006 Jun;2(6):329-37. Epub 2006 May 7. PubMed PMID: 16680159., Yu H, West M, Keon BH, Bilter
GK, Owens S, Lamerdin J, Westwick JK. Measuring drug action in the cellular context using protein-fragment
complementation assays. Assay Drug DevTechnol. 2003 Dec;l(6):811-22. PubMed PMID: 15090227.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 755

OT_N U RR1_N U RRlRXRa_0480

1.	General Information

1.1	Assay Title: Odyssey Thera 8-hour HEK293T Protein Complementation Assay for Nurrl/retinoid X receptor (RXR)

1.2	Assay Summary: OT_NURRl_NURRlRXRa_0480 is a cell-based assay that uses HEK293T, a human kidney cell
line, with measurements taken at 8 hours after chemical dosing in a 384-well plate.
OT_NURRl_NURRlRXRa_0480 is one of one assay component(s) measured or calculated from the
OT_NURRl_NURRlRXRa_0480 assay. It is designed to make measurements of protein fragment
complementation, a form of binding reporter, as detected with fluorescence intensity signals by Protein-
fragment Complementation technology. Data from the assay component OT_NURRl_NURRlRXRa_0480 was
analyzed into 1 assay endpoint. This assay endpoint, OT_NURRl_NURRlRXRa_0480, was analyzed in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of binding reporter, measures of receptor for gain-of-signal activity can be used to understand the binding at
the pathway-level as they relate to the gene RXRA. Furthermore, this assay endpoint can be referred to as a
primary readout, because the performed assay has only produced 1 assay endpoint. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Odyssey Thera is a drug discovery company focusing on high-throughput screening with a
particular focus on developing novel protein:protein interaction assays using protein-fragment
complementation technology.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The protein-fragment complementation assay (PCA) methods are trademarked and
patented by Odyssey Thera, Inc.

1.9	Assay Throughput: 384-well plate. HEK-293T cells are seeded into 384-well microtiter plates and allowed to
adhere for 24 hours prior to 24 hour chemical exposures.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals produced from the protein fragment complementation
are indicative of a change in the receptor function and kinetics for the human nuclear receptor subfamily 4,
group A, member 2 and retinoid X receptor, alpha [GeneSymbol:NR4A2 & RXRA | GenelD:4929 & 6256 |
Uniprot_SwissProt_Accession:P43354 & P19793],

Odyssey Thera (OT) developed high-throughput chemical screening assays which utilize diverse technologies to
screen protein expression and interactions in stably transfected cell-lines. The specialized Protein
Complementation Assays (PCAs) monitor key nodes in biochemical pathways.

2.2 Scientific Principles: Interaction of the tagged key proteins of the node brings separate protein fragments into
close proximity and reconstitutes a functional reporter (fluorescent protein) producing a fluorescent signal

-------
when the pathway is unimpeded. Changes in protein complex interactions can be impacted by a variety of
biochemical events within a pathway, and these assays are designed to track xenobiotic induced changes at the
level of cell functioning which may occur at multiple points along signaling pathways following incubation with
test compound in 384-well plates.

2.3	Experimental System: adherent HEK293T cell line used. This assay utilizes HEK293T cells. The HEK-293 cell line
are human embryonic epithelial kidney cells (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK-293T cells are derived
from the HEK293 cell line by the addition of the SV40 large T antigen that has been shown to increase vector
production of some viral vectors. HEK293T are reported to have relatively high transfection efficiencies when
compared to other cell lines (COS-7 and HepG2) (Dai et al. 2015) and it is among the most frequently utilized
cell lines for in small-scale protein production and in viral vector propagation using the transient transfection
method (Lin et al. 2014).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Constitutive
expression of Phase l/ll enzymes has not been well characterized in this HEK293T cell line and metabolic
activation/detoxification of test compounds is potentially limited.

2.5	Exposure Regime: Protein-fragment complementation assay, or PCA, is a method for the identification and
quantification of protein-protein interactions. In the PCA, the proteins of interest ("bait" and "prey") are each
covalently linked to fragments of a third protein (e.g. DHFR, which acts as a "reporter"). Interaction between
the bait and the prey proteins brings the fragments of the reporter protein in close proximity to allow them to
form a functional reporter protein whose activity can be measured.

Baseline median absolute deviation for the assay (bmad): 4.039

Response cutoff threshold used to determine hit calls: 20.197

Detection technology used: Protein-fragment Complementation (Fluorescence)

2.6	Response: NURR1 signaling pathway protein fragment dimerization and enzyme formation in response to
retinoid X receptor (RXR) ligand binding is measured by changes in fluorescence intensity relative to DMSO
(neutral control) baseline.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.564 nM
Key positive control:

Isoproterenol; Fenoterol

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

188 nM
Neutral vehicle control:

DMSO

-------
2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using percent activity
GW4064 as positive control (100 percent activity) over DMSO controls.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise
maximum, by assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single-
or multiple-concentration negative control wells (wilt = m or o) by apid, well type, and concentration.),
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

-------
Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 3064	Number of chemicals tested: 1860

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
261

Inactive hit count: 0
-------
exponentials (exp5) model:

196

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

59.415

Neutral control median absolute deviation, by plate: nmad

5.526

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

9.83%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

169.661

12.815

-------
Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - rimed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	8.166

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 196.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.

5.1

Potential Regulatory Applications

Context of Use: Examples of end use scenarios could include, but are not limited to:

-------
•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: MacDonald ML, Lamerdin J, Owens S, Keon BH, Bilter GK, Shang Z, Huang Z, Yu H, Dias J, Minami
T, Michnick SW, Westwick JK. Identifying off-target effects and hidden phenotypes of drugs in human cells. Nat
Chem Biol. 2006 Jun;2(6):329-37. Epub 2006 May 7. PubMed PMID: 16680159., Yu H, West M, Keon BH, Bilter
GK, Owens S, Lamerdin J, Westwick JK. Measuring drug action in the cellular context using protein-fragment
complementation assays. Assay Drug DevTechnol. 2003 Dec;l(6):811-22. PubMed PMID: 15090227.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 756

OT_N U RR1_N U RRlRXRa_1440

1.	General Information

1.1	Assay Title: Odyssey Thera 24-hour HEK293T Protein Complementation Assay for Nurrl/retinoid X receptor
(RXR)

1.2	Assay Summary: OT_NURRl_NURRlRXRa_1440 is a cell-based assay that uses HEK293T, a human kidney cell
line, with measurements taken at 24 hours after chemical dosing in a 384-well plate.
OT_NURRl_NURRlRXRa_1440 is one of one assay component(s) measured or calculated from the
OT_NURRl_NURRlRXRa_1440 assay. It is designed to make measurements of protein fragment
complementation, a form of binding reporter, as detected with fluorescence intensity signals by Protein-
fragment Complementation technology. Data from the assay component OT_NURRl_NURRlRXRa_1440 was
analyzed into 1 assay endpoint. This assay endpoint, OT_NURRl_NURRlRXRa_1440, was analyzed in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of binding reporter, measures of receptor for gain-of-signal activity can be used to understand the binding at
the pathway-level as they relate to the gene RXRA. Furthermore, this assay endpoint can be referred to as a
primary readout, because the performed assay has only produced 1 assay endpoint. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family,
where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Odyssey Thera is a drug discovery company focusing on high-throughput screening with a
particular focus on developing novel protein:protein interaction assays using protein-fragment
complementation technology.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The protein-fragment complementation assay (PCA) methods are trademarked and
patented by Odyssey Thera, Inc.

1.9	Assay Throughput: 384-well plate. HEK-293T cells are seeded into 384-well microtiter plates and allowed to
adhere for 24 hours prior to 8 hour chemical exposures.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals produced from the protein fragment complementation
are indicative of a change in the receptor function and kinetics for the human nuclear receptor subfamily 4,
group A, member 2 and retinoid X receptor, alpha [GeneSymbol:NR4A2 & RXRA | GenelD:4929 & 6256 |
Uniprot_SwissProt_Accession:P43354 & P19793],

Odyssey Thera (OT) developed high-throughput chemical screening assays which utilize diverse technologies to
screen protein expression and interactions in stably transfected cell-lines. The specialized Protein
Complementation Assays (PCAs) monitor key nodes in biochemical pathways.

-------
2.2	Scientific Principles: Interaction of the tagged key proteins of the node brings separate protein fragments into
close proximity and reconstitutes a functional reporter (fluorescent protein) producing a fluorescent signal
when the pathway is unimpeded. Changes in protein complex interactions can be impacted by a variety of
biochemical events within a pathway, and these assays are designed to track xenobiotic induced changes at the
level of cell functioning which may occur at multiple points along signaling pathways following incubation with
test compound in 384-well plates.

2.3	Experimental System: adherent HEK293T cell line used. This assay utilizes HEK293T cells. The HEK-293 cell line
are human embryonic epithelial kidney cells (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK-293T cells are derived
from the HEK293 cell line by the addition of the SV40 large T antigen that has been shown to increase vector
production of some viral vectors. HEK293T are reported to have relatively high transfection efficiencies when
compared to other cell lines (COS-7 and HepG2) (Dai et al. 2015) and it is among the most frequently utilized
cell lines for in small-scale protein production and in viral vector propagation using the transient transfection
method (Lin et al. 2014).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Constitutive
expression of Phase l/ll enzymes has not been well characterized in this HEK293T cell line and metabolic
activation/detoxification of test compounds is potentially limited.

2.5	Exposure Regime: Protein-fragment complementation assay, or PCA, is a method for the identification and
quantification of protein-protein interactions. In the PCA, the proteins of interest ("bait" and "prey") are each
covalently linked to fragments of a third protein (e.g. DHFR, which acts as a "reporter"). Interaction between
the bait and the prey proteins brings the fragments of the reporter protein in close proximity to allow them to
form a functional reporter protein whose activity can be measured.

Baseline median absolute deviation for the assay (bmad): 5.157

Response cutoff threshold used to determine hit calls: 25.783

Detection technology used: Protein-fragment Complementation (Fluorescence)

2.6	Response: NURR1 signaling pathway protein fragment dimerization and enzyme formation in response to
retinoid X receptor (RXR) ligand binding is measured by changes in fluorescence intensity relative to DMSO
(neutral control) baseline.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.564 nM
Key positive control:

Isoproterenol; Fenoterol

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

188 nM
Neutral vehicle control:

DMSO

-------
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Readout data was analyzed in the positive (gain of signal) fitting direction using percent activity
lsoproterenol;Fenoterol as positive control (100 percent activity) over DMSO controls.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise
maximum, by assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single-
or multiple-concentration negative control wells (wilt = m or o) by apid, well type, and concentration.),
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall

-------
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 3064	Number of chemicals tested: 1860

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
255

Inactive hit count: Oihitc 0.9
1661

WINING MODEL SELECTION

NA hit count: hitc^O
114S

Number of sample-assay endpoints with winning hill model:

66
121

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

248

1002

quadratic-polynomialfpoly2) model: 650

exponential-2 (exp2) model:

84

-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

215

668

10

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

58.582

Neutral control median absolute deviation, by plate: nmad

5.539

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

10.17%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

155.971
11.744

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	7.022

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 215.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: MacDonald ML, Lamerdin J, Owens S, Keon BH, Bilter GK, Shang Z, Huang Z, Yu H, Dias J, Minami
T, Michnick SW, Westwick JK. Identifying off-target effects and hidden phenotypes of drugs in human cells. Nat
Chem Biol. 2006 Jun;2(6):329-37. Epub 2006 May 7. PubMed PMID: 16680159., Yu H, West M, Keon BH, Bilter
GK, Owens S, Lamerdin J, Westwick JK. Measuring drug action in the cellular context using protein-fragment
complementation assays. Assay Drug DevTechnol. 2003 Dec;l(6):811-22. PubMed PMID: 15090227.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 757

OT_PPARg_PPARgSRCl_0480

1.	General Information

1.1	Assay Title: Odyssey Thera 8-hour Protein-Complementation Assay for Peroxisome Proliferator-activated
Receptor Gamma / SRC-1 Co-activator

1.2	Assay Summary: OT PPARg PPARgSRCl 0480 is a cell-based assay that uses HEK293T, a human kidney cell line,
with measurements taken at 8 hours after chemical dosing in a 384-well plate. OT_PPARg_PPARgSRCl_0480 is
one of one assay component(s) measured or calculated from the OT_PPARg_PPARgSRCl_0480 assay. It is
designed to make measurements of protein fragment complementation, a form of binding reporter, as detected
with fluorescence intensity signals by Protein-fragment Complementation technology. Data from the assay
component OT_PPARg_PPARgSRCl_0480 was analyzed into 1 assay endpoint. This assay endpoint,
OT_PPARg_PPARgSRCl_0480, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of binding reporter, measures of receptor for gain-of-
signal activity can be used to understand the binding at the pathway-level as they relate to the gene PPARG.
Furthermore, this assay endpoint can be referred to as a primary readout, because the performed assay has
only produced 1 assay endpoint. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the nuclear receptor intended target family, where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Odyssey Thera is a drug discovery company focusing on high-throughput screening with a
particular focus on developing novel protein:protein interaction assays using protein-fragment
complementation technology.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The protein-fragment complementation assay (PCA) methods are trademarked and
patented by Odyssey Thera, Inc.

1.9	Assay Throughput: 384-well plate. Stably transfected HEK293T cells are aliquoted into 384-well microtiter plates
and incubated with test compounds for 8 hours prior to monitoring fluorescence emission resulting from
xenobiotic PPAR gamma activation and co-factor recruitment.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals produced from the protein fragment complementation
are indicative of a change in the receptor function and kinetics for the human peroxisome proliferator-activated
receptor gamma and SRC proto-oncogene, non-receptor tyrosine kinase [GeneSymbol:PPARG & SRC |
GenelD:5468 & 6714 | Uniprot_SwissProt_Accession:P37231 & P12931],

The OT PPAR-gamma/SRC-1 assay is a PCA expressed transiently in HEK293T cells for 48h prior to treatment
with test compound for 8 hours. Cells are fixed with 4 percent formaldehyde and stained with Draq5 to identify
cells and subcellular compartment boundaries prior to signal detection. Fluorescent signal in the basal state of
the assay is very low and is restricted to the nucleus. Modulation of this assay is quantified as an increase in

-------
mean fluorescence in the nucleus relative to vehicle controls, and can be quantified on a high content imaging
device or a laser scanning cytometer. Stimulation of the assay was assessed in a 10-point dose response with
the selective PPAR-gamma agonist GW1929. Each PPAR-gamma protein and its associated coactivator (SRC-1)
contain a rationally dissected fragment of a yellow-fluorescent protein (YFP) reporter enzyme. When the PPAR-
gamma responsive signaling pathway is impacted by chemical activation or interference, the resulting YFP signal
production can be measured using fluorescence microscopy to screen a diverse chemical library for potential
xenobiotic PPAR-gamma ligand-binding. Changes in protein complex interactions can be impacted by a variety
of biochemical events within a pathway, and this assay is designed to track changes at the level of cell
functioning which may occur at a number of points along the signaling pathway following an 8-hour incubation
of cells with test compound in 384-well plate, using DMSO as a negative control and baseline signal and GW1929
as a positive control and measure of 100 percent PPAR-gamma/SRCl activation.

2.2	Scientific Principles: The peroxisome proliferator-activated receptor gamma (PPAR-gamma) is a ligand-
activated nuclear receptor that plays a key role in mediating differentiation of adipocytes and regulating fat
metabolism. PPAR-gamma has been implicated in the pathophysiology of atherosclerosis, inflammation,
obesity, diabetes, immune response, and ageing. The PPAR-gamma nuclear receptor functions as a transcription
factor as part of a large protein complex through interactions with transcriptional co-repressors and co-
activators. PPARs form heterodimers with the retinoid X receptor (RXR) and bind to PPAR response elements
(PPREs) in enhancer sites of regulated genes. In the absence of ligand, nuclear receptor co-repressors bind to
these heterodimers and recruit histone deactylases (HDACs) to repress transcription. Ligand binding to the C-
terminal activation function (AF-2) domain induces a conformational change in the receptor dimer which
excludes co-repressors from the complex. Ligand binding also increases PPAR's affinity for a number of co-
activators, including SRC-1, whose binding facilitates chromatin remodeling by histone modification and
nucleosome mobilization, leading to the recruitment of the basal transcription machinery to PPAR target genes.
Therefore, measuring stimulation of the PPAR-gamma/SRC-1 complex represents ligand-dependent activation
of the receptor. This assay is intended for use as a part of an integrated testing strategy, to screen a large
structurally diverse chemical library for compounds with the potential to interact with peroxisome proliferator-
activated receptor alpha (PPARg) receptor mediated pathways and potentially affect endocrine systems in
exposed populations. There is some evidence to support a putative AOP linking PPAR gamma receptor activation
with increased occurrence of sarcomas in rats, mice, and hamsters (AOP currently under development).
Chemical-activity profiles derived from this assay can inform prioritization decisions for compound selection in
more resource intensive in vivo studies to further investigate the involvement of PPAR activation in pathways
leading to hazardous outcomes in biological systems.

2.3	Experimental System: adherent HEK293T cell line used. This assay monitors the ligand binding domain (LBD) of
the human PPAR-gamma protein (transiently expressed in HEK293T) for xenobiotic activation. The HEK-293 cell
line are human embryonic epithelial kidney cells (of unknown parentage) transformed with sheared adenovirus
5 DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK-293T cells are derived
from the HEK293 cell line by the addition of the SV40 large T antigen that has been shown to increase vector
production of some viral vectors. HEK293T are reported to have relatively high transfection efficiencies when
compared to other cell lines (COS-7 and HepG2) (Dai et al. 2015) and it is among the most frequently utilized
cell lines for in small-scale protein production and in viral vector propagation using the transient transfection
method (Lin et al. 2014).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Constitutive
expression of Phase l/ll enzymes has not been well characterized in this HEK293T cell line and metabolic
activation/detoxification of test compounds is potentially limited.

2.5	Exposure Regime: The OT PPARg/SRC-1 assay assessed receptor-chemical interactions using a rapidly maturing,
intensely fluorescent mutant of YFP known as Venus, rationally dissected into two separate fragments. The
fragments were obtained as follows: first, fragments coding for YFP1 and YFP2 (corresponding to amino acid

-------
residues 1-158 and 159-239 of the full length YFP, respectively) were generated by oligonucleotide synthesis
(Blue Heron Biotechnology), and then PCR mutagenesis was used to generate the mutant fragments IFP1 and
IFP2. Fusion constructs were transfected into HEK293T cells with a (Gly4Ser)2 linker between the PPARg/SRC-1
and YFP fragment genes to facilitate complementation when interacting proteins bring fragments into close
proximity. The construct is transiently transfected in HEK293T cells 48 hours prior to treatment with test
compounds. Cells are seeded into optical quality 384-well poly-D-lysine coated plates in MEM alpha medium
(Invitrogen) supplemented with 10 percent FBS (Gemini Bio-Products) 1 percent penicillin and 1 percent
streptomycin, and grown in 37C incubator equilibrated to 5 percent C02. Cells are allowed to adhere for 24
hours prior to treatment with compounds of interest or controls for 8 hours. Cells are fixed in 4 percent
formaldehyde and stained with Draq5 (BioStatus) to identify cells and subcellular compartment boundaries prior
to signal detection. 8 Images per well were acquired on an Evotec Opera at 2 wavelengths (488 and 635nm),
and the ratio of fluorescence in the nucleus relative to fluorescence in the cytoplasm (N/C Ratio) in the 488nm
channel was calculated for a minimum of 350 cells per image.

Baseline median absolute deviation for the assay (bmad): 1.192

Response cutoff threshold used to determine hit calls: 20

Detection technology used: Protein-fragment Complementation (Fluorescence)

2.6	Response: Peroxisome proliferator-activated receptor gamma signaling pathway protein formation in response
to PPAR-gamma ligand-binding /SRC-1 co-factor recruitment is measured by changes in fluorescence intensity
relative to DMSO (neutral control) baseline.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.564 nM
Key positive control:

GW1929

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

188 nM
Neutral vehicle control:

DMSO

NA

-------
3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Each data point was formed by taking the log of the ratio of the sample signal to the vehicle
control (DMSO) signal. A minimum of 8 replicate wells were analyzed each for sample and vehicle controls. Wells
located in the outer ring of the plate were omitted due to the potential for edge effects. Data were captured on
a confocal microscope: the fold increase in mean fluorescence intensity was calculated relative to the vehicle
controls from 16 images, each containing a minimum of 350 cells. Gain-of-signal activity indicates PPAR-
gamma/SRC-1 complex formation and data are plotted as percent of GW1929 activity, and are plotted relative
to DMSO, negative control and signal baseline.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise
maximum, by assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single-
or multiple-concentration negative control wells (wilt = m or o) by apid, well type, and concentration.),
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active

-------
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 3064	Number of chemicals tested: 1860

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: Oihitc 0.9	NA hit count: hitc^O

34	2898	132

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	87

gain-loss (gnls) model:	119

power(pow) model:	349

linear-polynomial (polyl) model:	1052

quadratic-polynomial(poly2) model:	448

exponential-2 (exp2) model:	28

exponential-3 (exp3) model:	7

exponential-4 (exp4) model:	778

exponential-5 (exp5) model:	196

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

-------
3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	11.16

Neutral control median absolute deviation, by plate: nmad	1.825

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	17.93%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	107.941

Positive control well median absolute deviation, by plate: pmad	7.784

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	11.105

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

-------
NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 196.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

-------
6.	Bibliography: MacDonald ML, Lamerdin J, Owens S, Keon BH, Bilter GK, Shang Z, Huang Z, Yu H, Dias J, Minami
T, Michnick SW, Westwick JK. Identifying off-target effects and hidden phenotypes of drugs in human cells. Nat
Chem Biol. 2006 Jun;2(6):329-37. Epub 2006 May 7. PubMed PMID: 16680159., Yu H, West M, Keon BH, Bilter
GK, Owens S, Lamerdin J, Westwick JK. Measuring drug action in the cellular context using protein-fragment
complementation assays. Assay Drug DevTechnol. 2003 Dec;l(6):811-22. PubMed PMID: 15090227.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 758

OT_PPARg_PPARgSRCl_1440

1.	General Information

1.1	Assay Title: Odyssey Thera 24-hour Protein-Complementation Assay for Peroxisome Proliferator-activated
Receptor Gamma/ SRC-1 Co-activator

1.2	Assay Summary: OT PPARg PPARgSRCl 1440 is a cell-based assay that uses HEK293T, a human kidney cell line,
with measurements taken at 24 hours after chemical dosing in a 384-well plate. OT_PPARg_PPARgSRCl_1440
is one of one assay component(s) measured or calculated from the OT_PPARg_PPARgSRCl_1440 assay. It is
designed to make measurements of protein fragment complementation, a form of binding reporter, as detected
with fluorescence intensity signals by Protein-fragment Complementation technology. Data from the assay
component OT_PPARg_PPARgSRCl_1440 was analyzed into 1 assay endpoint. This assay endpoint,
OT_PPARg_PPARgSRCl_1440, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of binding reporter, measures of receptor for gain-of-
signal activity can be used to understand the binding at the pathway-level as they relate to the gene PPARG.
Furthermore, this assay endpoint can be referred to as a primary readout, because the performed assay has
only produced 1 assay endpoint. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the nuclear receptor intended target family, where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Odyssey Thera is a drug discovery company focusing on high-throughput screening with a
particular focus on developing novel protein:protein interaction assays using protein-fragment
complementation technology.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The protein-fragment complementation assay (PCA) methods are trademarked and
patented by Odyssey Thera, Inc.

1.9	Assay Throughput: 384-well plate. Stably transfected HEK293T cells are aliquoted into 384-well microtiter plates
and incubated with test compounds for 24 hours prior to monitoring fluorescence emission resulting from
xenobiotic PPAR gamma activation and co-factor recruitment.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to fluorescence intensity signals produced from the protein fragment complementation
are indicative of a change in the receptor function and kinetics for the human peroxisome proliferator-activated
receptor gamma and SRC proto-oncogene, non-receptor tyrosine kinase [GeneSymbol:PPARG & SRC |
GenelD:5468 & 6714 | Uniprot_SwissProt_Accession:P37231 & P12931],

The OT PPAR-gamma/SRC-1 assay is a PCA expressed transiently in HEK293T cells for 48h prior to treatment
with test compounds for 24 hours. Cells are fixed with 4 percent formaldehyde and stained with Draq5 to
identify cells and subcellular compartment boundaries prior to signal detection. Fluorescent signal in the basal
state of the assay is very low and is restricted to the nucleus. Modulation of this assay is quantified as an increase

-------
in mean fluorescence in the nucleus relative to vehicle controls, and can be quantified on a high content imaging
device or a laser scanning cytometer. Stimulation of the assay was assessed in a 10-point dose response with
the selective PPAR-gamma agonist GW1929. Each PPAR-gamma protein and its associated coactivator (SRC-1)
contain a rationally dissected fragment of a yellow-fluorescent protein (YFP) reporter enzyme. When the PPAR-
gamma responsive signaling pathway is impacted by chemical activation or interference, the resulting YFP signal
production can be measured using fluorescence microscopy to screen a diverse chemical library for potential
xenobiotic PPAR-gamma ligand-binding. Changes in protein complex interactions can be impacted by a variety
of biochemical events within a pathway, and this assay is designed to track changes at the level of cell
functioning which may occur at a number of points along the signaling pathway following a 24-hour incubation
of cells with test chemicals in 384-well plate, using DMSO as a negative control and baseline signal and GW1929
as a positive control and measure of 100 percent PPAR-gamma/SRCl activation.

2.2	Scientific Principles: The peroxisome proliferator-activated receptor gamma (PPAR-gamma) is a ligand-
activated nuclear receptor that plays a key role in mediating differentiation of adipocytes and regulating fat
metabolism. PPAR-gamma has been implicated in the pathophysiology of atherosclerosis, inflammation,
obesity, diabetes, immune response, and ageing. The PPAR-gamma nuclear receptor functions as a transcription
factor as part of a large protein complex through interactions with transcriptional co-repressors and co-
activators. PPARs form heterodimers with the retinoid X receptor (RXR) and bind to PPAR response elements
(PPREs) in enhancer sites of regulated genes. In the absence of ligand, nuclear receptor co-repressors bind to
these heterodimers and recruit histone deactylases (HDACs) to repress transcription. Ligand binding to the C-
terminal activation function (AF-2) domain induces a conformational change in the receptor dimer which
excludes co-repressors from the complex. Ligand binding also increases PPAR's affinity for a number of co-
activators, including SRC-1, whose binding facilitates chromatin remodeling by histone modification and
nucleosome mobilization, leading to the recruitment of the basal transcription machinery to PPAR target genes.
Therefore, measuring stimulation of the PPAR-gamma/SRC-1 complex represents ligand-dependent activation
of the receptor. This assay is intended for use as a part of an integrated testing strategy, to screen a large
structurally diverse chemical library for compounds with the potential to interact with peroxisome proliferator-
activated receptor alpha (PPARg) receptor mediated pathways and potentially affect endocrine systems in
exposed populations. There is some evidence to support a putative AOP linking PPAR gamma receptor activation
with increased occurrence of sarcomas in rats, mice, and hamsters (AOP currently under development).
Chemical-activity profiles derived from this assay can inform prioritization decisions for compound selection in
more resource intensive in vivo studies to further investigate the involvement of PPAR activation in pathways
leading to hazardous outcomes in biological systems.

2.3	Experimental System: adherent HEK293T cell line used. This assay monitors the ligand binding domain (LBD) of
the human PPAR-gamma protein (transiently expressed in HEK293T) for xenobiotic activation. The HEK-293 cell
line are human embryonic epithelial kidney cells (of unknown parentage) transformed with sheared adenovirus
5 DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK-293T cells are derived
from the HEK293 cell line by the addition of the SV40 large T antigen that has been shown to increase vector
production of some viral vectors. HEK293T are reported to have relatively high transfection efficiencies when
compared to other cell lines (COS-7 and HepG2) (Dai et al. 2015) and it is among the most frequently utilized
cell lines for in small-scale protein production and in viral vector propagation using the transient transfection
method (Lin et al. 2014).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Constitutive
expression of Phase l/ll enzymes has not been well characterized in this HEK293T cell line and metabolic
activation/detoxification of test compounds is potentially limited.

2.5	Exposure Regime: The OT PPARg/SRC-1 assay assessed receptor-chemical interactions using a rapidly maturing,
intensely fluorescent mutant of YFP known as Venus, rationally dissected into two separate fragments. The
fragments were obtained as follows: first, fragments coding for YFP1 and YFP2 (corresponding to amino acid

-------
residues 1-158 and 159-239 of the full length YFP, respectively) were generated by oligonucleotide synthesis
(Blue Heron Biotechnology), and then PCR mutagenesis was used to generate the mutant fragments IFP1 and
IFP2. Fusion constructs were transfected into HEK293T cells with a (Gly4Ser)2 linker between the PPARg/SRC-1
and YFP fragment genes to facilitate complementation when interacting proteins bring fragments into close
proximity. The construct is transiently transfected in HEK293T cells 48 hours prior to treatment with test
compounds. Cells are seeded into optical quality 384-well poly-D-lysine coated plates in MEM alpha medium
(Invitrogen) supplemented with 10 percent FBS (Gemini Bio-Products) 1 percent penicillin and 1 percent
streptomycin, and grown in 37C incubator equilibrated to 5 percent C02. Cells are allowed to adhere for 24
hours prior to treatment with compounds of interest or controls for 24 hours. Cells are fixed in 4 percent
formaldehyde and stained with Draq5 (BioStatus) to identify cells and subcellular compartment boundaries prior
to signal detection. 8 Images per well were acquired on an Evotec Opera at 2 wavelengths (488 and 635nm),
and the ratio of fluorescence in the nucleus relative to fluorescence in the cytoplasm (N/C Ratio) in the 488nm
channel was calculated for a minimum of 350 cells per image.

Baseline median absolute deviation for the assay (bmad): 0.938

Response cutoff threshold used to determine hit calls: 20

Detection technology used: Protein-fragment Complementation (Fluorescence)

2.6	Response: Peroxisome proliferator-activated receptor gamma signaling pathway protein formation in response
to PPAR-gamma ligand-binding /SRC-1 co-factor recruitment is measured by changes in fluorescence intensity
relative to DMSO (neutral control) baseline.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.564 nM
Key positive control:

GW1929

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

188 nM
Neutral vehicle control:

DMSO

NA

-------
3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Each data point was formed by taking the log of the ratio of the sample signal to the vehicle
control (DMSO) signal. A minimum of 8 replicate wells were analyzed each for sample and vehicle controls. Wells
located in the outer ring of the plate were omitted due to the potential for edge effects. Data were captured on
a confocal microscope: the fold increase in mean fluorescence intensity was calculated relative to the vehicle
controls from 16 images, each containing a minimum of 350 cells. Gain-of-signal activity indicates PPAR-
gamma/SRC-1 complex formation and data are plotted as percent of GW1929 activity, and are plotted relative
to DMSO, negative control and signal baseline.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

2: bval.apid.lowconc.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for test compound wells (wilt = t) with a concentration index (cndx)
of 1 or 2.), 3: pval.apid.medpcbyconc.max (Calculate the positive control value (pval) as the plate-wise
maximum, by assay plate ID (apid), of the medians of the corrected values (cval) for gain-of-signal single-
or multiple-concentration negative control wells (wilt = m or o) by apid, well type, and concentration.),
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active

-------
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 3064	Number of chemicals tested: 1860

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: Oihitc 0.9	NA hit count: hitc^O

38	2974	52

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	87

gain-loss (gnls) model:	129

power(pow) model:	316

linear-polynomial (polyl) model:	1005

quadratic-polynomial(poly2) model:	532

exponential-2 (exp2) model:	24

exponential-3 (exp3) model:	9

exponential-4 (exp4) model:	743

exponential-5 (exp5) model:	219

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

-------
3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	10.872

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

10.104
1.483
15.22%

133.452
11.684

NA

Positive control signal-to-background: (pmed/nmed)

NA

-------
NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 219.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

-------
6.	Bibliography: MacDonald ML, Lamerdin J, Owens S, Keon BH, Bilter GK, Shang Z, Huang Z, Yu H, Dias J, Minami
T, Michnick SW, Westwick JK. Identifying off-target effects and hidden phenotypes of drugs in human cells. Nat
Chem Biol. 2006 Jun;2(6):329-37. Epub 2006 May 7. PubMed PMID: 16680159., Yu H, West M, Keon BH, Bilter
GK, Owens S, Lamerdin J, Westwick JK. Measuring drug action in the cellular context using protein-fragment
complementation assays. Assay Drug DevTechnol. 2003 Dec;l(6):811-22. PubMed PMID: 15090227.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1682

STM_H9_CystinelSnorm_perc

1.	General Information

1.1	Assay Title: Stemina devTOX quickPredict H9 human embryonic stem cell (hESC)-based Assay for Cystine (CYSS)
Metabolite Biomarker, Normalized

1.2	Assay Summary: STM_H9_secretome utilizes undifferentiated H9 cells to measure relative changes in two
metabolites, ornithine (ORN) and cystine (CYSS) in the secretome, targeting a critical drop in the ORN/CYSS ratio
as a biomarker for developmental toxicity. STM_H9_CystinelSnorm_perc is one of 10 components calculated in
the STM_H9_secretome assay. It measures the integrated area of the endogenous cystine normalized to the
spiked-in isotopically labeled standard and to the median value of the 0.1% DMSO neutral control treatment
using UPLC-HRMS technology. Data from the assay component STM_H9_CystinelSnorm_perc was analyzed at
the assay endpoint STM_H9_CystinelSnorm_perc in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can
be used to understand relative metabolite change.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Stemina Biomarker Discovery is a Contract Research Organization (CRO) that provides stem cell-
based developmental toxicity screening for chemical compounds.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: devTOX quickPredict platform is a technology trademarked and patented by Stemina
Biomarker Discovery, Inc.

1.9	Assay Throughput: 96-well plate. All treatments were carried out in Matrigel-coated 96-well plates. H9 cells
were plated with a seeding density of 100000 cells per well in mTeSRl medium containing lOuM Y27632 Rho-
associated kinase inhibitor (ATCC, Manassas, Virginia) to increase plating efficiency. Y27632 was removed prior
to compound addition at 24h after plating.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in cystine utilization relative to the neutral control.

The Stemina devTOX quickPredict platform is a human pluripotent stem cell-based assay that predicts the
developmental toxicity potential based on changes in cellular metabolism following chemical exposure.

2.2	Scientific Principles: Profiling hESCs for their secreted metabolites has been proposed as an alternative testing
platform for identifying compounds with potential developmental toxicity. Dynamic variations in metabolite
abundance with functional changes in biochemical pathways and cellular metabolic response may be direct or
secondary consequences of chemical exposure, the profile of intermediary metabolites and small molecules
released by hESCs to their environment (secretome) could lead to identification of the extent of adverse
outcome pathways in the developing embryo. The ToxCast STM platform described here provides a potency
read-out of a chemical compound's exposure-based potential for developmental toxicity based on a critical
imbalance in the targeted biomarker (decreased ORN/CYSS ratio detected in the H9 hESC conditioned medium).

-------
2.3	Experimental System: monolayer hESC cell-based used. H9 cells (NIH code WA09, WiCell Research Institute, Inc,
Madison, Wisconsin) were used as approved for federally funded research and selected because of their
commercial availability, genetic stability (normal female karyotype), and scientific legacy (hundreds of
publications). Derivation and characterization of the H9 cell line was originally reported by Thomson et al.
(1998). Cells were handled as described (Palmer et al., 2013). Briefly, cells were maintained under feeder-free
conditions with mTeSRl media (StemCell Technologies, Vancouver, Canada) on Matrigel hESC-Qualified Matrix
(Corning, Bedford, Massachusetts) coated 6-well plates. Cultures were incubated at 37C in a humidified
atmosphere of <5% C02. Differentiated colonies were removed daily through aspiration to maintain the
undifferentiated stem cell population. Differentiation was based on visual inspection; there is typically < 5%
differentiation in a culture, and only highly pure undifferentiated H9 cell populations were used for these
experiments. Cultures were passaged using Versene (Life Technologies, Grand Island, New York) or ReLeSR
(StemCell Technologies) at 85%-90% confluency, karyotyped approximately every 10 passages, and the absence
of mycoplasma was routinely confirmed with the MycoAlert Mycoplasma Detection Kit (Lonza, Rockland,
Maine). The passage number of H9 cells used over the course of this study ranged from 31 to 48; anything above
passage 40 was karyotyped within 10 passages prior to use in the assay.

2.4	Metabolic Competence: hES cells are an innovative in vitro model system that is metabolically similar to
embryonic epiblast cells at gastrulation. Perturbations in cellular metabolism may be indicative of

2.5 Exposure Regime: H9 hESCs maintained in a pluripotent state were exposed to test compound for 72h with
chemical replenishment with media replacement every 24h. Cell-conditioned media from the final 24-h
treatment period was collected for analysis of the targeted biomarker, and cell viability of the corresponding
cell layer was assessed. The chemical library was tested in blinded fashion. Each test plate included luM
Methotrexate (MTX; Selleck Chemicals, Houston, Texas) as a positive reference, 5 nM MTX as a negative
reference, 0.1% DMSO as the neutral (vehicle) control, and sample-level media blanks. H9 cell-conditioned
media from the final 24-h treatment period was collected for analysis of the targeted biomarker and cell viability
was measured from the corresponding cell layer. Cell viability was measured using the CellTiter-Fluor assay
(Promega, Madison, Wisconsin) based on proteolytic cleavage of a substrate to fluorescent signal proportional
to the number of living cells. The cell viability Relative Fluorescence Unit (RFU) was background corrected and
normalized to mean RFU of the neutral control (0.1% DMSO). The collected H9-cell-conditioned media samples
were processed for targeted biomarker analysis as described (Palmer et al., 2013). Briefly, spent media samples
were deproteinized (40% acetonitrile) and processed for UPLC-HRMS. Data acquisition was performed using 4
separate UPLC-HRMS systems, consisting of an Agilent 1290 Infinity LC system (Agilent Technologies) interfaced
with an Agilent high-resolution mass spectrometer (models G6520A, G6520B, G6530A, and G6224A). A Waters
Acquity UPLC BEH Amide column (2.1mm x 50mm, 1.71m particle size; Waters, Milford, Massachusetts)
maintained at 40C was applied for separation of metabolites using a 6.5min solvent gradient with 0.1% formic
acid in water and 0.1% formic acid in acetonitrile (l.Oml/min flow rate). Data were acquired using MassHunter
Acquisition software (version B 04.00, Agilent Technologies). The extracted ion chromatogram (EIC) areas for
ORN, CYSS, and their respective spike-in C13-standards were determined using the Agilent MassHunter
Quantitative Analysis software, version B.05.00 or newer (Agilent Technologies), and data were normalized as
described in Palmer et al. (2017).

teratogenicity.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

300 nM
Key positive control:

Methotrexate

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

le+06 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.151

-------
Response cutoff threshold used to determine hit calls: 0.453
Detection technology used: UPLC-HRMS (Mass spectrometry)

2.6	Response: This assay measures relative changes in 2 metabolites, ornithine (ORN) and cystine (CYSS), targeting
the ORN/CYSS ratio as a biomarker for developmental toxicity. Ornithine is a nonproteogenic amino acid that
functions in several biochemical pathways including ammonia detoxification in the urea cycle, pyrimidine
synthesis via ornithine transcarbamy-lase, and polyamine synthesis via ornithine decarboxylase. Ornithine is
initially absent from the medium but released from viable cells; as such, decreased cellular release reflects
general metabolic states for these pathways. Cystine is initially present in the medium and used by cells in
glutathione production; as such, the change connected to decreased CYSS uptake likely reflects a change in
cellular glutathione synthesis and redox balance. Although minor effects on cell viability could account for
changes in cellular ORN release and/or CYSS uptake measured in the ORN/CYSS ratio, altered cell growth and/or
survival are potential modes of action in developmental toxicity. As such, compounds that impact the ORN/CYSS
biomarker due to minimal effects on cell viability should not be discounted because of it. Although cell viability
is measured, it is not included in the prediction by the assay, which is based solely on the ORN/CYSS ratio
response. Cell viability is provided as an additional endpoint to aid in the interpretation of the data.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of metabolite.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All raw data and metadata were loaded into a central database for ToxCast using standard
nomenclature to pipeline the data into invitrodb under for the Stemina DevToxqP assay source identifier (asid)
for the ToxCast platform, designated STM_H9 (asid 14); specifically, the 2 assay identifiers (aid)
STM_H9_secretome (aid 428) and STM_H9_viability (aid 437)representing data measures for the conditioned
media and cell monolayer, respectively. This included identifiers for tracking each chemical sample (spid), plate
identifier (apid), well position (row, column), micromolar concentration tested (dose), well quality (0=fail,
l=pass), and well type (wilt). The well type identifiers included media blank "b" lacking H9 cells, neutral control
"n" 0.1% DMSO, test compound "t," negative control "m" 0.005uM MTX, and positive control "p" l.OuM MTX.
Raw data values (rval) were stored under the following assay components (acid): peak area for C13-cystine (acid
1023) and C13-ornithine (acid 1024) tracers, peak area for measured cystine (acid 1025) and cystine
standardized to the C13 tracer (acid 1026), cystine normalized to the plate median value of the neutral controls
(acid 1027), peak area for measured ornithine (acid 1028) and ornithine standardized to the C13 tracer (acid

-------
1029), ornithine normalized to the plate median value of the neutral controls (acid 1030), the ORN:CYSS ratio
calculated from DMSO-normalized values (acid 1031), the targeted biomarker prediction (acid 1032, which is an
empty placeholder), background-corrected RFU from the CellTiter-Fluor assay (acid 1113), and cell viability
normalized to mean RFU of the DMSO control (acid 1114). The corresponding assay endpoint identifiers (aeids)
analyzed for the predictive model were: the decrease in media ornithine reflecting reduced cellular release
(STM_H9_ornithinelSnorm_perc, aeid 1688), the increase in media cystine reflecting less utilization
(STM_H9_cystinelSnorm_perc, aeid 1682), the ORN/CYSS ratio reflecting a decrease in the ORN:CYSS ratio as
the primary biomarker (STM_H9_OrnCysslSnorm_ratio, aeid 1690), and normalized cell viability
(STM_H9_NormalizedViability, aeid 1858).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates

-------
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 379	Number of chemicals tested: 379

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1

Neutral control median absolute deviation, by plate: nmad	0.054

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.44%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	4.115

Positive control well median absolute deviation, by plate: pmad	0.095

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	18.712

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	1

Negative control well median absolute deviation value, by plate: mmad	0.041

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	0.001

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

-------
(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 58.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Rush N, Kothiya P, Judson RS, Houck KA, Hunter ES, Baker NC, Palmer JA,
Thomas RS, Knudsen TB. Profiling the ToxCast Library With a Pluripotent Human (H9) Stem Cell Line-Based
Biomarker Assay for Developmental Toxicity. Toxicol Sci. 2020 Apr l;174(2):189-209. doi:
10.1093/toxsci/kfaa014. PMID: 32073639., Palmer, J. A., Smith, A. M., Egnash, L. A., Conard, K. R., West, P. R.,
Burrier, R. E., Donley, E. L., & Kirchner, F. R. (2013). Establishment and assessment of a new human embryonic
stem cell-based biomarker assay for developmental toxicity screening. Birth defects research. Part B,
Developmental and reproductive toxicology, 98(4), 343-363. https://doi.org/10.1002/bdrb.21078, Palmer, J. A.,
Smith, A. M., Egnash, L. A., Colwell, M. R., Donley, E. L. R., Kirchner, F. R., & Burrier, R. E. (2017). A human induced
pluripotent stem cell-based in vitro assay predicts developmental toxicity through a retinoic acid receptor-
mediated pathway for a series of related retinoid analogues. Reproductive toxicology (Elmsford, N.Y.), 73, 350-
361. https://doi.Org/10.1016/j.reprotox.2017.07.011

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1688

STM_H9_OmithinelSnorm_perc

1.	General Information

1.1	Assay Title: Stemina devTOX quickPredict H9 human embryonic stem cell (hESC)-based Assay for Ornithine
(ORN) Metabolite Biomarker, Normalized

1.2	Assay Summary: STM_H9_secretome utilizes undifferentiated H9 cells to measure relative changes in two
metabolites, ornithine (ORN) and cystine (CYSS) in the secretome, targeting a critical drop in the ORN/CYSS ratio
as a biomarker for developmental toxicity. STM_H9_OrnithinelSnorm is one of 10 components calculated in the
STM_H9_secretome assay. It measures the integrated area of the endogenous ornithine normalized to the
spiked-in isotopically labeled standard and to the median value of the 0.1% DMSO neutral control treatment
using UPLC-HRMS technology. Data from the assay component STM_H9_OrnithinelSnorm_perc was analyzed
at the assay endpoint STM_H9_OrnithinelSnorm_perc in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity
can be used to understand relative metabolite change.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Stemina Biomarker Discovery is a Contract Research Organization (CRO) that provides stem cell-
based developmental toxicity screening for chemical compounds.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: devTOX quickPredict platform is a technology trademarked and patented by Stemina
Biomarker Discovery, Inc.

1.9	Assay Throughput: 96-well plate. All treatments were carried out in Matrigel-coated 96-well plates. H9 cells
were plated with a seeding density of 100000 cells per well in mTeSRl medium containing lOuM Y27632 Rho-
associated kinase inhibitor (ATCC, Manassas, Virginia) to increase plating efficiency. Y27632 was removed prior
to compound addition at 24h after plating.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in ornithine release relative to the neutral control.

The Stemina devTOX quickPredict platform is a human pluripotent stem cell-based assay that predicts the
developmental toxicity potential based on changes in cellular metabolism following chemical exposure.

2.2	Scientific Principles: Profiling hESCs for their secreted metabolites has been proposed as an alternative testing
platform for identifying compounds with potential developmental toxicity. Dynamic variations in metabolite
abundance with functional changes in biochemical pathways and cellular metabolic response may be direct or
secondary consequences of chemical exposure, the profile of intermediary metabolites and small molecules
released by hESCs to their environment (secretome) could lead to identification of the extent of adverse
outcome pathways in the developing embryo. The ToxCast STM platform described here provides a potency
read-out of a chemical compound's exposure-based potential for developmental toxicity based on a critical
imbalance in the targeted biomarker (decreased ORN/CYSS ratio detected in the H9 hESC conditioned medium).

-------
2.3	Experimental System: monolayer hESC cell-based used. H9 cells (NIH code WA09, WiCell Research Institute, Inc,
Madison, Wisconsin) were used as approved for federally funded research and selected because of their
commercial availability, genetic stability (normal female karyotype), and scientific legacy (hundreds of
publications). Derivation and characterization of the H9 cell line was originally reported by Thomson et al.
(1998). Cells were handled as described (Palmer et al., 2013). Briefly, cells were maintained under feeder-free
conditions with mTeSRl media (StemCell Technologies, Vancouver, Canada) on Matrigel hESC-Qualified Matrix
(Corning, Bedford, Massachusetts) coated 6-well plates. Cultures were incubated at 37C in a humidified
atmosphere of <5% C02. Differentiated colonies were removed daily through aspiration to maintain the
undifferentiated stem cell population. Differentiation was based on visual inspection; there is typically < 5%
differentiation in a culture, and only highly pure undifferentiated H9 cell populations were used for these
experiments. Cultures were passaged using Versene (Life Technologies, Grand Island, New York) or ReLeSR
(StemCell Technologies) at 85%-90% confluency, karyotyped approximately every 10 passages, and the absence
of mycoplasma was routinely confirmed with the MycoAlert Mycoplasma Detection Kit (Lonza, Rockland,
Maine). The passage number of H9 cells used over the course of this study ranged from 31 to 48; anything above
passage 40 was karyotyped within 10 passages prior to use in the assay.

2.4	Metabolic Competence: hES cells are an innovative in vitro model system that is metabolically similar to
embryonic epiblast cells at gastrulation. Perturbations in cellular metabolism may be indicative of

2.5 Exposure Regime: H9 hESCs maintained in a pluripotent state were exposed to test compound for 72h with
chemical replenishment with media replacement every 24h. Cell-conditioned media from the final 24-h
treatment period was collected for analysis of the targeted biomarker, and cell viability of the corresponding
cell layer was assessed. The chemical library was tested in blinded fashion. Each test plate included luM
Methotrexate (MTX; Selleck Chemicals, Houston, Texas) as a positive reference, 5 nM MTX as a negative
reference, 0.1% DMSO as the neutral (vehicle) control, and sample-level media blanks. H9 cell-conditioned
media from the final 24-h treatment period was collected for analysis of the targeted biomarker and cell viability
was measured from the corresponding cell layer. Cell viability was measured using the CellTiter-Fluor assay
(Promega, Madison, Wisconsin) based on proteolytic cleavage of a substrate to fluorescent signal proportional
to the number of living cells. The cell viability Relative Fluorescence Unit (RFU) was background corrected and
normalized to mean RFU of the neutral control (0.1% DMSO). The collected H9-cell-conditioned media samples
were processed for targeted biomarker analysis as described (Palmer et al., 2013). Briefly, spent media samples
were deproteinized (40% acetonitrile) and processed for UPLC-HRMS. Data acquisition was performed using 4
separate UPLC-HRMS systems, consisting of an Agilent 1290 Infinity LC system (Agilent Technologies) interfaced
with an Agilent high-resolution mass spectrometer (models G6520A, G6520B, G6530A, and G6224A). A Waters
Acquity UPLC BEH Amide column (2.1mm x 50mm, 1.71m particle size; Waters, Milford, Massachusetts)
maintained at 40C was applied for separation of metabolites using a 6.5min solvent gradient with 0.1% formic
acid in water and 0.1% formic acid in acetonitrile (l.Oml/min flow rate). Data were acquired using MassHunter
Acquisition software (version B 04.00, Agilent Technologies). The extracted ion chromatogram (EIC) areas for
ORN, CYSS, and their respective spike-in C13-standards were determined using the Agilent MassHunter
Quantitative Analysis software, version B.05.00 or newer (Agilent Technologies), and data were normalized as
described in Palmer et al. (2017).

teratogenicity.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

300 nM
Key positive control:

Methotrexate

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

le+06 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.067

-------
Response cutoff threshold used to determine hit calls: 0.201
Detection technology used: UPLC-HRMS (Mass spectrometry)

2.6	Response: This assay measures relative changes in 2 metabolites, ornithine (ORN) and cystine (CYSS), targeting
the ORN/CYSS ratio as a biomarker for developmental toxicity. Ornithine is a nonproteogenic amino acid that
functions in several biochemical pathways including ammonia detoxification in the urea cycle, pyrimidine
synthesis via ornithine transcarbamy-lase, and polyamine synthesis via ornithine decarboxylase. Ornithine is
initially absent from the medium but released from viable cells; as such, decreased cellular release reflects
general metabolic states for these pathways. Cystine is initially present in the medium and used by cells in
glutathione production; as such, the change connected to decreased CYSS uptake likely reflects a change in
cellular glutathione synthesis and redox balance. Although minor effects on cell viability could account for
changes in cellular ORN release and/or CYSS uptake measured in the ORN/CYSS ratio, altered cell growth and/or
survival are potential modes of action in developmental toxicity. As such, compounds that impact the ORN/CYSS
biomarker due to minimal effects on cell viability should not be discounted because of it. Although cell viability
is measured, it is not included in the prediction by the assay, which is based solely on the ORN/CYSS ratio
response. Cell viability is provided as an additional endpoint to aid in the interpretation of the data.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of metabolite.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All raw data and metadata were loaded into a central database for ToxCast using standard
nomenclature to pipeline the data into invitrodb under for the Stemina DevToxqP assay source identifier (asid)
for the ToxCast platform, designated STM_H9 (asid 14); specifically, the 2 assay identifiers (aid)
STM_H9_secretome (aid 428) and STM_H9_viability (aid 437)representing data measures for the conditioned
media and cell monolayer, respectively. This included identifiers for tracking each chemical sample (spid), plate
identifier (apid), well position (row, column), micromolar concentration tested (dose), well quality (0=fail,
l=pass), and well type (wilt). The well type identifiers included media blank "b" lacking H9 cells, neutral control
"n" 0.1% DMSO, test compound "t," negative control "m" 0.005uM MTX, and positive control "p" l.OuM MTX.
Raw data values (rval) were stored under the following assay components (acid): peak area for C13-cystine (acid
1023) and C13-ornithine (acid 1024) tracers, peak area for measured cystine (acid 1025) and cystine
standardized to the C13 tracer (acid 1026), cystine normalized to the plate median value of the neutral controls
(acid 1027), peak area for measured ornithine (acid 1028) and ornithine standardized to the C13 tracer (acid

-------
1029), ornithine normalized to the plate median value of the neutral controls (acid 1030), the ORN:CYSS ratio
calculated from DMSO-normalized values (acid 1031), the targeted biomarker prediction (acid 1032, which is an
empty placeholder), background-corrected RFU from the CellTiter-Fluor assay (acid 1113), and cell viability
normalized to mean RFU of the DMSO control (acid 1114). The corresponding assay endpoint identifiers (aeids)
analyzed for the predictive model were: the decrease in media ornithine reflecting reduced cellular release
(STM_H9_ornithinelSnorm_perc, aeid 1688), the increase in media cystine reflecting less utilization
(STM_H9_cystinelSnorm_perc, aeid 1682), the ORN/CYSS ratio reflecting a decrease in the ORN:CYSS ratio as
the primary biomarker (STM_H9_OrnCysslSnorm_ratio, aeid 1690), and normalized cell viability
(STM_H9_NormalizedViability, aeid 1858).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

2: log2 (Transform the corrected response value (cval) to log-scale (base 2).)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates

-------
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 379	Number of chemicals tested: 379

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1

Neutral control median absolute deviation, by plate: nmad	0.018

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	1.83%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	0.418

Positive control well median absolute deviation, by plate: pmad	0.013

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-21.06

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	1.01

Negative control well median absolute deviation value, by plate: mmad	0.011

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	0.417

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

-------
(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 58.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Rush N, Kothiya P, Judson RS, Houck KA, Hunter ES, Baker NC, Palmer JA,
Thomas RS, Knudsen TB. Profiling the ToxCast Library With a Pluripotent Human (H9) Stem Cell Line-Based
Biomarker Assay for Developmental Toxicity. Toxicol Sci. 2020 Apr l;174(2):189-209. doi:
10.1093/toxsci/kfaa014. PMID: 32073639., Palmer, J. A., Smith, A. M., Egnash, L. A., Conard, K. R., West, P. R.,
Burrier, R. E., Donley, E. L., & Kirchner, F. R. (2013). Establishment and assessment of a new human embryonic
stem cell-based biomarker assay for developmental toxicity screening. Birth defects research. Part B,
Developmental and reproductive toxicology, 98(4), 343-363. https://doi.org/10.1002/bdrb.21078, Palmer, J. A.,
Smith, A. M., Egnash, L. A., Colwell, M. R., Donley, E. L. R., Kirchner, F. R., & Burrier, R. E. (2017). A human induced
pluripotent stem cell-based in vitro assay predicts developmental toxicity through a retinoic acid receptor-
mediated pathway for a series of related retinoid analogues. Reproductive toxicology (Elmsford, N.Y.), 73, 350-
361. https://doi.Org/10.1016/j.reprotox.2017.07.011

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1690

STM_H9_OrnCysslSnorm_ratio

1.	General Information

1.1	Assay Title: Stemina devTOX quickPredict H9 human embryonic stem cell (hESC)-based Assay for Ornithine
(ORN):Cystine (CYSS) Ratio

1.2	Assay Summary: STM_H9_secretome utilizes undifferentiated H9 cells to measure relative changes in two
metabolites, ornithine (ORN) and cystine (CYSS) in the secretome, targeting a critical drop in the ORN/CYSS ratio
as a biomarker for developmental toxicity. STM_H9_OrnCysslSnorm_ratio is one of 10 components calculated
in the STM_H9_secretome assay. It is the ratio value of OrnlSnorm_perc to CysslSnorm_perc. Data from the
assay component STM_H9_OrnCysslSnorm_ratio was analyzed at the assay endpoint
STM_H9_OrnCysslSnorm_ratio in the positive analysis fitting direction relative to DMSO as the negative control
and baseline of activity. Using a type of binding reporter, gain or loss-of-signal activity can be used to understand
relative metabolite change.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Stemina Biomarker Discovery is a Contract Research Organization (CRO) that provides stem cell-
based developmental toxicity screening for chemical compounds.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: devTOX quickPredict platform is a technology trademarked and patented by Stemina
Biomarker Discovery, Inc.

1.9	Assay Throughput: 96-well plate. All treatments were carried out in Matrigel-coated 96-well plates. H9 cells
were plated with a seeding density of 100000 cells per well in mTeSRl medium containing lOuM Y27632 Rho-
associated kinase inhibitor (ATCC, Manassas, Virginia) to increase plating efficiency. Y27632 was removed prior
to compound addition at 24h after plating.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Change in the ratio value of Orn release to Cyss utilization, indicative of the targeted biomarker of
the assay.

The Stemina devTOX quickPredict platform is a human pluripotent stem cell-based assay that predicts the
developmental toxicity potential based on changes in cellular metabolism following chemical exposure.

2.2	Scientific Principles: Profiling hESCs for their secreted metabolites has been proposed as an alternative testing
platform for identifying compounds with potential developmental toxicity. Dynamic variations in metabolite
abundance with functional changes in biochemical pathways and cellular metabolic response may be direct or
secondary consequences of chemical exposure, the profile of intermediary metabolites and small molecules
released by hESCs to their environment (secretome) could lead to identification of the extent of adverse
outcome pathways in the developing embryo. The ToxCast STM platform described here provides a potency
read-out of a chemical compound's exposure-based potential for developmental toxicity based on a critical
imbalance in the targeted biomarker (decreased ORN/CYSS ratio detected in the H9 hESC conditioned medium).

-------
2.3	Experimental System: monolayer hESC cell-based used. H9 cells (NIH code WA09, WiCell Research Institute, Inc,
Madison, Wisconsin) were used as approved for federally funded research and selected because of their
commercial availability, genetic stability (normal female karyotype), and scientific legacy (hundreds of
publications). Derivation and characterization of the H9 cell line was originally reported by Thomson et al.
(1998). Cells were handled as described (Palmer et al., 2013). Briefly, cells were maintained under feeder-free
conditions with mTeSRl media (StemCell Technologies, Vancouver, Canada) on Matrigel hESC-Qualified Matrix
(Corning, Bedford, Massachusetts) coated 6-well plates. Cultures were incubated at 37C in a humidified
atmosphere of <5% C02. Differentiated colonies were removed daily through aspiration to maintain the
undifferentiated stem cell population. Differentiation was based on visual inspection; there is typically < 5%
differentiation in a culture, and only highly pure undifferentiated H9 cell populations were used for these
experiments. Cultures were passaged using Versene (Life Technologies, Grand Island, New York) or ReLeSR
(StemCell Technologies) at 85%-90% confluency, karyotyped approximately every 10 passages, and the absence
of mycoplasma was routinely confirmed with the MycoAlert Mycoplasma Detection Kit (Lonza, Rockland,
Maine). The passage number of H9 cells used over the course of this study ranged from 31 to 48; anything above
passage 40 was karyotyped within 10 passages prior to use in the assay.

2.4	Metabolic Competence: hES cells are an innovative in vitro model system that is metabolically similar to
embryonic epiblast cells at gastrulation. Perturbations in cellular metabolism may be indicative of

2.5 Exposure Regime: H9 hESCs maintained in a pluripotent state were exposed to test compound for 72h with
chemical replenishment with media replacement every 24h. Cell-conditioned media from the final 24-h
treatment period was collected for analysis of the targeted biomarker, and cell viability of the corresponding
cell layer was assessed. The chemical library was tested in blinded fashion. Each test plate included luM
Methotrexate (MTX; Selleck Chemicals, Houston, Texas) as a positive reference, 5 nM MTX as a negative
reference, 0.1% DMSO as the neutral (vehicle) control, and sample-level media blanks. H9 cell-conditioned
media from the final 24-h treatment period was collected for analysis of the targeted biomarker and cell viability
was measured from the corresponding cell layer. Cell viability was measured using the CellTiter-Fluor assay
(Promega, Madison, Wisconsin) based on proteolytic cleavage of a substrate to fluorescent signal proportional
to the number of living cells. The cell viability Relative Fluorescence Unit (RFU) was background corrected and
normalized to mean RFU of the neutral control (0.1% DMSO). The collected H9-cell-conditioned media samples
were processed for targeted biomarker analysis as described (Palmer et al., 2013). Briefly, spent media samples
were deproteinized (40% acetonitrile) and processed for UPLC-HRMS. Data acquisition was performed using 4
separate UPLC-HRMS systems, consisting of an Agilent 1290 Infinity LC system (Agilent Technologies) interfaced
with an Agilent high-resolution mass spectrometer (models G6520A, G6520B, G6530A, and G6224A). A Waters
Acquity UPLC BEH Amide column (2.1mm x 50mm, 1.71m particle size; Waters, Milford, Massachusetts)
maintained at 40C was applied for separation of metabolites using a 6.5min solvent gradient with 0.1% formic
acid in water and 0.1% formic acid in acetonitrile (l.Oml/min flow rate). Data were acquired using MassHunter
Acquisition software (version B 04.00, Agilent Technologies). The extracted ion chromatogram (EIC) areas for
ORN, CYSS, and their respective spike-in C13-standards were determined using the Agilent MassHunter
Quantitative Analysis software, version B.05.00 or newer (Agilent Technologies), and data were normalized as
described in Palmer et al. (2017).

teratogenicity.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

300 nM
Key positive control:

Methotrexate

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

le+06 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.136

-------
Response cutoff threshold used to determine hit calls: 0.409
Detection technology used: UPLC-HRMS (Mass spectrometry)

2.6	Response: This assay measures relative changes in 2 metabolites, ornithine (ORN) and cystine (CYSS), targeting
the ORN/CYSS ratio as a biomarker for developmental toxicity. Ornithine is a nonproteogenic amino acid that
functions in several biochemical pathways including ammonia detoxification in the urea cycle, pyrimidine
synthesis via ornithine transcarbamy-lase, and polyamine synthesis via ornithine decarboxylase. Ornithine is
initially absent from the medium but released from viable cells; as such, decreased cellular release reflects
general metabolic states for these pathways. Cystine is initially present in the medium and used by cells in
glutathione production; as such, the change connected to decreased CYSS uptake likely reflects a change in
cellular glutathione synthesis and redox balance. Although minor effects on cell viability could account for
changes in cellular ORN release and/or CYSS uptake measured in the ORN/CYSS ratio, altered cell growth and/or
survival are potential modes of action in developmental toxicity. As such, compounds that impact the ORN/CYSS
biomarker due to minimal effects on cell viability should not be discounted because of it. Although cell viability
is measured, it is not included in the prediction by the assay, which is based solely on the ORN/CYSS ratio
response. Cell viability is provided as an additional endpoint to aid in the interpretation of the data.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of metabolite.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All raw data and metadata were loaded into a central database for ToxCast using standard
nomenclature to pipeline the data into invitrodb under for the Stemina DevToxqP assay source identifier (asid)
for the ToxCast platform, designated STM_H9 (asid 14); specifically, the 2 assay identifiers (aid)
STM_H9_secretome (aid 428) and STM_H9_viability (aid 437)representing data measures for the conditioned
media and cell monolayer, respectively. This included identifiers for tracking each chemical sample (spid), plate
identifier (apid), well position (row, column), micromolar concentration tested (dose), well quality (0=fail,
l=pass), and well type (wilt). The well type identifiers included media blank "b" lacking H9 cells, neutral control
"n" 0.1% DMSO, test compound "t," negative control "m" 0.005uM MTX, and positive control "p" l.OuM MTX.
Raw data values (rval) were stored under the following assay components (acid): peak area for C13-cystine (acid
1023) and C13-ornithine (acid 1024) tracers, peak area for measured cystine (acid 1025) and cystine
standardized to the C13 tracer (acid 1026), cystine normalized to the plate median value of the neutral controls
(acid 1027), peak area for measured ornithine (acid 1028) and ornithine standardized to the C13 tracer (acid

-------
1029), ornithine normalized to the plate median value of the neutral controls (acid 1030), the ORN:CYSS ratio
calculated from DMSO-normalized values (acid 1031), the targeted biomarker prediction (acid 1032, which is an
empty placeholder), background-corrected RFU from the CellTiter-Fluor assay (acid 1113), and cell viability
normalized to mean RFU of the DMSO control (acid 1114). The corresponding assay endpoint identifiers (aeids)
analyzed for the predictive model were: the decrease in media ornithine reflecting reduced cellular release
(STM_H9_ornithinelSnorm_perc, aeid 1688), the increase in media cystine reflecting less utilization
(STM_H9_cystinelSnorm_perc, aeid 1682), the ORN/CYSS ratio reflecting a decrease in the ORN:CYSS ratio as
the primary biomarker (STM_H9_OrnCysslSnorm_ratio, aeid 1690), and normalized cell viability
(STM_H9_NormalizedViability, aeid 1858).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.), 2: log2 (Transform the corrected response value (cval) to log-scale
(base 2).), 3: rmneg (Exclude wells with negative corrected response values (cval) and downgrading their
well quality (wllq); if cval < 0, wllq = 0.), 4: rmzero (Exclude wells with corrected response values (cval)
equal to zero and downgrading their well quality (wllq); if cval = 0, wllq = 0.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:

-------
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 428	Number of chemicals tested: 428

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1

Neutral control median absolute deviation, by plate: nmad	0.06

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.96%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	0.108

Positive control well median absolute deviation, by plate: pmad	0.004

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-14.139

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	1.011

Negative control well median absolute deviation value, by plate: mmad	0.038

Z Prime Factor for median negative and neutral control across all plates:	NA

-------
(1 - ((3 * (mmad + nmad)) / abs(mmed - rimed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	0.124

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 65.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Rush N, Kothiya P, Judson RS, Houck KA, Hunter ES, Baker NC, Palmer JA,
Thomas RS, Knudsen TB. Profiling the ToxCast Library With a Pluripotent Human (H9) Stem Cell Line-Based
Biomarker Assay for Developmental Toxicity. Toxicol Sci. 2020 Apr l;174(2):189-209. doi:
10.1093/toxsci/kfaa014. PMID: 32073639., Palmer, J. A., Smith, A. M., Egnash, L. A., Conard, K. R., West, P. R.,
Burrier, R. E., Donley, E. L., & Kirchner, F. R. (2013). Establishment and assessment of a new human embryonic

-------
stem cell-based biomarker assay for developmental toxicity screening. Birth defects research. Part B,
Developmental and reproductive toxicology, 98(4), 343-363. https://doi.org/10.1002/bdrb.21078, Palmer, J. A.,
Smith, A. M., Egnash, L A., Colwell, M. R., Donley, E. L R., Kirchner, F. R., & Burrier, R. E. (2017). A human induced
pluripotent stem cell-based in vitro assay predicts developmental toxicity through a retinoic acid receptor-
mediated pathway for a series of related retinoid analogues. Reproductive toxicology (Elmsford, N.Y.), 73, 350-
361. https://doi.Org/10.1016/j.reprotox.2017.07.011

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1858

STM_H9_NormalizedViability

1.	General Information

1.1	Assay Title: Cytotoxicity Assessment at 120 hours in the Stemina devTOX quickPredict H9 human embryonic
stem cell (hESC)-based Assay

1.2	Assay Summary: STM_H9_viability utilizes undifferentiated H9 cells to measure cell viability.
STM_H9_NormalizedViability is a component calculated in the STM_H9_viability assay. It measures cell viability
using background corrected relative fluorescence units (RFU) value from the CellTiterFluor Assay normalized to
the value of the 0.1% DMSO neutral control treatment (by 96-well plate) Data from the assay component
STM_H9_Viability was analyzed at the assay endpoint STM_H9_Viability_Norm in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of viability reporter,
loss-of-signal activity can be used to understand viability.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Stemina Biomarker Discovery is a Contract Research Organization (CRO) that provides stem cell-
based developmental toxicity screening for chemical compounds.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: devTOX quickPredict platform is a technology trademarked and patented by Stemina
Biomarker Discovery, Inc.

1.9	Assay Throughput: 96-well plate. All treatments were carried out in Matrigel-coated 96-well plates. H9 cells
were plated with a seeding density of 100000 cells per well in mTeSRl medium containing lOuM Y27632 Rho-
associated kinase inhibitor (ATCC, Manassas, Virginia) to increase plating efficiency. Y27632 was removed prior
to compound addition at 24h after plating.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in fluorescence are indicative of changes in cell viability.

The Stemina devTOX quickPredict platform is a human pluripotent stem cell-based assay that predicts the
developmental toxicity potential based on changes in cellular metabolism following chemical exposure.

2.2	Scientific Principles: Profiling hESCs for their secreted metabolites has been proposed as an alternative testing
platform for identifying compounds with potential developmental toxicity. Dynamic variations in metabolite
abundance with functional changes in biochemical pathways and cellular metabolic response may be direct or
secondary consequences of chemical exposure, the profile of intermediary metabolites and small molecules
released by hESCs to their environment (secretome) could lead to identification of the extent of adverse
outcome pathways in the developing embryo. The ToxCast STM platform described here provides a potency
read-out of a chemical compound's exposure-based potential for developmental toxicity based on a critical
imbalance in the targeted biomarker (decreased ORN/CYSS ratio detected in the H9 hESC conditioned medium).

-------
2.3	Experimental System: monolayer hESC cell-based used. H9 cells (NIH code WA09, WiCell Research Institute, Inc,
Madison, Wisconsin) were used as approved for federally funded research and selected because of their
commercial availability, genetic stability (normal female karyotype), and scientific legacy (hundreds of
publications). Derivation and characterization of the H9 cell line was originally reported by Thomson et al.
(1998). Cells were handled as described (Palmer et al., 2013). Briefly, cells were maintained under feeder-free
conditions with mTeSRl media (StemCell Technologies, Vancouver, Canada) on Matrigel hESC-Qualified Matrix
(Corning, Bedford, Massachusetts) coated 6-well plates. Cultures were incubated at 37C in a humidified
atmosphere of <5% C02. Differentiated colonies were removed daily through aspiration to maintain the
undifferentiated stem cell population. Differentiation was based on visual inspection; there is typically < 5%
differentiation in a culture, and only highly pure undifferentiated H9 cell populations were used for these
experiments. Cultures were passaged using Versene (Life Technologies, Grand Island, New York) or ReLeSR
(StemCell Technologies) at 85%-90% confluency, karyotyped approximately every 10 passages, and the absence
of mycoplasma was routinely confirmed with the MycoAlert Mycoplasma Detection Kit (Lonza, Rockland,
Maine). The passage number of H9 cells used over the course of this study ranged from 31 to 48; anything above
passage 40 was karyotyped within 10 passages prior to use in the assay.

2.4	Metabolic Competence: hES cells are an innovative in vitro model system that is metabolically similar to
embryonic epiblast cells at gastrulation. Perturbations in cellular metabolism may be indicative of

2.5 Exposure Regime: H9 hESCs maintained in a pluripotent state were exposed to test compound for 72h with
chemical replenishment with media replacement every 24h. Cell-conditioned media from the final 24-h
treatment period was collected for analysis of the targeted biomarker, and cell viability of the corresponding
cell layer was assessed. The chemical library was tested in blinded fashion. Each test plate included luM
Methotrexate (MTX; Selleck Chemicals, Houston, Texas) as a positive reference, 5 nM MTX as a negative
reference, 0.1% DMSO as the neutral (vehicle) control, and sample-level media blanks. H9 cell-conditioned
media from the final 24-h treatment period was collected for analysis of the targeted biomarker and cell viability
was measured from the corresponding cell layer. Cell viability was measured using the CellTiter-Fluor assay
(Promega, Madison, Wisconsin) based on proteolytic cleavage of a substrate to fluorescent signal proportional
to the number of living cells. The cell viability Relative Fluorescence Unit (RFU) was background corrected and
normalized to mean RFU of the neutral control (0.1% DMSO). The collected H9-cell-conditioned media samples
were processed for targeted biomarker analysis as described (Palmer et al., 2013). Briefly, spent media samples
were deproteinized (40% acetonitrile) and processed for UPLC-HRMS. Data acquisition was performed using 4
separate UPLC-HRMS systems, consisting of an Agilent 1290 Infinity LC system (Agilent Technologies) interfaced
with an Agilent high-resolution mass spectrometer (models G6520A, G6520B, G6530A, and G6224A). A Waters
Acquity UPLC BEH Amide column (2.1mm x 50mm, 1.71m particle size; Waters, Milford, Massachusetts)
maintained at 40C was applied for separation of metabolites using a 6.5min solvent gradient with 0.1% formic
acid in water and 0.1% formic acid in acetonitrile (l.Oml/min flow rate). Data were acquired using MassHunter
Acquisition software (version B 04.00, Agilent Technologies). The extracted ion chromatogram (EIC) areas for
ORN, CYSS, and their respective spike-in C13-standards were determined using the Agilent MassHunter
Quantitative Analysis software, version B.05.00 or newer (Agilent Technologies), and data were normalized as
described in Palmer et al. (2017).

teratogenicity.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
8

Standard minimum concentration tested:

300 nM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

le+06 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.059
Response cutoff threshold used to determine hit calls: 0.176

-------
Detection technology used: CellTiter-FluorTM (Fluorescence)

2.6	Response: This assay measures relative changes in 2 metabolites, ornithine (ORN) and cystine (CYSS), targeting
the ORN/CYSS ratio as a biomarker for developmental toxicity. Ornithine is a nonproteogenic amino acid that
functions in several biochemical pathways including ammonia detoxification in the urea cycle, pyrimidine
synthesis via ornithine transcarbamy-lase, and polyamine synthesis via ornithine decarboxylase. Ornithine is
initially absent from the medium but released from viable cells; as such, decreased cellular release reflects
general metabolic states for these pathways. Cystine is initially present in the medium and used by cells in
glutathione production; as such, the change connected to decreased CYSS uptake likely reflects a change in
cellular glutathione synthesis and redox balance. Although minor effects on cell viability could account for
changes in cellular ORN release and/or CYSS uptake measured in the ORN/CYSS ratio, altered cell growth and/or
survival are potential modes of action in developmental toxicity. As such, compounds that impact the ORN/CYSS
biomarker due to minimal effects on cell viability should not be discounted because of it. Although cell viability
is measured, it is not included in the prediction by the assay, which is based solely on the ORN/CYSS ratio
response. Cell viability is provided as an additional endpoint to aid in the interpretation of the data.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All raw data and metadata were loaded into a central database for ToxCast using standard
nomenclature to pipeline the data into invitrodb under for the Stemina DevToxqP assay source identifier (asid)
for the ToxCast platform, designated STM_H9 (asid 14); specifically, the 2 assay identifiers (aid)
STM_H9_secretome (aid 428) and STM_H9_viability (aid 437)representing data measures for the conditioned
media and cell monolayer, respectively. This included identifiers for tracking each chemical sample (spid), plate
identifier (apid), well position (row, column), micromolar concentration tested (dose), well quality (0=fail,
l=pass), and well type (wilt). The well type identifiers included media blank "b" lacking H9 cells, neutral control
"n" 0.1% DMSO, test compound "t," negative control "m" 0.005uM MTX, and positive control "p" l.OuM MTX.
Raw data values (rval) were stored under the following assay components (acid): peak area for C13-cystine (acid
1023) and C13-ornithine (acid 1024) tracers, peak area for measured cystine (acid 1025) and cystine
standardized to the C13 tracer (acid 1026), cystine normalized to the plate median value of the neutral controls
(acid 1027), peak area for measured ornithine (acid 1028) and ornithine standardized to the C13 tracer (acid
1029), ornithine normalized to the plate median value of the neutral controls (acid 1030), the ORN:CYSS ratio

-------
calculated from DMSO-normalized values (acid 1031), the targeted biomarker prediction (acid 1032, which is an
empty placeholder), background-corrected RFU from the CellTiter-Fluor assay (acid 1113), and cell viability
normalized to mean RFU of the DMSO control (acid 1114). The corresponding assay endpoint identifiers (aeids)
analyzed for the predictive model were: the decrease in media ornithine reflecting reduced cellular release
(STM_H9_ornithinelSnorm_perc, aeid 1688), the increase in media cystine reflecting less utilization
(STM_H9_cystinelSnorm_perc, aeid 1682), the ORN/CYSS ratio reflecting a decrease in the ORN:CYSS ratio as
the primary biomarker (STM_H9_OrnCysslSnorm_ratio, aeid 1690), and normalized cell viability
(STM_H9_NormalizedViability, aeid 1858).

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.), 2: log2 (Transform the corrected response value (cval) to log-scale
(base 2).)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.), 6: resp.multnegl (Multiply the
normalized response value (resp) by -1; -l*resp.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than

-------
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 428	Number of chemicals tested: 428

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.999

Neutral control median absolute deviation, by plate: nmad	0.028

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	2.84%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	0.542

Positive control well median absolute deviation, by plate: pmad	0.012

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-12.93

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	1.018

Negative control well median absolute deviation value, by plate: mmad	0.014

-------
Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

NA

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

0.415

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

NA

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 50.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Rush N, Kothiya P, Judson RS, Houck KA, Hunter ES, Baker NC, Palmer JA,
Thomas RS, Knudsen TB. Profiling the ToxCast Library With a Pluripotent Human (H9) Stem Cell Line-Based
Biomarker Assay for Developmental Toxicity. Toxicol Sci. 2020 Apr l;174(2):189-209. doi:
10.1093/toxsci/kfaa014. PMID: 32073639., Palmer, J. A., Smith, A. M., Egnash, L. A., Conard, K. R., West, P. R.,

-------
Burrier, R. E., Donley, E. L., & Kirchner, F. R. (2013). Establishment and assessment of a new human embryonic
stem cell-based biomarker assay for developmental toxicity screening. Birth defects research. Part B,
Developmental and reproductive toxicology, 98(4), 343-363. https://doi.org/10.1002/bdrb.21078, Palmer, J. A.,
Smith, A. M., Egnash, L A., Colwell, M. R., Donley, E. L R., Kirchner, F. R., & Burrier, R. E. (2017). A human induced
pluripotent stem cell-based in vitro assay predicts developmental toxicity through a retinoic acid receptor-
mediated pathway for a series of related retinoid analogues. Reproductive toxicology (Elmsford, N.Y.), 73, 350-
361. https://doi.Org/10.1016/j.reprotox.2017.07.011

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1372

Tanguay_ZF_120hpf_MORT_legacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for
Mortality

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_MORT_legacy is one of
one assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is designed to
make measurements of embryonic mortality as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_MORT_legacy was analyzed into 1 assay
endpoints. This assay endpoint, Tanguay_ZF_120hpf_MORT_legacy, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of viability reporter,
gain-of-signal activity can be used to understand changes in developmental as they relate to the whole embryo.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the embryonic
mortality intended target family, where the subfamily is embryonic mortality.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Mortality is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2

Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:
0.0095 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
95.12 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 10.544
Response cutoff threshold used to determine hit calls: 52.721
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
109

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	12.847

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 106.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1373

Tanguay_ZF_120hpf_YSE_legacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Yolk
Sac Edema

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_YSE_legacy is one of one
assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_YSE_legacy was analyzed into 1 assay endpoints.
This assay endpoint, Tanguay_ZF_120hpf_YSE_legacy, was analyzed in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter, gain-of-
signal activity can be used to understand changes in developmental as they relate to the whole embryo. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is extra-embryonic membrane development.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:
0.0095 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
95.12 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 3.193
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
119

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.96

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 82.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1374

Tanguay_ZF_120hpf_AXIS_legacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Body
Axis Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_AXIS_legacy is one of
one assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is designed to
make measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_AXIS_legacy was analyzed into 1 assay
endpoints. This assay endpoint, Tanguay_ZF_120hpf_AXIS_legacy, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter,
gain-of-signal activity can be used to understand changes in developmental as they relate to the whole embryo.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is body axis morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2

Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:
0.0095 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
95.12 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 2.635
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
109

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.616

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 74.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1375

Ta nguay_ZF_120h pf_EYE_legacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Eye
Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_EYE_legacy is one of one
assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_EYE_legacy was analyzed into 1 assay
endpoints. This assay endpoint, Tanguay_ZF_120hpf_EYE_legacy, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter,
gain-of-signal activity can be used to understand changes in developmental as they relate to the whole embryo.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is embryonic eye development.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2

Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:
0.0095 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
95.12 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 2.302
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
74

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.541

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 51.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1376

Tanguay_ZF_120hpf_SNOU_legacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Snout
Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_SNOU_legacy is one of
one assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is designed to
make measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_SNOU_legacy was analyzed into 1 assay
endpoints. This assay endpoint, Tanguay_ZF_120hpf_SNOU_legacy, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter,
gain-of-signal activity can be used to understand changes in developmental as they relate to the whole embryo.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is embryonic snout morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2

Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:
0.0095 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
95.12 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 2.705
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
106

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.737

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 87.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1377

Ta nguay_ZF_120h pf_J AWJegacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Jaw
Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_JAW_legacy is one of
one assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is designed to
make measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_JAW_legacy was analyzed into 1 assay
endpoints. This assay endpoint, Tanguay_ZF_120hpf_JAW_legacy, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter,
gain-of-signal activity can be used to understand changes in developmental as they relate to the whole embryo.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is embryonic jaw development.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2

Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:
0.0095 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
95.12 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 2.629
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
99

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.607

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 75.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1378

Tanguay_ZF_120hpf_OTIC_legacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Otic
Vesicle Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_OTIC_legacy is one of
one assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is designed to
make measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_OTIC_legacy was analyzed into 1 assay
endpoints. This assay endpoint, Tanguay_ZF_120hpf_OTIC_legacy, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter,
gain-of-signal activity can be used to understand changes in developmental as they relate to the whole embryo.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is otic vesicle morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2

Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:
0.0095 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
95.12 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 1.978
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
42

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.357

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 40.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1379

Tanguay_ZF_120hpf_PE_legacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for
Pericardial Edema

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_PE_legacy is one of one
assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_PE_legacy was analyzed into 1 assay endpoints.
This assay endpoint, Tanguay_ZF_120hpf_PE_legacy, was analyzed in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter, gain-of-
signal activity can be used to understand changes in developmental as they relate to the whole embryo. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is embryonic morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:
0.0095 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
95.12 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 2.885
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
113

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.883

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 93.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1380

Ta nguay_ZF_120h pf_B RAI Jegacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Brain
Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_BRAI_legacy is one of
one assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is designed to
make measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_BRAI_legacy was analyzed into 1 assay
endpoints. This assay endpoint, Tanguay_ZF_120hpf_BRAI_legacy, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter,
gain-of-signal activity can be used to understand changes in developmental as they relate to the whole embryo.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is brain morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2

Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:
0.0095 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
95.12 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 2.1
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
63

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.346

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 51.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1381

Tanguay_ZF_120hpf_SOMI_legacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Somite
Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_SOMI_legacy is one of
one assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is designed to
make measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_SOMI_legacy was analyzed into 1 assay
endpoints. This assay endpoint, Tanguay_ZF_120hpf_SOMI_legacy, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter,
gain-of-signal activity can be used to understand changes in developmental as they relate to the whole embryo.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is somitogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2

Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:
0.0095 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
95.12 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 1.976
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
39

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.295

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 30.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1382

Tanguay_ZF_120hpf_PFI N Jegacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for
Pectoral Fin Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_PFIN_legacy is one of
one assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is designed to
make measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_PFIN_legacy was analyzed into 1 assay
endpoints. This assay endpoint, Tanguay_ZF_120hpf_PFIN_legacy, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter,
gain-of-signal activity can be used to understand changes in developmental as they relate to the whole embryo.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is pectoral fin morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2

Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:
0.0095 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
95.12 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 2.183
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
58

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.437

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 52.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1383

Tanguay_ZF_120hpf_CFI N Jegacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Caudal
Fin Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_CFIN_legacy is one of
one assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is designed to
make measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_CFIN_legacy was analyzed into 1 assay
endpoints. This assay endpoint, Tanguay_ZF_120hpf_CFIN_legacy, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter,
gain-of-signal activity can be used to understand changes in developmental as they relate to the whole embryo.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is caudal fin morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2

Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:
0.0095 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
95.12 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 2.135
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
44

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.462

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 40.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1384

Tanguay_ZF_120hpf_PIG_legacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for
Pigmentation Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_PIG_legacy is one of one
assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_PIG_legacy was analyzed into 1 assay endpoints.
This assay endpoint, Tanguay_ZF_120hpf_PIG_legacy, was analyzed in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter, gain-of-
signal activity can be used to understand changes in developmental as they relate to the whole embryo. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is embryonic pigmentation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:
0.0095 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
95.12 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 1.933
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
46

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.231

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 34.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1385

Ta nguay_ZF_120h pf_CI RCJegacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for
Circulatory Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_CIRC_legacy is one of
one assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is designed to
make measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_CIRC_legacy was analyzed into 1 assay
endpoints. This assay endpoint, Tanguay_ZF_120hpf_CIRC_legacy, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter,
gain-of-signal activity can be used to understand changes in developmental as they relate to the whole embryo.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is angiogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2

Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:
0.0095 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
95.12 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 1.826
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
18

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.063

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 32.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1386

Tanguay_ZF_120hpf_TRU NJegacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for
Truncated Body

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_TRUN_legacy is one of
one assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is designed to
make measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_TRUN_legacy was analyzed into 1 assay
endpoints. This assay endpoint, Tanguay_ZF_120hpf_TRUN_legacy, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter,
gain-of-signal activity can be used to understand changes in developmental as they relate to the whole embryo.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is embryonic morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2

Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:
0.0095 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
95.12 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 2.215
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
53

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.489

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 46.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1387

Tanguay_ZF_120hpf_SWI M Jegacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Swim
Bladder Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_SWIM_legacy is one of
one assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is designed to
make measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_SWIM_legacy was analyzed into 1 assay
endpoints. This assay endpoint, Tanguay_ZF_120hpf_SWIM_legacy, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter,
gain-of-signal activity can be used to understand changes in developmental as they relate to the whole embryo.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is swim bladder morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2

Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:
0.0095 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
95.12 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 1.876
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
36

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.273

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 37.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1388

Tanguay_ZF_120hpf_NC_legacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for
Notochord and Tail Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_NC_legacy is one of one
assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_NC_legacy was analyzed into 1 assay endpoints.
This assay endpoint, Tanguay_ZF_120hpf_NC_legacy, was analyzed in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter, gain-of-
signal activity can be used to understand changes in developmental as they relate to the whole embryo. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is embryonic notochord morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:
0.0095 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
95.12 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 1.769
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
14

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.076

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 18.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1389

Ta nguay_ZF_120h pf_TR_legacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Touch
Response

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_TR_legacy is one of one
assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_TR_legacy was analyzed into 1 assay endpoints.
This assay endpoint, Tanguay_ZF_120hpf_TR_legacy, was analyzed in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter, gain-of-
signal activity can be used to understand changes in developmental as they relate to the whole embryo. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is embryonic functional response.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2

Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Standard minimum concentration tested:
0.0095 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
95.12 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 2.251
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit

-------
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
78

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.338

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	0%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 52.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1797

Tanguay_ZF_120hpf_ActivityScore_legacy

1.	General Information

1.1	Assay Title: (Legacy) Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay Score for
Activity

1.2	Assay Summary: Tanguay_ZF_120hpf_legacy is a whole embryo, multiplexed endpoint assay using zebrafish
larvae exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_ActivityScore_legacy is
one of one assay component(s) measured or calculated from the Tanguay_ZF_120hpf_legacy assay. It is
designed to make measurements of developmental malformations and mortality as detected with brightfield
microscopy and combines scoring from 18 other assay components measuring more specific morphologies. Data
from the assay component Tanguay_ZF_120hpf_ActivityScore_legacy was analyzed into 1 assay endpoints. This
assay endpoint, Tanguay_ZF_120hpf_ActivityScore_legacy, was analyzed in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter, gain-of-
signal activity can be used to understand changes in developmental as they relate to the whole embryo. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish
development intended target family, where the subfamily is embryonic morphogenesis and mortality.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: An algorithm was used to combine results from 18 Tanquay_ZF_120hpf_legacy assay components.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

-------
2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 18 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
5

Target (nominal) number of replicates:
32

-------
Standard minimum concentration tested:

0.0095 nM
Key positive control:

NA

Standard maximum concentration tested:

95.12 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 2.239
Response cutoff threshold used to determine hit calls: 20
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the eighteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: All statistical analysis was performed using code developed in R. The data used were binary
incidences recorded for each endpoint from Zebrafish acquisition and analysis program (ZAAP), plus associated
plate and well-location information. This information was used to test for confounding plate, well, and chemical
effects across all controls and to identify outliers. Outliers were defined as chemicals having an incidence rate
greater than 3 SDs from the mean rate in controls across multiple endpoints. The lowest effect level (LEL) in
micromolar is computed as the concentration at which the incidence exceeded a significance threshold over the
background (control) incidence rate. Because the endpoints are binary and replicates are measured in separate
wells, the 0/1 responses for each chemical-endpoint-concentration-replicate combination translate to a series
(n = 32) of Bernoulli trials, or "coin-flips." Therefore, the LEL significance threshold was estimated using a
binomial test, which provided a straightforward method to adjust for plate and/or chemical effects and the
pooling/separation of controls. Given the experimental design, the binomial maximized power versus a typical
logistic/curve-fit approach by accounting for the falsely "nonmonotonic" responses occurring when the MORT
endpoint led to missing specific endpoint measurements at higher concentrations. Because background
incidence rate varied slightly across chemicals and endpoints, the significance threshold (x) was determined
independently from the binomial distribution function for each chemical-endpoint pair.

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.), 6: no.unbounded.models (Exclude unbounded models and only fit data to bounded models
(hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 5: bmad5 (Add a cutoff value of
5 multiplied the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical

-------
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 1078

Number of chemicals tested: 1060

Active hit count: hitc>0.9
203

ACTIVITY HIT CALLS

Inactive hit count: Oihitc 0.9
865

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	222

gain-loss (gnls) model:	9

power(pow) model:	0

linear-polynomial (polyl) model:	0

quadratic-polynomial(poly2) model:	0

exponential-2 (exp2) model:	0

exponential-3 (exp3) model:	0

exponential-4 (exp4) model:	740

exponential-5 (exp5) model:	107

NA hit count: hitc^O
10

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3 Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.

-------
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

0.478
2.105
254.25%

NA
NA

NA

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 107.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong L, Reif DM, St Mary L, Geier MC, Truong HD, Tanguay RL. Multidimensional in vivo hazard
assessment using zebrafish. Toxicol Sci. 2014 Jan;137(l):212-33. doi: 10.1093/toxsci/kft235. Epub 2013 Oct 17.
PubMed PMID: 24136191; PubMed Central PMCID: PMC3871932.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3194

Tanguay_ZF_120hpf_MO24

1.	General Information

1.1	Assay Title: Oregon State University Tanguay Lab 24 Hour Post-fertilization Zebrafish Assay for Mortality

1.2	Assay Summary: Tanguay_ZF_120hpf is a whole embryo, multiplexed endpoint assay using zebrafish larvae
exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_MO24 is one of one assay
component(s) measured or calculated from the Tanguay_ZF_120hpf assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_MO24 was analyzed into 1 assay endpoints. This
assay endpoint, Tanguay_ZF_120hpf_MO24, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of viability reporter, gain-of-signal activity
can be used to understand changes in developmental as they relate to the whole embryo. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the zebrafish development
intended target family, where the subfamily is embryonic mortality.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Mortality is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in

-------
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 13 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.015 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 25
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the thirteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A benchmark dose (BMD) approach was applied to morphologically bioactive samples on the
summary statistic of 'ANY' endpoint using a parametric curve fitting process. Briefly, this data was modeled
using the unrestricted 3-parameter log-logistic model for dichotomous data with "extra risk" and was
implemented following guidelines from the EPA BMDS v3.2 manual. A benchmark response (BMR) was defined
as a 10% change relative to the background response and BMD10 values were calculated.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

18: pc25 (Add a cutoff value of 25. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 139

Number of chemicals tested: 139

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
7

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 42.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong, L., Rericha, Y., Thunga, P., Marvel, S., Wallis, D., Simonich, M. T., Field, J. A., Cao, D., Reif,
D. M., & Tanguay, R. L. (2022). Systematic developmental toxicity assessment of a structurally diverse library of
PFAS in zebrafish. Journal of hazardous materials, 431,128615. https://doi.Org/10.1016/j.jhazmat.2022.128615

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3195

Tanguay_ZF_120hpf_DP24

1.	General Information

1.1	Assay Title: Oregon State University Tanguay Lab 24 Hour Post-fertilization Zebrafish Assay for Delayed
Development

1.2	Assay Summary: Tanguay_ZF_120hpf is a whole embryo, multiplexed endpoint assay using zebrafish larvae
exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_DP24 is one of one assay
component(s) measured or calculated from the Tanguay_ZF_120hpf assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_DP24 was analyzed into 1 assay endpoints. This
assay endpoint, Tanguay_ZF_120hpf_DP24, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of morphology reporter, gain-of-signal
activity can be used to understand changes in developmental as they relate to the whole embryo. To generalize
the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish development
intended target family, where the subfamily is embryonic morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2 Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 13 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:
0.015 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
100 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 25
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the thirteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A benchmark dose (BMD) approach was applied to morphologically bioactive samples on the
summary statistic of 'ANY' endpoint using a parametric curve fitting process. Briefly, this data was modeled
using the unrestricted 3-parameter log-logistic model for dichotomous data with "extra risk" and was
implemented following guidelines from the EPA BMDS v3.2 manual. A benchmark response (BMR) was defined
as a 10% change relative to the background response and BMD10 values were calculated.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

18: pc25 (Add a cutoff value of 25. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 139

Number of chemicals tested: 139

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
9

Inactive hit count: Oihitc 0.9
130

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

5

20

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

60

54

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 54.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong, L., Rericha, Y., Thunga, P., Marvel, S., Wallis, D., Simonich, M. T., Field, J. A., Cao, D., Reif,
D. M., & Tanguay, R. L (2022). Systematic developmental toxicity assessment of a structurally diverse library of
PFAS in zebrafish. Journal of hazardous materials, 431,128615. https://doi.Org/10.1016/j.jhazmat.2022.128615

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3196

Tanguay_ZF_120hpf_SM24

1.	General Information

1.1	Assay Title: Oregon State University Tanguay Lab 24 Hour Post-fertilization Zebrafish Assay for No Spontaneous
Movement

1.2	Assay Summary: Tanguay_ZF_120hpf is a whole embryo, multiplexed endpoint assay using zebrafish larvae
exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_SM24 is one of one assay
component(s) measured or calculated from the Tanguay_ZF_120hpf assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_SM24 was analyzed into 1 assay endpoints. This
assay endpoint, Tanguay_ZF_120hpf_SM24, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of morphology reporter, gain-of-signal
activity can be used to understand changes in developmental as they relate to the whole embryo. To generalize
the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish development
intended target family, where the subfamily is embryonic mobility.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2 Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 13 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:
0.015 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
100 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 25
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the thirteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A benchmark dose (BMD) approach was applied to morphologically bioactive samples on the
summary statistic of 'ANY' endpoint using a parametric curve fitting process. Briefly, this data was modeled
using the unrestricted 3-parameter log-logistic model for dichotomous data with "extra risk" and was
implemented following guidelines from the EPA BMDS v3.2 manual. A benchmark response (BMR) was defined
as a 10% change relative to the background response and BMD10 values were calculated.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

18: pc25 (Add a cutoff value of 25. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 139

Number of chemicals tested: 139

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: Oihitc 0.9
139

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

0
0

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

0

139

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 139.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong, L., Rericha, Y., Thunga, P., Marvel, S., Wallis, D., Simonich, M. T., Field, J. A., Cao, D., Reif,
D. M., & Tanguay, R. L (2022). Systematic developmental toxicity assessment of a structurally diverse library of
PFAS in zebrafish. Journal of hazardous materials, 431,128615. https://doi.Org/10.1016/j.jhazmat.2022.128615

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3197

Tanguay_ZF_120hpf_MORT

1.	General Information

1.1	Assay Title: Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Mortality

1.2	Assay Summary: Tanguay_ZF_120hpf is a whole embryo, multiplexed endpoint assay using zebrafish larvae
exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_MORT is one of one assay
component(s) measured or calculated from the Tanguay_ZF_120hpf assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_MORT was analyzed into 1 assay endpoints. This
assay endpoint, Tanguay_ZF_120hpf_MORT, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of viability reporter, gain-of-signal activity
can be used to understand changes in developmental as they relate to the whole embryo. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the zebrafish development
intended target family, where the subfamily is embryonic mortality.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Mortality is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in

-------
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 13 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.015 nM
Key positive control:

Target (nominal) number of replicates:

2

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

-------
NA	DMSO

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 25
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the thirteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A benchmark dose (BMD) approach was applied to morphologically bioactive samples on the
summary statistic of 'ANY' endpoint using a parametric curve fitting process. Briefly, this data was modeled
using the unrestricted 3-parameter log-logistic model for dichotomous data with "extra risk" and was
implemented following guidelines from the EPA BMDS v3.2 manual. A benchmark response (BMR) was defined
as a 10% change relative to the background response and BMD10 values were calculated.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

18: pc25 (Add a cutoff value of 25. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 139

Number of chemicals tested: 139

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
18

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 33.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong, L., Rericha, Y., Thunga, P., Marvel, S., Wallis, D., Simonich, M. T., Field, J. A., Cao, D., Reif,
D. M., & Tanguay, R. L. (2022). Systematic developmental toxicity assessment of a structurally diverse library of
PFAS in zebrafish. Journal of hazardous materials, 431,128615. https://doi.Org/10.1016/j.jhazmat.2022.128615

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3198

Ta nguay_ZF_120h pf_CRAN

1.	General Information

1.1	Assay Title: Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Craniofacial
Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf is a whole embryo, multiplexed endpoint assay using zebrafish larvae
exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_CRAN is one of one assay
component(s) measured or calculated from the Tanguay_ZF_120hpf assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_CRAN was analyzed into 1 assay endpoints. This
assay endpoint, Tanguay_ZF_120hpf_CRAN, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of morphology reporter, gain-of-signal
activity can be used to understand changes in developmental as they relate to the whole embryo. To generalize
the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish development
intended target family, where the subfamily is craniofacial morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2 Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 13 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:
0.015 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
100 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 25
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the thirteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A benchmark dose (BMD) approach was applied to morphologically bioactive samples on the
summary statistic of 'ANY' endpoint using a parametric curve fitting process. Briefly, this data was modeled
using the unrestricted 3-parameter log-logistic model for dichotomous data with "extra risk" and was
implemented following guidelines from the EPA BMDS v3.2 manual. A benchmark response (BMR) was defined
as a 10% change relative to the background response and BMD10 values were calculated.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

18: pc25 (Add a cutoff value of 25. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 139

Number of chemicals tested: 139

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
15

Inactive hit count: Oihitc 0.9
124

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

10
22

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

42

65

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 42.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong, L., Rericha, Y., Thunga, P., Marvel, S., Wallis, D., Simonich, M. T., Field, J. A., Cao, D., Reif,
D. M., & Tanguay, R. L (2022). Systematic developmental toxicity assessment of a structurally diverse library of
PFAS in zebrafish. Journal of hazardous materials, 431,128615. https://doi.Org/10.1016/j.jhazmat.2022.128615

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3199

Tanguay_ZF_120hpf_AXIS

1.	General Information

1.1	Assay Title: Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Axial
Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf is a whole embryo, multiplexed endpoint assay using zebrafish larvae
exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_AXIS is one of one assay
component(s) measured or calculated from the Tanguay_ZF_120hpf assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_AXIS was analyzed into 1 assay endpoints. This
assay endpoint, Tanguay_ZF_120hpf_AXIS, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of morphology reporter, gain-of-signal
activity can be used to understand changes in developmental as they relate to the whole embryo. To generalize
the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish development
intended target family, where the subfamily is body axis morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2 Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 13 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:
0.015 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
100 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 25
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the thirteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A benchmark dose (BMD) approach was applied to morphologically bioactive samples on the
summary statistic of 'ANY' endpoint using a parametric curve fitting process. Briefly, this data was modeled
using the unrestricted 3-parameter log-logistic model for dichotomous data with "extra risk" and was
implemented following guidelines from the EPA BMDS v3.2 manual. A benchmark response (BMR) was defined
as a 10% change relative to the background response and BMD10 values were calculated.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

18: pc25 (Add a cutoff value of 25. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 139

Number of chemicals tested: 139

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
9

Inactive hit count: Oihitc 0.9
130

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

5

11

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

65

58

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 58.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong, L., Rericha, Y., Thunga, P., Marvel, S., Wallis, D., Simonich, M. T., Field, J. A., Cao, D., Reif,
D. M., & Tanguay, R. L (2022). Systematic developmental toxicity assessment of a structurally diverse library of
PFAS in zebrafish. Journal of hazardous materials, 431,128615. https://doi.Org/10.1016/j.jhazmat.2022.128615

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3200

Tanguay_ZF_120hpf_EDEM

1.	General Information

1.1	Assay Title: Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Edema
Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf is a whole embryo, multiplexed endpoint assay using zebrafish larvae
exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_EDEM is one of one assay
component(s) measured or calculated from the Tanguay_ZF_120hpf assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_EDEM was analyzed into 1 assay endpoints. This
assay endpoint, Tanguay_ZF_120hpf_EDEM, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of morphology reporter, gain-of-signal
activity can be used to understand changes in developmental as they relate to the whole embryo. To generalize
the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish development
intended target family, where the subfamily is embryonic edema.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2 Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 13 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:
0.015 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
100 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 25
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the thirteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A benchmark dose (BMD) approach was applied to morphologically bioactive samples on the
summary statistic of 'ANY' endpoint using a parametric curve fitting process. Briefly, this data was modeled
using the unrestricted 3-parameter log-logistic model for dichotomous data with "extra risk" and was
implemented following guidelines from the EPA BMDS v3.2 manual. A benchmark response (BMR) was defined
as a 10% change relative to the background response and BMD10 values were calculated.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

18: pc25 (Add a cutoff value of 25. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 139

Number of chemicals tested: 139

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
14

Inactive hit count: Oihitc 0.9
125

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

7

23

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

41

68

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 41.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong, L., Rericha, Y., Thunga, P., Marvel, S., Wallis, D., Simonich, M. T., Field, J. A., Cao, D., Reif,
D. M., & Tanguay, R. L (2022). Systematic developmental toxicity assessment of a structurally diverse library of
PFAS in zebrafish. Journal of hazardous materials, 431,128615. https://doi.Org/10.1016/j.jhazmat.2022.128615

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3201

Tanguay_ZF_120hpf_MUSC

1.	General Information

1.1	Assay Title: Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Muscle
Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf is a whole embryo, multiplexed endpoint assay using zebrafish larvae
exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_MUSC is one of one assay
component(s) measured or calculated from the Tanguay_ZF_120hpf assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_MUSC was analyzed into 1 assay endpoints. This
assay endpoint, Tanguay_ZF_120hpf_MUSC, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of morphology reporter, gain-of-signal
activity can be used to understand changes in developmental as they relate to the whole embryo. To generalize
the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish development
intended target family, where the subfamily is myogensis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2 Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 13 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:
0.015 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
100 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 25
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the thirteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A benchmark dose (BMD) approach was applied to morphologically bioactive samples on the
summary statistic of 'ANY' endpoint using a parametric curve fitting process. Briefly, this data was modeled
using the unrestricted 3-parameter log-logistic model for dichotomous data with "extra risk" and was
implemented following guidelines from the EPA BMDS v3.2 manual. A benchmark response (BMR) was defined
as a 10% change relative to the background response and BMD10 values were calculated.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

18: pc25 (Add a cutoff value of 25. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 139

Number of chemicals tested: 139

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
4

Inactive hit count: Oihitc 0.9
135

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

5
5

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

45

84

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 84.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong, L., Rericha, Y., Thunga, P., Marvel, S., Wallis, D., Simonich, M. T., Field, J. A., Cao, D., Reif,
D. M., & Tanguay, R. L (2022). Systematic developmental toxicity assessment of a structurally diverse library of
PFAS in zebrafish. Journal of hazardous materials, 431,128615. https://doi.Org/10.1016/j.jhazmat.2022.128615

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3202

Tanguay_ZF_120hpf_LTRK

1.	General Information

1.1	Assay Title: Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Lower Trunk
Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf is a whole embryo, multiplexed endpoint assay using zebrafish larvae
exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_LTRK is one of one assay
component(s) measured or calculated from the Tanguay_ZF_120hpf assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_LTRK was analyzed into 1 assay endpoints. This
assay endpoint, Tanguay_ZF_120hpf_LTRK, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of morphology reporter, gain-of-signal
activity can be used to understand changes in developmental as they relate to the whole embryo. To generalize
the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish development
intended target family, where the subfamily is lower trunk morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2 Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 13 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:
0.015 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
100 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 25
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the thirteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A benchmark dose (BMD) approach was applied to morphologically bioactive samples on the
summary statistic of 'ANY' endpoint using a parametric curve fitting process. Briefly, this data was modeled
using the unrestricted 3-parameter log-logistic model for dichotomous data with "extra risk" and was
implemented following guidelines from the EPA BMDS v3.2 manual. A benchmark response (BMR) was defined
as a 10% change relative to the background response and BMD10 values were calculated.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

18: pc25 (Add a cutoff value of 25. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 139

Number of chemicals tested: 139

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: Oihitc 0.9
139

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

2
1

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

49

87

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 87.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong, L., Rericha, Y., Thunga, P., Marvel, S., Wallis, D., Simonich, M. T., Field, J. A., Cao, D., Reif,
D. M., & Tanguay, R. L (2022). Systematic developmental toxicity assessment of a structurally diverse library of
PFAS in zebrafish. Journal of hazardous materials, 431,128615. https://doi.Org/10.1016/j.jhazmat.2022.128615

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3203

Tanguay_ZF_120hpf_BRN

1.	General Information

1.1	Assay Title: Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Brain Region
Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf is a whole embryo, multiplexed endpoint assay using zebrafish larvae
exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_BRN is one of one assay
component(s) measured or calculated from the Tanguay_ZF_120hpf assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_BRN was analyzed into 1 assay endpoints. This
assay endpoint, Tanguay_ZF_120hpf_BRN, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of morphology reporter, gain-of-signal
activity can be used to understand changes in developmental as they relate to the whole embryo. To generalize
the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish development
intended target family, where the subfamily is brain morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2 Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 13 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:
0.015 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
100 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 25
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the thirteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A benchmark dose (BMD) approach was applied to morphologically bioactive samples on the
summary statistic of 'ANY' endpoint using a parametric curve fitting process. Briefly, this data was modeled
using the unrestricted 3-parameter log-logistic model for dichotomous data with "extra risk" and was
implemented following guidelines from the EPA BMDS v3.2 manual. A benchmark response (BMR) was defined
as a 10% change relative to the background response and BMD10 values were calculated.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

18: pc25 (Add a cutoff value of 25. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 139

Number of chemicals tested: 139

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
5

Inactive hit count: Oihitc 0.9
134

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

4
1

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

25

109

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 109.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong, L., Rericha, Y., Thunga, P., Marvel, S., Wallis, D., Simonich, M. T., Field, J. A., Cao, D., Reif,
D. M., & Tanguay, R. L (2022). Systematic developmental toxicity assessment of a structurally diverse library of
PFAS in zebrafish. Journal of hazardous materials, 431,128615. https://doi.Org/10.1016/j.jhazmat.2022.128615

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3204

Ta nguay_ZF_120h pf_SKI N

1.	General Information

1.1	Assay Title: Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Skin
Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf is a whole embryo, multiplexed endpoint assay using zebrafish larvae
exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_SKIN is one of one assay
component(s) measured or calculated from the Tanguay_ZF_120hpf assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_SKIN was analyzed into 1 assay endpoints. This
assay endpoint, Tanguay_ZF_120hpf_SKIN, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of morphology reporter, gain-of-signal
activity can be used to understand changes in developmental as they relate to the whole embryo. To generalize
the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish development
intended target family, where the subfamily is embryonic pigmentation.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2 Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 13 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:
0.015 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
100 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 25
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the thirteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A benchmark dose (BMD) approach was applied to morphologically bioactive samples on the
summary statistic of 'ANY' endpoint using a parametric curve fitting process. Briefly, this data was modeled
using the unrestricted 3-parameter log-logistic model for dichotomous data with "extra risk" and was
implemented following guidelines from the EPA BMDS v3.2 manual. A benchmark response (BMR) was defined
as a 10% change relative to the background response and BMD10 values were calculated.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

18: pc25 (Add a cutoff value of 25. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 139

Number of chemicals tested: 139

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: Oihitc 0.9
139

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

0
0

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

5

134

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 134.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong, L., Rericha, Y., Thunga, P., Marvel, S., Wallis, D., Simonich, M. T., Field, J. A., Cao, D., Reif,
D. M., & Tanguay, R. L (2022). Systematic developmental toxicity assessment of a structurally diverse library of
PFAS in zebrafish. Journal of hazardous materials, 431,128615. https://doi.Org/10.1016/j.jhazmat.2022.128615

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3205

Tanguay_ZF_120hpf_NC

1.	General Information

1.1	Assay Title: Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Notochord
Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf is a whole embryo, multiplexed endpoint assay using zebrafish larvae
exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_NC is one of one assay
component(s) measured or calculated from the Tanguay_ZF_120hpf assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_NC was analyzed into 1 assay endpoints. This
assay endpoint, Tanguay_ZF_120hpf_NC, was analyzed in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of morphology reporter, gain-of-signal activity can
be used to understand changes in developmental as they relate to the whole embryo. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the zebrafish development
intended target family, where the subfamily is embryonic notochord morphogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2 Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 13 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:
0.015 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
100 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 25
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the thirteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A benchmark dose (BMD) approach was applied to morphologically bioactive samples on the
summary statistic of 'ANY' endpoint using a parametric curve fitting process. Briefly, this data was modeled
using the unrestricted 3-parameter log-logistic model for dichotomous data with "extra risk" and was
implemented following guidelines from the EPA BMDS v3.2 manual. A benchmark response (BMR) was defined
as a 10% change relative to the background response and BMD10 values were calculated.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

18: pc25 (Add a cutoff value of 25. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 139

Number of chemicals tested: 139

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: Oihitc 0.9
139

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

0
0

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

4

135

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 135.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong, L., Rericha, Y., Thunga, P., Marvel, S., Wallis, D., Simonich, M. T., Field, J. A., Cao, D., Reif,
D. M., & Tanguay, R. L (2022). Systematic developmental toxicity assessment of a structurally diverse library of
PFAS in zebrafish. Journal of hazardous materials, 431,128615. https://doi.Org/10.1016/j.jhazmat.2022.128615

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3206

Tanguay_ZF_120h pfTCH R

1.	General Information

1.1	Assay Title: Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Touch
Responsive

1.2	Assay Summary: Tanguay_ZF_120hpf is a whole embryo, multiplexed endpoint assay using zebrafish larvae
exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_TCHR is one of one assay
component(s) measured or calculated from the Tanguay_ZF_120hpf assay. It is designed to make
measurements of zebrafish development as detected with brightfield microscopy of developing zebrafish
embryos. Data from the assay component Tanguay_ZF_120hpf_TCHR was analyzed into 1 assay endpoints. This
assay endpoint, Tanguay_ZF_120hpf_TCHR, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of morphology reporter, gain-of-signal
activity can be used to understand changes in developmental as they relate to the whole embryo. To generalize
the intended target to other relatable targets, this assay endpoint is annotated to the zebrafish development
intended target family, where the subfamily is embryonic functional response.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Morphology is measured by light microscopic examination of developing zebrafish embryos.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

2.2 Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of

-------
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 13 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:
0.015 nM

Target (nominal) number of replicates:

2

Standard maximum concentration tested:
100 nM

-------
Key positive control:	Neutral vehicle control:

NA	DMSO

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 25
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the thirteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A benchmark dose (BMD) approach was applied to morphologically bioactive samples on the
summary statistic of 'ANY' endpoint using a parametric curve fitting process. Briefly, this data was modeled
using the unrestricted 3-parameter log-logistic model for dichotomous data with "extra risk" and was
implemented following guidelines from the EPA BMDS v3.2 manual. A benchmark response (BMR) was defined
as a 10% change relative to the background response and BMD10 values were calculated.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

18: pc25 (Add a cutoff value of 25. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 139

Number of chemicals tested: 139

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
5

Inactive hit count: Oihitc 0.9
134

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

5
5

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

0

quadratic-polynomialfpoly2) model: 0

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

52

77

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 77.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong, L., Rericha, Y., Thunga, P., Marvel, S., Wallis, D., Simonich, M. T., Field, J. A., Cao, D., Reif,
D. M., & Tanguay, R. L (2022). Systematic developmental toxicity assessment of a structurally diverse library of
PFAS in zebrafish. Journal of hazardous materials, 431,128615. https://doi.Org/10.1016/j.jhazmat.2022.128615

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3207

Ta nguay_ZF_120h pf_AN Y

1.	General Information

1.1	Assay Title: Oregon State University Tanguay Lab 120 Hour Post-fertilization Zebrafish Assay for Any Mortality
and Malformation

1.2	Assay Summary: Tanguay_ZF_120hpf is a whole embryo, multiplexed endpoint assay using zebrafish larvae
exposed for 120 hours post fertilization on a 96-well plate. Tanguay_ZF_120hpf_ANY is one of one assay
component(s) measured or calculated from the Tanguay_ZF_120hpf assay. It is designed to make
measurements of developmental malformations and mortality as detected with brightfield microscopy and
combines scoring from 13 other assay components measuring more specific morphologies. Data from the assay
component Tanguay_ZF_120hpf_ANY was analyzed into 1 assay endpoints. This assay endpoint,
Tanguay_ZF_120hpf_ANY, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of morphology reporter, gain-of-signal activity can be used
to understand changes in developmental as they relate to the whole embryo. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the zebrafish development intended target family,
where the subfamily is embryonic morphogenesis and mortality.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Tanguay Lab, based at the Oregon State University Sinnhuber Aquatic Research Laboratory,
uses zebrafish as a systems toxicology model.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Assay is non-proprietary; observations were made using a custom photomotor response
analysis tool (PRAT), Viewpoint Zebralab, and Zebrafish acquisition and analysis program (ZAAP).

1.9	Assay Throughput: 96-well plate. The assay is conducted on 96-well plates with each plate containing 1, six hour
post-fertilization dechorionated embryo using an automated embryo placement system.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: An algorithm was used to combine results from 13 Tanquay_ZF_120hpf assay components.

Zebrafish (Danio rerio) is popular species in embryology, pharmacology and biomedical research and is
particularly amenable to large-scale screening of chemical libraries. These animals easy to rear and maintain
and they mature rapidly (6 days). Zebrafish are also are small enough for sustaining in 96-well microtiter plates.
These assays screened embryonic responses to chemical exposures by visually assessing multiple phenotypic
indicators of developmental interference, including malformations, failure to hatch, and mortality. There are
scientific advantages to assessing zebrafish as a prototype for delineating the functional activity of specific
biological pathways and their regulatory controls. Many key developmental signaling pathways and their
regulatory mechanisms are conserved between fish and mammals, making zebrafish toxicity assays a unique
integrative model of embryogenesis and highly adaptable to a medium throughput toxicity screening platform.

-------
2.2	Scientific Principles: The utilization of simultaneously measured endpoints means that the entire system
serves as a robust biological sensor for chemical hazard. The experimental design enables the description of
global patterns of variation across tested compounds, evaluation the concordance of the available in vitro and in
vivo data, can highlight specific mechanisms and novel biology related development, and demonstrate that the
developmental zebrafish detects adverse responses that would be missed by less comprehensive testing
strategies.

2.3	Experimental System: suspension NA whole embryo used. Dechorionated tropical 5D wild-type zebrafish (Danio
rerio) embryos placed 1 embryo per well in a 96-well plate. The parental fish were tropical 5D wild-type zebrafish
were housed at Oregon State University's Sinnhuber Aquatic Research Laboratory (SARL, Corvallis, OR) in
densities of 1000 fish per 100-gallon tank according to the Institutional Animal Care and Use Committee
protocols (Barton et al., 2016). Fish were maintained at 28C on a 14:10 h light/dark cycle in recirculating filtered
water, supplemented with Instant Ocean salts. Adult, larval and juvenile fish were fed with size-appropriate
GEMMA Micro food 2-3 times a day (Skretting). Spawning funnels were placed in the tanks the night prior, and
the following morning, embryos were collected and staged (Kimmel et al., 1995, Westerfield, 2007). Embryos
were maintained in embryo medium (EM) in an incubator at 28C until further processing. EM consisted of 15
mM NaCI, 0.5 mM KCI, 1 mM MgS04,0.15 mM KH2P04,0.05 mM Na2HP04, and 0.7 mM NaHC03 (Westerfield,
2000).

2.4	Metabolic Competence: Zebrafish provide a rapidly developing and easily maintained test organism which is
visually transparent through much of its embryonic development and has an elevated xenobiotic
biotransformation potential when compared to other commonly used models of developmental toxicity.
Zebrafish embryos (Danio rerio) were obtained from tropical 5D wild-type adult zebrafish were housed in at an
approximate density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State
University, Corvallis, OR. Zebrafish are a good model in which to study metabolism because they possess all the
key organs required for metabolic control in humans, from the appetite circuits that are present in the
hypothalamus, through to the pancreas and insulin-sensitive tissues [liver, muscle and white adipose tissue
(WAT)].

2.5	Exposure Regime: Zebrafish husbandry: Tropical 5D wild-type adult zebrafish were housed in at an approximate
density of 1000 per 100 gallon tank at the Sinnhuber Aquatic Research Laboratory, Oregon State University,
Corvallis, OR. Each tank was kept at standard laboratory conditions of 28C on a 14-h light/10-h dark photoperiod
in fish water consisting of reverse osmosis water supplemented with a commercially available salt (Instant
Ocean). Spawning funnels were placed into the tanks the night prior, and embryos were collected and staged.
To increase bioavailability, the chorion was enzymatically removed using pronase (63.6mg/ml, > 3.5U/mg,
Sigma-Aldrich: P5147) at 4 hours post fertilization (hpf) using a custom automated dechorionator. Chemical
exposures: Six hpf dechorionated embryos were placed 1 embryo per well in a 96-well plate prefilled with 90 ul
of EM using automated embryo placement systems (AEPS). Ten microliters of each row of dilution plate 2 was
added to 2 exposure plates. The final DMSO concentration used was 0.64% (vol/vol). Thirty-two embryos were
also exposed to 5uM trimethyltin chloride (positive control). Plates were sealed to prevent evaporation and foil
covered to reduce light exposure and kept in a 28C incubator. Embryos were statically exposed until 120 hpf. At
24 hpf, embryos were assessed for photomotor response using a custom photomotor response analysis tool
(PRAT) and for developmental toxicity endpoints (M024: mortality at 24 hpf, DP: developmental progression,
SM: spontaneous movement, and NC: notochord distortion). At 120 hpf, locomotor activity was measured using
Viewpoint Zebralab and assessed for 13 endpoints. Zebrafish acquisition and analysis program (ZAAP), a custom
program designed to inventory, acquire, and manage zebrafish data, was used to collect developmental
endpoints as either present or absent (i.e., binary responses were recorded). If mortality occurred for an embryo
(at either 24 or 120 hpf), the non-mortality endpoints were not measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Target (nominal) number of replicates:
2

-------
Standard minimum concentration tested:

0.015 nM
Key positive control:

NA

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 5
Response cutoff threshold used to determine hit calls: 25
Detection technology used: light microscopy (Microscopy)

2.6	Response: The raw data from the larval assessments consisted of an assigned binary (0 or 1) response to every
larvae observed for each of the thirteen endpoints. Responses across all these endpoints were collapsed into a
singular binary (0 or 1) morphology endpoint named 'ANY1.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Developmental Toxicity: Assays associated with developmental toxicity, Non-mammalian Vertebrate:
Assays associated with non-mammalian vertebrate species

Additionally, this assay was annotated to the intended target family of zebrafish development.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: A benchmark dose (BMD) approach was applied to morphologically bioactive samples on the
summary statistic of 'ANY' endpoint using a parametric curve fitting process. Briefly, this data was modeled
using the unrestricted 3-parameter log-logistic model for dichotomous data with "extra risk" and was
implemented following guidelines from the EPA BMDS v3.2 manual. A benchmark response (BMR) was defined
as a 10% change relative to the background response and BMD10 values were calculated.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

23: agg.percent.rep.spid.mini (Use for binary data. Aggregate technical replicates as percentage by
taking the sum of hits relative to total replicates per sample id and concentration index, where at least
one hit is required.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

5: bmad5.onesdl6.static (Replace baseline median absolute deviation (bmad) with 5 and one standard
deviation (osd) of the normalized response for test compound wells (wilt = t) with a concentration index
(cndx) of 1 or 2 with 16. Typically used for binary data where values would otherwise be 0; non-zero
values are required for tcplfit2 processing.), 6: no.unbounded.models (Exclude unbounded models and
only fit data to bounded models (hill, gnls, exp4 and exp5).)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

18: pc25 (Add a cutoff value of 25. Typically for percent of control data.), 28: ow_bidirectional_gain
(Multiply winning model hitcall (hitc) by -1 for models fit in the negative analysis direction. Typically
used for endpoints where only positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 139

Number of chemicals tested: 139

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
33

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	NA%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 40.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Truong, L., Rericha, Y., Thunga, P., Marvel, S., Wallis, D., Simonich, M. T., Field, J. A., Cao, D., Reif,
D. M., & Tanguay, R. L. (2022). Systematic developmental toxicity assessment of a structurally diverse library of
PFAS in zebrafish. Journal of hazardous materials, 431,128615. https://doi.Org/10.1016/j.jhazmat.2022.128615

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 759

TOX21_AR_BLA_Agonist_chl

1.	General Information

1.1	Assay Title: Tox21 HEK293T Androgen Receptor (AR) Agonism Beta-lactamase Assay, Channel 1 Readout of
Uncleaved Substrate

1.2	Assay Summary: TOX21_AR_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-ar-
bla-agonist-pl. TOX21_AR_BLA_Agonist_chl is an assay readout measuring reporter gene via receptor activity
and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter
gene. The signal is derived from the uncleaved reporter gene substrate and is used to calculate the ratio of
cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay
component TOX21_AR_BLA_Agonist_chl was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_AR_BLA_Agonist_chl, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene AR. Furthermore, this assay endpoint can
be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic AR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_AR_BLA_Agonist_chl was designed to measure uncleaved reporter gene substrate to
target nuclear receptor activity at the protein (receptor) level, specifically mapping to AR gene(s) using a positive
control of R1881

The Tox21 androgen receptor agonism beta-lactamase assay screened a library of diverse environmental
chemicals to probe for xenobiotic androgen receptor ligand-binding and potential to induce androgen-
dependent transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid

-------
GAL4 system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the
assay and following 37 hour incubation of cells with test compounds a membrane-permeable FRET-based
substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run
at interfering wavelengths to allow for background artifact detection. Concentration-response relationships
were determined by monitoring luminescent signals relative to DMSO-only exposures which provided a signal
baseline, and to a known androgen receptor agonist (Methyltrienolone) as a positive control, which provided
an indication of 100 percent androgen receptor activation.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer AR-UAS-bla GripTite 293 cells contain the
ligand-binding domain (LBD) of the rat androgen receptor (AR) fused to the DNA-binding domain of GAL4 stably
integrated in the GeneBLAzer UAS-bla GripTite 293 cell line. This portion of the rat AR is identical to the human
AR in the conserved LBD and differs from the human sequence at five residues in the hinge region. These cells
stably express a -beta-lactamase reporter gene under the transcriptional control of an upstream activator
sequence (UAS). When an agonist binds to the LBD of the GAL4 (DBD)-AR (LBD) fusion protein, the protein binds
to the UAS, resulting in expression of-beta-lactamase. The HEK-293 cell line is a human embryonic kidney cell
line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973. The
transformation incorporated approximately 4.5 kilobases from the viral genome into human chromosome 19 of
the HEK cells, and subsequent cytogenetic characterization established that the 293 line is pseudotriploid.
HEK293 cells are popular for their ease of growth and transfection cells and are frequently used to produce
exogenous proteins or viruses for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cell culture is maintained by passaging twice a week and should
not reach more than 90% confluence. The assay should be performed in black-clear bottom 1536 well plates, so
the bottom of the plates should not be touched. Cell Media Required: Growth (90% DMEM -- high glucose, 10%
Dialyzed FBS, 25mM HEPES, O.lmN NEAA, 1 mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and
Streptomycin, 80ug/ml Hygromycin, 80ug/ml Zeocin), Assay (90% Opti-MEM, 10% Dialyzed FB, O.lmN NEAA, 1
mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and Streptomycin), Thaw (90% DMEM - high glucose,
10% Dialyzed FB, 25mM HEPES, O.lmN NEAA, 1 mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and
Streptomycin), Freezing (100% Recovery Cell Culture Freezing Medium) Thawing method: 1ml frozen cells of
AR-bla were taken in pre-warmed 10ml of thaw medium for centrifuging. Thaw medium is used to re-suspend
the pellet. Seed the cells at 2 million per T-75 flask with thaw medium. Cell Proliferation Method: Detach the
cells from the flask using 0.05% Trypsin. The cells are re-seeded in T-225 flask at 3-4 million Assay Protocol: Spin
down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend the pellet with assay medium. Plate
the cells in black-clear bottom 1536 well plate at 2000/well/6uL through 8 tip Multidrop plate dispenser.
Incubate at 37C for 5hrs. Transfer 23nL of compounds from the library collection and positive control to the
assay plates through Pintool. Incubate at 37C for 16hrs. Add luL of CCF4 (FRET Substrate) dye using a single tip
plate dispenser (BioRAPTR). Incubate at room temperature for 2hrs. Read the fluorescence intensity through
Envision plate reader.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:
0.00979371647509579 nM

Target (nominal) number of replicates:

3

Standard maximum concentration tested:
765.134099616858 nM

-------
Key positive control:	Neutral vehicle control:

R1881	DMSO

Baseline median absolute deviation for the assay (bmad): 3.807
Response cutoff threshold used to determine hit calls: 22.84
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Androgen receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
R1881 was used as a positive AR agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series

-------
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.184

Neutral control median absolute deviation, by plate: nmad	&.21

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3441.58%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-8.862

Positive control well median absolute deviation, by plate: pmad	5.811

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.973

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 638.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Kleinstreuer NC, Ceger P, Watt ED, Martin
M, Houck K, Browne P, Thomas RS, Casey WM, Dix DJ, Allen D, Sakamuru S, Xia M, Huang R, Judson R.
Development and Validation of a Computational Model for Androgen Receptor Activity. Chem Res Toxicol. 2017
Apr 17;30(4):946-964. doi: 10.1021/acs.chemrestox.6b00347. Epub 2016 Dec 9. PMID: 27933809; PMCID:
PMC5396026.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
-------
Assay Endpoint ID: 760

TOX21_AR_BLA_Agonist_ch2

1.	General Information

1.1	Assay Title: Tox21 HEK293T Androgen Receptor (AR) Agonism Beta-lactamase Assay, Channel 2 Readout of
Cleaved Substrate

1.2	Assay Summary: TOX21_AR_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-ar-
bla-agonist-pl. TOX21_AR_BLA_Agonist_ch2 is an assay readout measuring reporter gene via receptor activity
and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter
gene. The signal is derived from the cleaved reporter gene substrate and is used to calculate the ratio of cleaved
(ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay component
TOX21_AR_BLA_Agonist_ch2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_AR_BLA_Agonist_ch2, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, increased activity can be used to
understand changes in the reporter gene as they relate to the gene AR. Furthermore, this assay endpoint can
be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic AR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_AR_BLA_Agonist_ch2 was designed to measure cleaved reporter gene substrate to target
nuclear receptor activity at the protein (receptor) level, specifically mapping to AR gene(s) using a positive
control of R1881

The Tox21 androgen receptor agonism beta-lactamase assay screened a library of diverse environmental
chemicals to probe for xenobiotic androgen receptor ligand-binding and potential to induce androgen-
dependent transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid

-------
GAL4 system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the
assay and following 37 hour incubation of cells with test compounds a membrane-permeable FRET-based
substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run
at interfering wavelengths to allow for background artifact detection. Concentration-response relationships
were determined by monitoring luminescent signals relative to DMSO-only exposures which provided a signal
baseline, and to a known androgen receptor agonist (Methyltrienolone) as a positive control, which provided
an indication of 100 percent androgen receptor activation.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer AR-UAS-bla GripTite 293 cells contain the
ligand-binding domain (LBD) of the rat androgen receptor (AR) fused to the DNA-binding domain of GAL4 stably
integrated in the GeneBLAzer UAS-bla GripTite 293 cell line. This portion of the rat AR is identical to the human
AR in the conserved LBD and differs from the human sequence at five residues in the hinge region. These cells
stably express a -beta-lactamase reporter gene under the transcriptional control of an upstream activator
sequence (UAS). When an agonist binds to the LBD of the GAL4 (DBD)-AR (LBD) fusion protein, the protein binds
to the UAS, resulting in expression of-beta-lactamase. The HEK-293 cell line is a human embryonic kidney cell
line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973. The
transformation incorporated approximately 4.5 kilobases from the viral genome into human chromosome 19 of
the HEK cells, and subsequent cytogenetic characterization established that the 293 line is pseudotriploid.
HEK293 cells are popular for their ease of growth and transfection cells and are frequently used to produce
exogenous proteins or viruses for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cell culture is maintained by passaging twice a week and should
not reach more than 90% confluence. The assay should be performed in black-clear bottom 1536 well plates, so
the bottom of the plates should not be touched. Cell Media Required: Growth (90% DMEM -- high glucose, 10%
Dialyzed FBS, 25mM HEPES, O.lmN NEAA, 1 mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and
Streptomycin, 80ug/ml Hygromycin, 80ug/ml Zeocin), Assay (90% Opti-MEM, 10% Dialyzed FB, O.lmN NEAA, 1
mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and Streptomycin), Thaw (90% DMEM - high glucose,
10% Dialyzed FB, 25mM HEPES, O.lmN NEAA, 1 mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and
Streptomycin), Freezing (100% Recovery Cell Culture Freezing Medium) Thawing method: 1ml frozen cells of
AR-bla were taken in pre-warmed 10ml of thaw medium for centrifuging. Thaw medium is used to re-suspend
the pellet. Seed the cells at 2 million per T-75 flask with thaw medium. Cell Proliferation Method: Detach the
cells from the flask using 0.05% Trypsin. The cells are re-seeded in T-225 flask at 3-4 million Assay Protocol: Spin
down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend the pellet with assay medium. Plate
the cells in black-clear bottom 1536 well plate at 2000/well/6uL through 8 tip Multidrop plate dispenser.
Incubate at 37C for 5hrs. Transfer 23nL of compounds from the library collection and positive control to the
assay plates through Pintool. Incubate at 37C for 16hrs. Add luL of CCF4 (FRET Substrate) dye using a single tip
plate dispenser (BioRAPTR). Incubate at room temperature for 2hrs. Read the fluorescence intensity through
Envision plate reader.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:
0.00979371647509579 nM

Target (nominal) number of replicates:

3

Standard maximum concentration tested:
765.134099616858 nM

-------
Key positive control:	Neutral vehicle control:

R1881	DMSO

Baseline median absolute deviation for the assay (bmad): 2.128
Response cutoff threshold used to determine hit calls: 20
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Androgen receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
R1881 was used as a positive AR agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series

-------
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.132

Neutral control median absolute deviation, by plate: nmad	4.498

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3377.39%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	99.5

Positive control well median absolute deviation, by plate: pmad	21.714

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	4.476

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 558.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Kleinstreuer NC, Ceger P, Watt ED, Martin
M, Houck K, Browne P, Thomas RS, Casey WM, Dix DJ, Allen D, Sakamuru S, Xia M, Huang R, Judson R.
Development and Validation of a Computational Model for Androgen Receptor Activity. Chem Res Toxicol. 2017
Apr 17;30(4):946-964. doi: 10.1021/acs.chemrestox.6b00347. Epub 2016 Dec 9. PMID: 27933809; PMCID:
PMC5396026.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
-------
Assay Endpoint ID: 761

TOX21_AR_BLA_Agonist_ratio

1.	General Information

1.1	Assay Title: Tox21 HEK293T Androgen Receptor (AR) Agonism Beta-lactamase Assay, Ratio

1.2	Assay Summary: TOX21_AR_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-ar-
bla-agonist-pl. TOX21_AR_BLA_Agonist_ratio is an assay readout measuring reporter gene via receptor activity
and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter
gene. The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of
target activity. Data from the assay component TOX21_AR_BLA_Agonist_ratio was analyzed into 1 assay
endpoint. This assay endpoint, TOX21_AR_BLA_Agonist_ratio, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter,
gain-of-signal activity can be used to understand changes in the reporter gene as they relate to the gene AR.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints and this ratio serves a reporter gene function to understand target activity. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the nuclear
receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic AR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_AR_BLA_Agonist_ratio was designed to target nuclear receptor activity at the protein
(receptor) level, specifically mapping to AR gene(s) using a positive control of R1881

The Tox21 androgen receptor agonism beta-lactamase assay screened a library of diverse environmental
chemicals to probe for xenobiotic androgen receptor ligand-binding and potential to induce androgen-
dependent transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid
GAL4 system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the

-------
assay and following 37 hour incubation of cells with test compounds a membrane-permeable FRET-based
substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run
at interfering wavelengths to allow for background artifact detection. Concentration-response relationships
were determined by monitoring luminescent signals relative to DMSO-only exposures which provided a signal
baseline, and to a known androgen receptor agonist (Methyltrienolone) as a positive control, which provided
an indication of 100 percent androgen receptor activation.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Androgen receptors have pleiotropic regulatory roles in a diverse
range of tissues; particularly in mediating the activity of endogenous androgens in the hypothalamus, pituitary,
liver, prostate, and testicular tissues. Endogenous androgens are important for target gene expression in
physiological processes like developmental differentiation of the male embryo, initiation and maintenance of
spermatogenesis and in neuroendocrine system functioning. Endogenous androgens also influence male
pubertal maturation and maintenance of secondary sexual characteristics in adults. Disruption of androgen
signaling following exposure to endocrine-mimicking environmental chemicals can result in hormonal cancers,
impaired reproductive development and infertility. This assay is designed to screen a large, structurally diverse
chemical library to identify compounds capable of interference with endogenous androgenic signaling
pathways. The Tox21 AR bla assays are qHTS format assays which measured the ability of a chemical to interact
with AR by monitoring modulation of fluorescence reporter gene signals. This assay utilized a human embryonic
kidney cell line (HEK293T) which expresses rat AR in a one-hybrid GAL4 system to quantify xenobiotic androgen
receptor agonism. This assay is intended for use as a part of an integrated testing strategy, to screen a large
structurally diverse chemical library for compounds with the potential to interact with androgen receptor
mediated pathways and potentially affect endocrine systems in exposed populations. There is strong evidence
that androgen receptor agonism is a molecular initiating event (MIE) in an Adverse Outcome Pathway (AOP)
leading to reproductive dysfunction in fish populations (EAGMST Approved AOP), and there is some evidence
that androgen receptor activation is the MIE for a putative pathway leading to hepatocellular adenomas and
carcinomas (in mouse and rat models) (AOP currently under development). Chemical-activity profiles derived
from this assay can inform prioritization decisions for compound selection in more resource intensive in vivo
studies to further investigate the involvement of AR agonism in pathways leading to hazardous outcomes in
biological systems.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer AR-UAS-bla GripTite 293 cells contain the
ligand-binding domain (LBD) of the rat androgen receptor (AR) fused to the DNA-binding domain of GAL4 stably
integrated in the GeneBLAzer UAS-bla GripTite 293 cell line. This portion of the rat AR is identical to the human
AR in the conserved LBD and differs from the human sequence at five residues in the hinge region. These cells
stably express a -beta-lactamase reporter gene under the transcriptional control of an upstream activator
sequence (UAS). When an agonist binds to the LBD of the GAL4 (DBD)-AR (LBD) fusion protein, the protein binds
to the UAS, resulting in expression of-beta-lactamase. The HEK-293 cell line is a human embryonic kidney cell
line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973. The
transformation incorporated approximately 4.5 kilobases from the viral genome into human chromosome 19 of
the HEK cells, and subsequent cytogenetic characterization established that the 293 line is pseudotriploid.

-------
HEK293 cells are popular for their ease of growth and transfection cells and are frequently used to produce
exogenous proteins or viruses for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cell culture is maintained by passaging twice a week and should
not reach more than 90% confluence. The assay should be performed in black-clear bottom 1536 well plates, so
the bottom of the plates should not be touched. Cell Media Required: Growth (90% DMEM -- high glucose, 10%
Dialyzed FBS, 25mM HEPES, O.lmN NEAA, 1 mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and
Streptomycin, 80ug/ml Hygromycin, 80ug/ml Zeocin), Assay (90% Opti-MEM, 10% Dialyzed FB, O.lmN NEAA, 1
mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and Streptomycin), Thaw (90% DMEM - high glucose,
10% Dialyzed FB, 25mM HEPES, O.lmN NEAA, 1 mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and
Streptomycin), Freezing (100% Recovery Cell Culture Freezing Medium) Thawing method: 1ml frozen cells of
AR-bla were taken in pre-warmed 10ml of thaw medium for centrifuging. Thaw medium is used to re-suspend
the pellet. Seed the cells at 2 million per T-75 flask with thaw medium. Cell Proliferation Method: Detach the
cells from the flask using 0.05% Trypsin. The cells are re-seeded in T-225 flask at 3-4 million Assay Protocol: Spin
down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend the pellet with assay medium. Plate
the cells in black-clear bottom 1536 well plate at 2000/well/6uL through 8 tip Multidrop plate dispenser.
Incubate at 37C for 5hrs. Transfer 23nL of compounds from the library collection and positive control to the
assay plates through Pintool. Incubate at 37C for 16hrs. Add luL of CCF4 (FRET Substrate) dye using a single tip
plate dispenser (BioRAPTR). Incubate at room temperature for 2hrs. Read the fluorescence intensity through
Envision plate reader.

Baseline median absolute deviation for the assay (bmad): 3.303

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Androgen receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast AR Pathway Model: Androgen receptor assays used in ToxCast AR Pathway model. See
10.1016/j.yrtph.2020.104764 and 10.1021/acs.chemrestox.6b00347

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

R1881

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

-------
Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
R1881 was used as a positive AR agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test

-------
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
625

Inactive hit count: 0
-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

32

2420

554

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-0.198

Neutral control median absolute deviation, by plate: nmad

6.374

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-2854.01%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

99.355
21.535

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	4.376

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 554.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Kleinstreuer NC, Ceger P, Watt ED, Martin
M, Houck K, Browne P, Thomas RS, Casey WM, Dix DJ, Allen D, Sakamuru S, Xia M, Huang R, Judson R.
Development and Validation of a Computational Model for Androgen Receptor Activity. Chem Res Toxicol. 2017
Apr 17;30(4):946-964. doi: 10.1021/acs.chemrestox.6b00347. Epub 2016 Dec 9. PMID: 27933809; PMCID:
PMC5396026.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 762

T0X2 l_AR_BLA_Antagon ist_ratio

1.	General Information

1.1	Assay Title: Tox21 HEK293T Androgen Receptor (AR) Antagonism Beta-lactamase Assay, Ratio

1.2	Assay Summary: TOX21 AR BLA Antagonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-ar-
bla-antagonist-pl. TOX21_AR_BLA_Antagonist_ratio is an assay readout measuring reporter gene via receptor
activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase
reporter gene. The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) reporter gene substrate
used as the measure of target activity. Data from the assay component TOX21_AR_BLA_Antagonist_ratio was
analyzed into 1 assay endpoint. This assay endpoint, TOX21_AR_BLA_Antagonist_ratio, was analyzed in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of inducible reporter, loss-of-signal activity can be used to understand changes in the reporter gene as they
relate to the gene AR. Furthermore, this assay endpoint can be referred to as a primary readout, because this
assay has produced multiple assay endpoints and this ratio serves a reporter gene function to understand target
activity. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
nuclear receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic AR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_AR_BLA_Antagonist_ratio was designed to target nuclear receptor activity at the protein
(receptor) level, specifically mapping to AR gene(s) using a positive control of Cyproterone acetate

The Tox21 androgen receptor antagonism beta-lactamase assay screened a library of diverse environmental
compounds to probe for xenobiotic androgen receptor ligand-binding and potential to suppress androgen-
dependent transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid
GAL4 system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the

-------
assay and following 16 hour incubation of cells with test compounds a membrane-permeable FRET-based
substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Each compound was tested in a concentration-response format, using 15 concentrations
ranging from 1.1 nM to 92 uM. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection. Concentration-response
relationships were determined by monitoring FRET signals relative to DMSO-only exposures which provided a
signal baseline, and to a known androgen receptor antagonist (Cyproterone acetate) as a positive control which
provided a reference for 100 percent androgen receptor inhibition, as assessed in the presence of 0.01 uM
R1881, a known AR agonist.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Androgen receptors have pleiotropic regulatory roles in a diverse
range of tissues; particularly in mediating the activity of endogenous androgens in the hypothalamus, pituitary,
liver, prostate, and testicular tissues. Endogenous androgens are important for target gene expression in
physiological processes like developmental differentiation of the male embryo, initiation and maintenance of
spermatogenesis and in neuroendocrine system functioning. Endogenous androgens also influence male
pubertal maturation and maintenance of secondary sexual characteristics in adults. Disruption of androgen
signaling following exposure to endocrine-mimicking environmental chemicals can result in hormonal cancers,
impaired reproductive development and infertility. This assay is designed to screen a large, structurally diverse
chemical library to identify compounds capable of interference with endogenous androgenic signaling
pathways. The Tox21 AR bla assays are qHTS format assays which measured the ability of a chemical to interact
with AR by monitoring modulation of fluorescence reporter gene signals. This assay utilized a human embryonic
kidney cell line (HEK293T) which expresses rat AR in a one-hybrid GAL4 system to quantify xenobiotic androgen
receptor agonism. This assay is intended for use as a part of an integrated testing strategy, to screen a large
structurally diverse chemical library for compounds with the potential to interact with androgen receptor
mediated pathways and potentially affect endocrine systems in exposed populations. Chemical-activity profiles
derived from this assay can inform prioritization decisions for compound selection in more resource intensive
in vivo studies to further investigate the involvement of AR antagonism in pathways leading to hazardous
outcomes in biological systems. This experimental system expresses a fusion protein of a rodent androgen
receptor ligand-binding domain coupled to GAL4 DNA-binding domain which when activated by xenobiotic
compounds stimulates -beta-lactamase reporter gene expression, and AR antagonism by test compounds
results in decreased signal production relative to cyproterone acetate in the presence of a known AR agonist
(R1881). To detect loss of signal due to compound cytotoxicity, a CellTiter-Glo fluorescence assay to measure
ATP production was run concurrently in all wells using tetraoctylammonium bromide as a positive control for
cell death.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer AR-UAS-bla GripTite 293 cells contain the
ligand-binding domain (LBD) of the rat androgen receptor (AR) fused to the DNA-binding domain of GAL4 stably
integrated in the GeneBLAzer UAS-bla GripTite 293 cell line. This portion of the rat AR is identical to the human
AR in the conserved LBD and differs from the human sequence at five residues in the hinge region. These cells
stably express a -beta-lactamase reporter gene under the transcriptional control of an upstream activator
sequence (UAS). When an agonist binds to the LBD of the GAL4 (DBD)-AR (LBD) fusion protein, the protein binds
to the UAS, resulting in expression of-beta-lactamase. The HEK-293 cell line is a human embryonic kidney cell

-------
line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973. The
transformation incorporated approximately 4.5 kilobases from the viral genome into human chromosome 19 of
the HEK cells, and subsequent cytogenetic characterization established that the 293 line is pseudotriploid.
HEK293 cells are popular for their ease of growth and transfection cells and are frequently used to produce
exogenous proteins or viruses for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cell culture is maintained by passaging twice a week and should
not reach more than 90% confluence. The assay should be performed in black-clear bottom 1536 well plates, so
the bottom of the plates should not be touched. Cell Media Required: Growth (90% DMEM -- high glucose, 10%
Dialyzed FBS, 25mM HEPES, O.lmN NEAA, 1 mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and
Streptomycin, 80ug/ml Hygromycin, 80ug/ml Zeocin), Assay (90% Opti-MEM, 10% Dialyzed FB, O.lmN NEAA, 1
mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and Streptomycin), Thaw (90% DMEM - high glucose,
10% Dialyzed FB, 25mM HEPES, O.lmN NEAA, 1 mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and
Streptomycin), Freezing (100% Recovery Cell Culture Freezing Medium) Thawing method: 1ml frozen cells of
AR-bla were taken in pre-warmed 10ml of thaw medium for centrifuging. Thaw medium is used to re-suspend
the pellet. Seed the cells at 2 million per T-75 flask with thaw medium. Cell Proliferation Method: Detach the
cells from the flask using 0.05% Trypsin. The cells are re-seeded in T-225 flask at 3-4 million Assay Protocol: Spin
down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend the pellet with assay medium. Plate
the cells in black-clear bottom 1536 well plate at 2000/well/6uL through 8 tip Multidrop plate dispenser.
Incubate at 37C for 5hrs. Transfer 23nL of compounds from the library collection and positive control to the
assay plates through Pintool. Incubate at 37C for 16hrs. Add luL of CCF4 (FRET Substrate) dye using a single tip
plate dispenser (BioRAPTR). Incubate at room temperature for 2hrs. Read the fluorescence intensity through
Envision plate reader.

Baseline median absolute deviation for the assay (bmad): 4.349
Response cutoff threshold used to determine hit calls: 26.092
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Androgen receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

Cyproterone acetate

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

-------
2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast AR Pathway Model: Androgen receptor assays used in ToxCast AR Pathway model. See
10.1016/j.yrtph.2020.104764 and 10.1021/acs.chemrestox.6b00347

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Cyproterone acetate was used as a positive AR antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -

-------
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1915

Inactive hit count: 0
-------
power(pow) model:
linear-polynomial (polyl) model:

712

3381

quadratic-polynomialfpoly2) model:	1270

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

371

81

2519

860

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-------
-0.188

Neutral control median absolute deviation, by plate: nmad	6.931

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3698.26%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-100.309

Positive control well median absolute deviation, by plate: pmad	3.043

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-13.308

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 860.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Kleinstreuer NC, Ceger P, Watt ED, Martin
M, Houck K, Browne P, Thomas RS, Casey WM, Dix DJ, Allen D, Sakamuru S, Xia M, Huang R, Judson R.
Development and Validation of a Computational Model for Androgen Receptor Activity. Chem Res Toxicol. 2017
Apr 17;30(4):946-964. doi: 10.1021/acs.chemrestox.6b00347. Epub 2016 Dec 9. PMID: 27933809; PMCID:
PMC5396026.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 763

T0X2 l_AR_BLA_Antagon ist_via bi I ity

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HEK293T Androgen Receptor (AR) Antagonism Beta-lactamase
Assay

1.2	Assay Summary: TOX21 AR BLA Antagonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-ar-
bla-antagonist-pl. TOX21_AR_BLA_Antagonist_viability is an assay readout measuring cellular ATP content and
detected with CellTiter-Glo Luciferase-coupled ATP quantitation. TOX21_AR_BLA_Antagonist_viability used a
type of viability reporter where loss-of-signal activity can be used to understand changes in the cell viability.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a viability function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily
is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic AR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 androgen receptor antagonism beta-lactamase assay screened a library of diverse environmental
compounds to probe for xenobiotic androgen receptor ligand-binding and potential to suppress androgen-
dependent transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid
GAL4 system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the
assay and following 16 hour incubation of cells with test compounds a membrane-permeable FRET-based
substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap the

-------
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Each compound was tested in a concentration-response format, using 15 concentrations
ranging from 1.1 nM to 92 uM. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection. Concentration-response
relationships were determined by monitoring FRET signals relative to DMSO-only exposures which provided a
signal baseline, and to a known androgen receptor antagonist (Cyproterone acetate) as a positive control which
provided a reference for 100 percent androgen receptor inhibition, as assessed in the presence of 0.01 uM
R1881, a known AR agonist.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Androgen receptors have pleiotropic regulatory roles in a diverse
range of tissues; particularly in mediating the activity of endogenous androgens in the hypothalamus, pituitary,
liver, prostate, and testicular tissues. Endogenous androgens are important for target gene expression in
physiological processes like developmental differentiation of the male embryo, initiation and maintenance of
spermatogenesis and in neuroendocrine system functioning. Endogenous androgens also influence male
pubertal maturation and maintenance of secondary sexual characteristics in adults. Disruption of androgen
signaling following exposure to endocrine-mimicking environmental chemicals can result in hormonal cancers,
impaired reproductive development and infertility. This assay is designed to screen a large, structurally diverse
chemical library to identify compounds capable of interference with endogenous androgenic signaling
pathways. The Tox21 AR bla assays are qHTS format assays which measured the ability of a chemical to interact
with AR by monitoring modulation of fluorescence reporter gene signals. This assay utilized a human embryonic
kidney cell line (HEK293T) which expresses rat AR in a one-hybrid GAL4 system to quantify xenobiotic androgen
receptor agonism. This assay is intended for use as a part of an integrated testing strategy, to screen a large
structurally diverse chemical library for compounds with the potential to interact with androgen receptor
mediated pathways and potentially affect endocrine systems in exposed populations. Chemical-activity profiles
derived from this assay can inform prioritization decisions for compound selection in more resource intensive
in vivo studies to further investigate the involvement of AR antagonism in pathways leading to hazardous
outcomes in biological systems. This experimental system expresses a fusion protein of a rodent androgen
receptor ligand-binding domain coupled to GAL4 DNA-binding domain which when activated by xenobiotic
compounds stimulates -beta-lactamase reporter gene expression, and AR antagonism by test compounds
results in decreased signal production relative to cyproterone acetate in the presence of a known AR agonist
(R1881). To detect loss of signal due to compound cytotoxicity, a CellTiter-Glo fluorescence assay to measure
ATP production was run concurrently in all wells using tetraoctylammonium bromide as a positive control for
cell death.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer AR-UAS-bla GripTite 293 cells contain the
ligand-binding domain (LBD) of the rat androgen receptor (AR) fused to the DNA-binding domain of GAL4 stably
integrated in the GeneBLAzer UAS-bla GripTite 293 cell line. This portion of the rat AR is identical to the human
AR in the conserved LBD and differs from the human sequence at five residues in the hinge region. These cells
stably express a -beta-lactamase reporter gene under the transcriptional control of an upstream activator
sequence (UAS). When an agonist binds to the LBD of the GAL4 (DBD)-AR (LBD) fusion protein, the protein binds
to the UAS, resulting in expression of-beta-lactamase. The HEK-293 cell line is a human embryonic kidney cell
line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973. The
transformation incorporated approximately 4.5 kilobases from the viral genome into human chromosome 19 of

-------
the HEK cells, and subsequent cytogenetic characterization established that the 293 line is pseudotriploid.
HEK293 cells are popular for their ease of growth and transfection cells and are frequently used to produce
exogenous proteins or viruses for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cell culture is maintained by passaging twice a week and should
not reach more than 90% confluence. The assay should be performed in black-clear bottom 1536 well plates, so
the bottom of the plates should not be touched. Cell Media Required: Growth (90% DMEM -- high glucose, 10%
Dialyzed FBS, 25mM HEPES, O.lmN NEAA, 1 mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and
Streptomycin, 80ug/ml Hygromycin, 80ug/ml Zeocin), Assay (90% Opti-MEM, 10% Dialyzed FB, O.lmN NEAA, 1
mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and Streptomycin), Thaw (90% DMEM - high glucose,
10% Dialyzed FB, 25mM HEPES, O.lmN NEAA, 1 mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and
Streptomycin), Freezing (100% Recovery Cell Culture Freezing Medium) Thawing method: 1ml frozen cells of
AR-bla were taken in pre-warmed 10ml of thaw medium for centrifuging. Thaw medium is used to re-suspend
the pellet. Seed the cells at 2 million per T-75 flask with thaw medium. Cell Proliferation Method: Detach the
cells from the flask using 0.05% Trypsin. The cells are re-seeded in T-225 flask at 3-4 million Assay Protocol: Spin
down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend the pellet with assay medium. Plate
the cells in black-clear bottom 1536 well plate at 2000/well/6uL through 8 tip Multidrop plate dispenser.
Incubate at 37C for 5hrs. Transfer 23nL of compounds from the library collection and positive control to the
assay plates through Pintool. Incubate at 37C for 16hrs. Add luL of CCF4 (FRET Substrate) dye using a single tip
plate dispenser (BioRAPTR). Incubate at room temperature for 2hrs. Read the fluorescence intensity through
Envision plate reader.

Baseline median absolute deviation for the assay (bmad): 3.843
Response cutoff threshold used to determine hit calls: 23.059

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Androgen receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

-------
Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region, ToxCast AR Pathway Model: Androgen
receptor assays used in ToxCast AR Pathway model. See 10.1016/j.yrtph.2020.104764 and
10.1021/acs.chemrestox.6b00347

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning

-------
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1108

Inaclive hit count: 0
-------
For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-0.186

Neutral control median absolute deviation, by plate: nmad

7.506

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-4055.92%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

NA

NA

Z Prime Factor for median positive and neutral control across all plates:

NA

-------
(1 - ((3 * (pmad + nmad)) / abs(pmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 666.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

• Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

-------
•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Kleinstreuer NC, Ceger P, Watt ED, Martin
M, Houck K, Browne P, Thomas RS, Casey WM, Dix DJ, Allen D, Sakamuru S, Xia M, Huang R, Judson R.
Development and Validation of a Computational Model for Androgen Receptor Activity. Chem Res Toxicol. 2017
Apr 17;30(4):946-964. doi: 10.1021/acs.chemrestox.6b00347. Epub 2016 Dec 9. PMID: 27933809; PMCID:
PMC5396026.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 764

T0X21_AR_LU C_M DAKB2_Agon ist

1.	General Information

1.1	Assay Title: Tox21 MDa-kb2 Androgen Receptor (AR) Agonism Luciferase Assay

1.2	Assay Summary: TOX21 AR LUC MDAKB2 Agonist is a cell-based, single-readout assay that uses MDA-kb2, a
human breast cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See
tox21-ar-mda-kb2-luc-agonist-pl. TOX21_AR_LUC_MDAKB2_Agonist is one of one assay component(s)
measured or calculated from the TOX21_AR_LUC_MDAkb2_Agonist assay. It is designed to make measurements
of luciferase induction, a form of inducible reporter, as detected with bioluminescence signals by CellTiter-Glo
Luciferase-coupled ATP quantitation technology. Data from the assay component
TOX21_AR_LUC_MDAKB2_Agonist was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_AR_LUC_MDAKB2_Agonist, was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be
used to understand changes in the reportergene as they relate to the gene AR. Furthermore, this assay endpoint
can be referred to as a primary readout, because the performed assay has only produced 1 assay endpoint. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the nuclear
receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected MDA-kb2 cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring luminescence resulting from AR
gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction involving the key
substrate [One-Glo] are indicative of changes in transcriptional gene expression due to agonist activity regulated
by the human androgen receptor [GeneSymbohAR | GenelD:367 | Uniprot_SwissProt_Accession:P10275],

The Tox21 androgen receptor agonism luciferase assay screened a large library of diverse environmental
compounds to probe for xenobiotic androgenic activity and potential to induce androgen-dependent
transcription, monitored through luciferase reporter gene signal activation using an AR-luciferase reporter gene

-------
construct. The assay is run in triplicate on a 1536-well microplate and bioluminescence was measured following
16 hour incubation of cells with test compounds and 30 min incubation of test system with ONE-GloTM
luciferase assay reagent. The bioluminescent signal was monitored using a Promega ViewLux plate reader.
Following the incubation period, the cell culture was screened for bioluminescent signals in agonist mode using
luciferase ATP detection technology. Each compound was tested in a concentration-response format, using 15
concentrations ranging from 1.1 nM to 92 uM. Compound auto-fluorescence was monitored in various
"TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Androgen receptors have pleiotropic regulatory roles in a
diverse range of tissues; particularly in mediating the activity of endogenous androgens in the hypothalamus,
pituitary, liver, prostate, and testicular tissues. Endogenous androgens are important for target gene expression
in physiological processes like developmental differentiation of the male embryo, initiation and maintenance of
spermatogenesis and in neuroendocrine system functioning. Endogenous androgens also influence male
pubertal maturation and maintenance of secondary sexual characteristics in adults. Disruption of androgen
signaling following exposure to endocrine-mimicking environmental chemicals can result in hormonal cancers,
impaired reproductive development and infertility. This assay is designed to screen a large, structurally diverse
chemical library to identify compounds capable of interference with endogenous androgenic signaling by
monitoring the increase in luminescent signals relative to a known androgen receptor agonist
(Methyltrienolone) as a positive control, and indicator of androgenic activity. The Tox21 MDA-kb2 AR luciferase
assays are qHTS format assays which measured the ability of a chemical to interact with AR by monitoring
modulation of luminescent reporter gene signals. This assay utilized an epithelial breast cancer cell line which
expresses firefly luciferase under control of a MMTV promoter that contains androgen response elements to
quantify xenobiotic androgen receptor agonism. This assay is intended for use as a part of an integrated testing
strategy, to screen a large structurally diverse chemical library for compounds with the potential to interact with
androgen receptor mediated pathways and potentially affect endocrine systems in exposed populations. There
is strong evidence that androgen receptor agonism is a molecular initiating event (MIE) in an Adverse Outcome
Pathway (AOP) leading to reproductive dysfunction in fish populations (EAGMST Approved AOP), and there is
some evidence that androgen receptor activation is the MIE for a putative pathway leading to hepatocellular
adenomas and carcinomas (in mouse and rat models) (AOP currently under development). Chemical-activity
profiles derived from this assay can inform prioritization decisions for compound selection in more resource
intensive in vivo studies to further investigate the involvement of AR agonism in pathways leading to hazardous
outcomes in biological systems.

2.3	Experimental System: adherent MDA-kb2 cell line used. The MDA-kb2 AR-luc cell line was derived from
epithelial breast cancer cell line, MDA-MB-453 (originally obtained in 1976 from pleural effusion of metastatic
carcinoma from 48-yo Caucasian female) by stable transfection with a mouse mammary tumor virus (MMTV)
neomycin-resistant luciferase reporter gene construct. MDA-MB-453 cells have fibroblastic morphology and
were selected for transformation due to high levels of functional, endogenous androgen and glucocorticoid
receptors while estrogen receptor (ER) alpha and progesterone receptor (PR) mRNA are not detectable, and ER-
beta is apparently expressed at very low levels. This cell line expresses firefly luciferase under control of a MMTV
promoter that contains response elements for both GR and AR. MDA-kb2 may be used in an in vitro assay to
screen androgen agonist and antagonists and to characterize its specificity and sensitivity to endocrine
disrupting chemicals.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Metabolic activity
has been examined for the parental MDA-MB-453 cells. CYP1A1 and CYP1B1 have been shown to be inducible

-------
following TCDD exposures, with exposure to AhR agonists showing highly preferential induction of CYP1B1 as
opposed to CYP1A1.

2.5 Exposure Regime: Quality Control Precautions: Maintain cell culture below 85-90% confluence. Cell culturing
and assay culture doesn't require C02. Cell Thawing Method: Thaw a vial of cells in 9ml of pre-warmed
thaw/culture medium and then centrifuge. Resuspend the pellet with the thaw/culture medium and seed at 2
million cells per T-75 flask. Cell Proliferation Method: Trypsinize cells from the culturing flask and centrifuge
and then resuspend cells in culture medium. Passage cells at 6-7 million per T-225 flask. Assay Protocol:
Trypsinize cells from the culturing flask and centrifuge and then resuspend cells in culture/assay medium.
Dispense 3000 cells/5uL/well (for agonist mode) into 1536-well tissue treated white/solid bottom plates using
a 8 tip dispenser (Multidrop). Incubate the plates for 5hrs at 37C and 0% C02. Transfer 23nL of compounds from
the library collection (0.59nM to 92uM) and positive control. Incubate the plates for 16hrs at 37C and 0% C02.
Add 5ul of ONE-Glo(TM) Luciferase reagent using a single tip dispenser (BioRAPTR). Incubate the plates at room
temperature for 30min. Measure luminescence by ViewLux plate reader.

Baseline median absolute deviation for the assay (bmad): 1.01
Response cutoff threshold used to determine hit calls: 20

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Androgen receptor agonism and enhanced gene expression is measured by monitoring luminescent
production by the luciferase reporter gene under control of androgen response element promoters. The
cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability using by
CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast AR Pathway Model: Androgen receptor assays used in ToxCast AR Pathway model. See
10.1016/j.yrtph.2020.104764 and 10.1021/acs.chemrestox.6b00347

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

R1881

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

-------
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Increased luminescence (gain-of-signal) was measured relative to R1881 signal (positive control,
100 percent agonist activity), using DMSO (neutral control) as a baseline for luciferase induction. Response was
reported as a percent of positive control activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag

-------
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: Oihitc 0.9	NA hit count: hitc^O

501	9331	664

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	699

gain-loss (gnls) model:	743

power(pow) model:	905

linear-polynomial (polyl) model:	3274

quadratic-polynomial(poly2) model:	1081

exponential-2 (exp2) model:	377

exponential-3 (exp3) model:	137

exponential-4 (exp4) model:	2518

exponential-5 (exp5) model:	762

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3 Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In

-------
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.111

Neutral control median absolute deviation, by plate: nmad	3.344

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-2954.27%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	99.508

Positive control well median absolute deviation, by plate: pmad	7.912

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	11.459

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

-------
Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 762.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Kleinstreuer NC, Ceger P, Watt ED, Martin M, Houck K, Browne P, Thomas RS, Casey WM, Dix DJ,
Allen D, Sakamuru S, Xia M, Huang R, Judson R. Development and Validation of a Computational Model for
Androgen Receptor Activity. Chem Res Toxicol. 2017 Apr 17;30(4):946-964. doi:
10.1021/acs.chemrestox.6b00347. Epub 2016 Dec 9. PMID: 27933809; PMCID: PMC5396026.

7.

Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 765

T0X21_AR_LU C_M DAKB2_Antagon ist_10n M_R1881

1.	General Information

1.1	Assay Title: Tox21 MDa-kb2 Androgen Receptor (AR) Antagonism (lOnM R1881) Luciferase Assay

1.2	Assay Summary: TOX21 AR LUC MDAKB2 Antagonist lOnM R1881 is a cell-based, single-readout assay that
uses MDA-kb2, a human breast cell line, with measurements taken at 24 hours after chemical dosing in a 1536-
well plate. This is an secondary assay to TOX21_AR_LUC_MDAKB2_Agonist for antagonist specificity. See tox21-
ar-mda-kb2-luc-antagonist-pl. TOX21_AR_LUC_MDAKB2_Antagonist_10nM_R1881 is one of one assay
component(s) measured or calculated from theTOX21_AR_LUC_MDAKB2_Antagonist_10nM_R1881 assay. It is
designed to make measurements of luciferase induction, a form of inducible reporter, as detected with
bioluminescence signals by CellTiter-Glo Luciferase-coupled ATP quantitation technology. Data from the assay
component TOX21_AR_LUC_MDAKB2_Antagonist_10nM_R1881 was analyzed into 1 assay endpoint. This assay
endpoint, TOX21_AR_LUC_MDAKB2_Antagonist_10nM_R1881, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter,
loss-of-signal activity can be used to understand changes in the reporter gene as they relate to the gene AR.
Furthermore, this assay endpoint can be referred to as a primary readout, because the performed assay has
only produced 1 assay endpoint. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the nuclear receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected MDA-kb2 cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring luminescence resulting from AR
gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Androgen receptor (AR) is an important member of the nuclear receptor family. Its signaling plays
a critical role in AR-dependent prostate cancer and other androgen related diseases. Considerable attention has
been given in the past decades to develop methods to study and screen for the environmental chemicals that
have the potential to interfere with metabolic homeostasis, reproduction, developmental and behavioral
functions. Therefore AR binding assay for screening androgen antagonists can be used to identify potential
endocrine disruptors. Changes to bioluminescence signals produced from an enzymatic reaction involving the

-------
key substrate [One-Glo] are indicative of changes in transcriptional gene expression due to antagonist activity
regulated by the human androgen receptor [GeneSymbokAR | GenelD:367 |
Uniprot_SwissProt_Accession:P10275].

The Tox21 androgen receptor antagonism luciferase assay screened a large library of diverse environmental
compounds to probe for xenobiotic androgenic activity and potential to inhibit androgen-dependent
transcription, monitored through luciferase reporter gene signal activity using an AR-luciferase reporter gene
construct. The assay is run in triplicate on a 1536-well microplate and bioluminescence was measured following
15.5 hour incubation of cells with test compounds and 30 min incubation of test system with ONE-GloTM
luciferase assay reagent. The bioluminescent signal was monitored using a Promega ViewLux plate reader.
Following the incubation period, the cell culture was screened for bioluminescent signals in antagonist mode
using luciferase detection technology. Each compound was tested in a concentration-response format, using 15
concentrations ranging from 1.1 nM to 92 uM. To help distinguish true antagonistic activity from cytotoxic
effects, this assay was multiplexed with a fluorescence-based cell viability assay which measured conserved and
constitutive protease activity within live cells (Promega). Compound auto-fluorescence was monitored in
various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.
Concentration-response relationships were determined by monitoring luminescent signals relative to DMSO-
only exposures which provided a signal baseline, and to a known androgen receptor antagonist (Nilutamide) as
a positive control which provided a reference for 100 percent androgen receptor inhibition, as assessed in the
presence of 0.5 uM R1881, a known AR agonist.

2.2 Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Androgen receptors have pleiotropic regulatory roles in a
diverse range of tissues; particularly in mediating the activity of endogenous androgens in the hypothalamus,
pituitary, liver, prostate, and testicular tissues. Endogenous androgens are important for target gene expression
in physiological processes like developmental differentiation of the male embryo, initiation and maintenance of
spermatogenesis and in neuroendocrine system functioning. Endogenous androgens also influence male
pubertal maturation and maintenance of secondary sexual characteristics in adults. Disruption of androgen
signaling following exposure to endocrine-mimicking environmental chemicals can result in hormonal cancers,
impaired reproductive development and infertility. This assay is designed to screen a large, structurally diverse
chemical library to identify compounds capable of interference with endogenous androgenic signaling by
monitoring the increase in luminescent signals relative to a known androgen receptor agonist
(Methyltrienolone) as a positive control, and indicator of androgenic activity. The Tox21 MDA-kb2 AR luciferase
assays are qHTS format assays which measured the ability of a chemical to interact with AR by monitoring
modulation of luminescent reporter gene signals. This assay utilized an epithelial breast cancer cell line which
expresses firefly luciferase under control of a MMTV promoter that contains androgen response elements to
quantify xenobiotic androgen receptor agonism. This assay is intended for use as a part of an integrated testing
strategy, to screen a large structurally diverse chemical library for compounds with the potential to interact with
androgen receptor mediated pathways and potentially affect endocrine systems in exposed populations. There
is strong evidence that androgen receptor agonism is a molecular initiating event (MIE) in an Adverse Outcome
Pathway (AOP) leading to reproductive dysfunction in fish populations (EAGMST Approved AOP), and there is
some evidence that androgen receptor activation is the MIE for a putative pathway leading to hepatocellular
adenomas and carcinomas (in mouse and rat models) (AOP currently under development). Chemical-activity
profiles derived from this assay can inform prioritization decisions for compound selection in more resource
intensive in vivo studies to further investigate the involvement of AR agonism in pathways leading to hazardous
outcomes in biological systems.

-------
2.3	Experimental System: adherent MDA-kb2 cell line used. The MDA-kb2 AR-luc cell line was derived from
epithelial breast cancer cell line, MDA-MB-453 (originally obtained in 1976 from pleural effusion of metastatic
carcinoma from 48-yo Caucasian female) by stable transfection with a mouse mammary tumor virus (MMTV)
neomycin-resistant luciferase reporter gene construct. MDA-MB-453 cells have fibroblastic morphology and
were selected for transformation due to high levels of functional, endogenous androgen and glucocorticoid
receptors (while estrogen receptor (ER) alpha and progesterone receptor (PR) mRNA are not detectable, and
ER-beta is apparently expressed at very low levels). This cell line expresses firefly luciferase under control of a
MMTV promoter that contains response elements for both GR and AR. MDA-kb2 may be used in an in vitro
assay to screen androgen agonist and antagonists and to characterize its specificity and sensitivity to endocrine
disrupting chemicals.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Metabolic activity
has been examined for the parental MDA-MB-453 cells. CYP1A1 and CYP1B1 have been shown to be inducible
following TCDD exposures, with exposure to AhR agonists showing highly preferential induction of CYP1B1 as
opposed to CYP1A1.

2.5	Exposure Regime: Quality Control Precautions: Maintain cell culture below 85-90% confluence. Cell culturing
and assay culture doesn't require C02. Cell Thawing Method: Thaw a vial of cells in 9ml of pre-warmed
thaw/culture medium and then centrifuge. Resuspend the pellet with the thaw/culture medium and seed at 2
million cells per T-75 flask. Cell Proliferation Method: Trypsinize cells from the culturing flask and centrifuge
and then resuspend cells in culture medium. Passage cells at 6-7 million per T-225 flask. Assay Protocol:
Trypsinize cells from the culturing flask and centrifuge and then resuspend cells in culture/assay medium.
Dispense 3000 cells/5uL/well (for agonist mode) into 1536-well tissue treated white/solid bottom plates using
a 8 tip dispenser (Multidrop). Incubate the plates for 5hrs at 37C and 0% C02. Transfer 23nL of compounds from
the library collection (0.59nM to 92uM) and positive control. Incubate the plates for 15.30hrs at 37C and 0%
C02. Add lul of CellTiter-Fluor (TM) Cell Viability Assay reagent using a single tip dispenser (BioRAPTR). Incubate
the plates at room temperature or 37C for 30min. Measure fluorescence by ViewLux plate reader. Then add 4ul
of ONE-Glo(TM) Luciferase reagent using a single tip dispenser (BioRAPTR). Incubate the plates at room
temperature for 30min. Measure luminescence by ViewLux plate reader.

Baseline median absolute deviation for the assay (bmad): 4.981
Response cutoff threshold used to determine hit calls: 29.884

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Androgen receptor antagonism and inhibited gene expression is measured by monitoring
luminescent production by the luciferase reporter gene under control of androgen response element
promoters. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

Nilutamide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

-------
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast AR Pathway Model: Androgen receptor assays used in ToxCast AR Pathway model. See
10.1016/j.yrtph.2020.104764 and 10.1021/acs.chemrestox.6b00347

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: In cultures stimulated with a known agonist (lOnM R1881), decreased luminescence (loss-of-
signal) relative to nilutamide signal (positive control, 100 percent antagonist activity), using DMSO (neutral
control) as a signal baseline as a baseline for luciferase induction. Response was reported as a percent of positive
control activity. This is an secondary assay to TOX21_AR_LUC_MDAKB2_Agonist to confirm antagonist
specificity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall

-------
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1196

Inactive hit count: Oihitc 0.9
5884

WINING MODEL SELECTION

NA hit count: hitc^O
3416

Number of sample-assay endpoints with winning hill model:

641
531

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

828

3222

quadratic-polynomialfpoly2) model:	1155

exponential-2 (exp2) model:

504

-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

73

2719

823

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-0.247

Neutral control median absolute deviation, by plate: nmad

8.706

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-3835.22%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

-100
2.403

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-10.927

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 823.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Kleinstreuer NC, Ceger P, Watt ED, Martin M, Houck K, Browne P, Thomas RS, Casey WM, Dix DJ,
Allen D, Sakamuru S, Xia M, Huang R, Judson R. Development and Validation of a Computational Model for
Androgen Receptor Activity. Chem Res Toxicol. 2017 Apr 17;30(4):946-964. doi:
10.1021/acs.chemrestox.6b00347. Epub 2016 Dec 9. PMID: 27933809; PMCID: PMC5396026.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 766

TOX21_AR_LUC_MDAKB2_Antagonist_10nM_R1881_viability

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 MDa-kb2 Androgen Receptor (AR) Antagonism (lOnM R1881)
Luciferase Assay

1.2	Assay Summary: TOX21 AR LUC MDAKB2 Antagonist lOnM R1881 is a cell-based, single-readout assay that
uses MDA-kb2, a human breast cell line, with measurements taken at 24 hours after chemical dosing in a 1536-
well plate. This is an secondary assay to TOX21_AR_LUC_MDAKB2_Agonist for antagonist specificity. See tox21-
ar-mda-kb2-luc-antagonist-pl. TOX21_AR_LUC_MDAKB2_Antagonist_10nM_R1881_viability is an assay
readout measuring cellular ATP content and detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
TOX21_AR_LUC_MDAKB2_Antagonist_10nM_R1881_viability used a type of viability reporter where loss-of-
signal activity can be used to understand changes in the cell viability. Furthermore, this assay endpoint can be
referred to as a secondary readout, because this assay has produced multiple assay endpoints where this one
serves a viability function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected MDA-kb2 cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring luminescence resulting from AR
gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 androgen receptor antagonism luciferase assay screened a large library of diverse environmental
compounds to probe for xenobiotic androgenic activity and potential to inhibit androgen-dependent
transcription, monitored through luciferase reporter gene signal activity using an AR-luciferase reporter gene
construct. The assay is run in triplicate on a 1536-well microplate and bioluminescence was measured following
15.5 hour incubation of cells with test compounds and 30 min incubation of test system with ONE-GloTM

-------
luciferase assay reagent. The bioluminescent signal was monitored using a Promega ViewLux plate reader.
Following the incubation period, the cell culture was screened for bioluminescent signals in antagonist mode
using luciferase detection technology. Each compound was tested in a concentration-response format, using 15
concentrations ranging from 1.1 nM to 92 uM. To help distinguish true antagonistic activity from cytotoxic
effects, this assay was multiplexed with a fluorescence-based cell viability assay which measured conserved and
constitutive protease activity within live cells (Promega). Compound auto-fluorescence was monitored in
various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.
Concentration-response relationships were determined by monitoring luminescent signals relative to DMSO-
only exposures which provided a signal baseline, and to a known androgen receptor antagonist (Nilutamide) as
a positive control which provided a reference for 100 percent androgen receptor inhibition, as assessed in the
presence of 0.5 uM R1881, a known AR agonist.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Androgen receptors have pleiotropic regulatory roles in a
diverse range of tissues; particularly in mediating the activity of endogenous androgens in the hypothalamus,
pituitary, liver, prostate, and testicular tissues. Endogenous androgens are important for target gene expression
in physiological processes like developmental differentiation of the male embryo, initiation and maintenance of
spermatogenesis and in neuroendocrine system functioning. Endogenous androgens also influence male
pubertal maturation and maintenance of secondary sexual characteristics in adults. Disruption of androgen
signaling following exposure to endocrine-mimicking environmental chemicals can result in hormonal cancers,
impaired reproductive development and infertility. This assay is designed to screen a large, structurally diverse
chemical library to identify compounds capable of interference with endogenous androgenic signaling by
monitoring the increase in luminescent signals relative to a known androgen receptor agonist
(Methyltrienolone) as a positive control, and indicator of androgenic activity. The Tox21 MDA-kb2 AR luciferase
assays are qHTS format assays which measured the ability of a chemical to interact with AR by monitoring
modulation of luminescent reporter gene signals. This assay utilized an epithelial breast cancer cell line which
expresses firefly luciferase under control of a MMTV promoter that contains androgen response elements to
quantify xenobiotic androgen receptor agonism. This assay is intended for use as a part of an integrated testing
strategy, to screen a large structurally diverse chemical library for compounds with the potential to interact with
androgen receptor mediated pathways and potentially affect endocrine systems in exposed populations. There
is strong evidence that androgen receptor agonism is a molecular initiating event (MIE) in an Adverse Outcome
Pathway (AOP) leading to reproductive dysfunction in fish populations (EAGMST Approved AOP), and there is
some evidence that androgen receptor activation is the MIE for a putative pathway leading to hepatocellular
adenomas and carcinomas (in mouse and rat models) (AOP currently under development). Chemical-activity
profiles derived from this assay can inform prioritization decisions for compound selection in more resource
intensive in vivo studies to further investigate the involvement of AR agonism in pathways leading to hazardous
outcomes in biological systems.

2.3	Experimental System: adherent MDA-kb2 cell line used. The MDA-kb2 AR-luc cell line was derived from
epithelial breast cancer cell line, MDA-MB-453 (originally obtained in 1976 from pleural effusion of metastatic
carcinoma from 48-yo Caucasian female) by stable transfection with a mouse mammary tumor virus (MMTV)
neomycin-resistant luciferase reporter gene construct. MDA-MB-453 cells have fibroblastic morphology and
were selected for transformation due to high levels of functional, endogenous androgen and glucocorticoid
receptors (while estrogen receptor (ER) alpha and progesterone receptor (PR) mRNA are not detectable, and
ER-beta is apparently expressed at very low levels). This cell line expresses firefly luciferase under control of a
MMTV promoter that contains response elements for both GR and AR. MDA-kb2 may be used in an in vitro

-------
assay to screen androgen agonist and antagonists and to characterize its specificity and sensitivity to endocrine
disrupting chemicals.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Metabolic activity
has been examined for the parental MDA-MB-453 cells. CYP1A1 and CYP1B1 have been shown to be inducible
following TCDD exposures, with exposure to AhR agonists showing highly preferential induction of CYP1B1 as
opposed to CYP1A1.

2.5	Exposure Regime: Quality Control Precautions: Maintain cell culture below 85-90% confluence. Cell culturing
and assay culture doesn't require C02. Cell Thawing Method: Thaw a vial of cells in 9ml of pre-warmed
thaw/culture medium and then centrifuge. Resuspend the pellet with the thaw/culture medium and seed at 2
million cells per T-75 flask. Cell Proliferation Method: Trypsinize cells from the culturing flask and centrifuge
and then resuspend cells in culture medium. Passage cells at 6-7 million per T-225 flask. Assay Protocol:
Trypsinize cells from the culturing flask and centrifuge and then resuspend cells in culture/assay medium.
Dispense 3000 cells/5uL/well (for agonist mode) into 1536-well tissue treated white/solid bottom plates using
a 8 tip dispenser (Multidrop). Incubate the plates for 5hrs at 37C and 0% C02. Transfer 23nL of compounds from
the library collection (0.59nM to 92uM) and positive control. Incubate the plates for 15.30hrs at 37C and 0%
C02. Add lul of CellTiter-Fluor (TM) Cell Viability Assay reagent using a single tip dispenser (BioRAPTR). Incubate
the plates at room temperature or 37C for 30min. Measure fluorescence by ViewLux plate reader. Then add 4ul
of ONE-Glo(TM) Luciferase reagent using a single tip dispenser (BioRAPTR). Incubate the plates at room
temperature for 30min. Measure luminescence by ViewLux plate reader.

Baseline median absolute deviation for the assay (bmad): 3
Response cutoff threshold used to determine hit calls: 20

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Androgen receptor antagonism and inhibited gene expression is measured by monitoring
luminescent production by the luciferase reporter gene under control of androgen response element
promoters. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

tetraoctylammonium bromide

I arget (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

-------
Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region, ToxCast AR Pathway Model: Androgen
receptor assays used in ToxCast AR Pathway model. See 10.1016/j.yrtph.2020.104764 and
10.1021/acs.chemrestox.6b00347

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning

-------
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
978

Inaclive hit count: 0
-------
For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-0.178

Neutral control median absolute deviation, by plate: nmad

7.012

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-3933.72%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

NA

NA

Z Prime Factor for median positive and neutral control across all plates:

NA

-------
(1 - ((3 * (pmad + nmad)) / abs(pmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 682.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

• Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

-------
•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Kleinstreuer NC, Ceger P, Watt ED, Martin M, Houck K, Browne P, Thomas RS, Casey WM, Dix DJ,
Allen D, Sakamuru S, Xia M, Huang R, Judson R. Development and Validation of a Computational Model for
Androgen Receptor Activity. Chem Res Toxicol. 2017 Apr 17;30(4):946-964. doi:
10.1021/acs.chemrestox.6b00347. Epub 2016 Dec 9. PMID: 27933809; PMCID: PMC5396026.

7.	Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 781

T0X21_ELG 1_LU C_Agon ist

1.	General Information

1.1	Assay Title: Tox21 HEK293T Enhanced Level of Genome Instability Gene 1 (ELG1/ATAD5) Agonism Luciferase
Assay

1.2	Assay Summary: TOX21 ELG1 LUC Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-
elgl-luc-agonist-pl. TOX21_ELGl_LUC_Agonist is one of one assay component(s) measured or calculated from
the TOX21_ELGl_LUC_Agonist assay. It is designed to make measurements of luciferase induction, a form of
inducible reporter, as detected with bioluminescence signals by CellTiter-Glo Luciferase-coupled ATP
quantitation technology. Data from the assay component TOX21_ELGl_LUC_Agonist was analyzed into 1 assay
endpoint. This assay endpoint, TOX21_ELGl_LUC_Agonist, was analyzed in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene ATAD5.
Furthermore, this assay endpoint can be referred to as a primary readout, because the performed assay has
only produced 1 assay endpoint. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the hydrolase intended target family, where the subfamily is atpase.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic ATAD5 gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion
AOP: Adverse Outcome Pathway
CV: Coefficient of Variation
DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction involving the key
substrate [One-Glo] are indicative of changes in transcriptional gene expression due to agonist activity regulated
by the human ATPase family, AAA domain containing 5 [GeneSymbol:ATAD5 | GenelD:79915 |
Uniprot_SwissProt_Accession:Q96QE3],

ToxCast: US EPA's Toxicity Forecaster Program
tcpl: ToxCast Data Analysis Pipeline R Package
SSMD: Strictly Standardized Mean Difference

The Tox21 HEK293T enhanced level of genome instability gene 1 (ELG1/ATAD5) agonism luciferase assay
screened a library of diverse environmental compounds to find the compounds that effectively block DNA

-------
replication either by directly damaging DNA or by inhibiting other cellular mechanisms. Using a luciferase
reporter-gene tagged with ATAD5, a cell-based assay (developed by Kyungjae Myung, NHGRI, NIH) was used to
measure the induction of ATAD5 in human embryonic kidney cells (HEK293T). The assay is run in triplicate on
1536-well microplate and bioluminescence was measured following 16 hour incubation of cells with test
compounds and 30 min incubation of test system with ONE-GloTM luciferase assay reagent. The bioluminescent
signal was monitored using a Promega ViewLux plate reader to measure agonistic activity, this assay is evaluated
against a known ATAD5 agonist (5-Fluorouridine) as a positive control (100 percent inhibition). Test compounds
were assayed for cytotoxicity by measuring protease activity with Promega CellTiter-Fluor with
tetraoctylammonium bromide as a positive control for cell death. Compound auto-fluorescence was monitored
in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency As cancer cells divide rapidly and during every cell division
they need to duplicate their genome by DNA replication. The failure to do so results in the cancer cell death.
Based on this concept, many chemotherapeutic agents were developed but have limitations such as low efficacy
and severe side effects etc. A new cell based assay was developed to find the compounds that effectively block
DNA replication either by directly damaging DNA or by inhibiting other cellular mechanisms. Enhanced Level of
Genome Instability Gene 1 (ELG1; human ATAD5) protein levels increase in response to various types of DNA
damage. Using a luciferase reporter-gene tagged with ATAD5, a cell-based assay [developed by Kyungjae Myung,
NHGRI, NIH] was used to measure the induction of ATAD5 in human embryonic kidney cells (HEK293T). So,
increase in luciferase activity can be used to identify the compounds that cause genetic stress.

2.3	Experimental System: adherent HEK293T cell line used. The HEK293T ATAD5-luc cell line was derived from was
derived from the human embryonic kidney cell line, HEK-293, by stable transfection with a luciferase reporter
gene construct. The HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed
with sheared adenovirus 5 DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated
approximately 4.5 kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent
cytogenetic characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells
are popular for their ease of growth and transfection cells and are frequently used to produce exogenous
proteins or viruses for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Maintain cells below 85-90% confluence. Feed or passage cells
twice a week. Thawing method: Thaw a vial of cells in 9ml of pre-warmed medium and seed them in T75 flask
at 2 million cells. Cell Proliferation Method: Trypsinize cells from the flask and centrifuge. Add culture medium
to the pellet and passage at 3-4 million per T-225 flask. Assay Protocol: Harvest and resuspend cells in
culture/assay medium. Dispense 2000 cells/5uL/well (for agonist mode) into 1536-well tissue treated
white/solid bottom plates. Incubate the plates for 5hrs at 37C and 5% C02. Transfer 23nL of compounds from
the library collection and positive control to the assay plates through Pintool. Incubate the plates for 15.5hrs at
37C and 5% C02. Add lul of CellTiter-Fluor (TM) Cell Viability reagent (5uL of AFC substrate added in 5ml of
Buffer). Incubate the plates at room temperature for 30min. Measure fluorescence by ViewLux plate reader.
Add 4ul of Amplite Luciferase (Bright Glow) reagent. Incubate the plates at room temperature for 30min.
Measure luminescence by ViewLux plate reader.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:

Target (nominal) number of replicates:

-------
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

5-Fluorouridine

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.813
Response cutoff threshold used to determine hit calls: 20

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Agonism of the enhanced level of genome instability gene 1 (ELG1/ATAD5) signaling pathway is
measured by bioluminescence activity via an estrogen-related-receptor firefly luciferase reporter gene. Increase
luciferase activity can be used to identify the compounds that may result in induction of ATAD5. The cytotoxicity
of the compounds screened was tested in parallel by measuring the cell viability using CellTiter-Fluor assay in
the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of hydrolase.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Increased luminescence (gain-of-signal) was measured relative to 5-Fluorouridine (positive
control) signal, using DMSO (neutral control) as a baseline for luciferase induction. Response was reported as a
percent of positive control activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

NA

-------
Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496

Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
154

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.086

Neutral control median absolute deviation, by plate: nmad	3.322

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3648.14%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	99.693

Positive control well median absolute deviation, by plate: pmad	7.171

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	12.513

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 665.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 782

TOX21_ELGl_LUC_Agonist_viability

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HEK293T Enhanced Level of Genome Instability Gene 1
(ELG1/ATAD5) Agonism Luciferase Assay

1.2	Assay Summary: TOX21 ELG1 LUC Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-
elgl-luc-agonist-pl. TOX21_ELGl_LUC_Agonist_viability is an assay readout measuring cellular ATP content and
detected with CellTiter-Glo Luciferase-coupled ATP quantitation. TOX21_ELGl_LUC_Agonist_viability used a
type of viability reporter where loss-of-signal activity can be used to understand changes in the cell viability.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a viability function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily
is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic ATAD5 gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 HEK293T enhanced level of genome instability gene 1 (ELG1/ATAD5) agonism luciferase assay
screened a library of diverse environmental compounds to find the compounds that effectively block DNA
replication either by directly damaging DNA or by inhibiting other cellular mechanisms. Using a luciferase
reporter-gene tagged with ATAD5, a cell-based assay (developed by Kyungjae Myung, NHGRI, NIH) was used to
measure the induction of ATAD5 in human embryonic kidney cells (HEK293T). The assay is run in triplicate on
1536-well microplate and bioluminescence was measured following 16 hour incubation of cells with test

-------
compounds and 30 min incubation of test system with ONE-GloTM luciferase assay reagent. The bioluminescent
signal was monitored using a Promega ViewLux plate reader to measure agonistic activity, this assay is evaluated
against a known ATAD5 agonist (5-Fluorouridine) as a positive control (100 percent inhibition). Test compounds
were assayed for cytotoxicity by measuring protease activity with Promega CellTiter-Fluor with
tetraoctylammonium bromide as a positive control for cell death. Compound auto-fluorescence was monitored
in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency As cancer cells divide rapidly and during every cell division
they need to duplicate their genome by DNA replication. The failure to do so results in the cancer cell death.
Based on this concept, many chemotherapeutic agents were developed but have limitations such as low efficacy
and severe side effects etc. A new cell based assay was developed to find the compounds that effectively block
DNA replication either by directly damaging DNA or by inhibiting other cellular mechanisms. Enhanced Level of
Genome Instability Gene 1 (ELG1; human ATAD5) protein levels increase in response to various types of DNA
damage. Using a luciferase reporter-gene tagged with ATAD5, a cell-based assay [developed by Kyungjae Myung,
NHGRI, NIH] was used to measure the induction of ATAD5 in human embryonic kidney cells (HEK293T). So,
increase in luciferase activity can be used to identify the compounds that cause genetic stress.

2.3	Experimental System: adherent HEK293T cell line used. The HEK293T ATAD5-luc cell line was derived from was
derived from the human embryonic kidney cell line, HEK-293, by stable transfection with a luciferase reporter
gene construct. The HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed
with sheared adenovirus 5 DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated
approximately 4.5 kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent
cytogenetic characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells
are popular for their ease of growth and transfection cells and are frequently used to produce exogenous
proteins or viruses for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Maintain cells below 85-90% confluence. Feed or passage cells
twice a week. Thawing method: Thaw a vial of cells in 9ml of pre-warmed medium and seed them in T75 flask
at 2 million cells. Cell Proliferation Method: Trypsinize cells from the flask and centrifuge. Add culture medium
to the pellet and passage at 3-4 million per T-225 flask. Assay Protocol: Harvest and resuspend cells in
culture/assay medium. Dispense 2000 cells/5uL/well (for agonist mode) into 1536-well tissue treated
white/solid bottom plates. Incubate the plates for 5hrs at 37C and 5% C02. Transfer 23nL of compounds from
the library collection and positive control to the assay plates through Pintool. Incubate the plates for 15.5hrs at
37C and 5% C02. Add lul of CellTiter-Fluor (TM) Cell Viability reagent (5uL of AFC substrate added in 5ml of
Buffer). Incubate the plates at room temperature for 30min. Measure fluorescence by ViewLux plate reader.
Add 4ul of Amplite Luciferase (Bright Glow) reagent. Incubate the plates at room temperature for 30min.
Measure luminescence by ViewLux plate reader.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

-------
tetraoctylammonium bromide	DMSO

Baseline median absolute deviation for the assay (bmad): 5.902
Response cutoff threshold used to determine hit calls: 35.413

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Agonism of the enhanced level of genome instability gene 1 (ELG1/ATAD5) signaling pathway is
measured by bioluminescence activity via an estrogen-related-receptor firefly luciferase reporter gene. Increase
luciferase activity can be used to identify the compounds that may result in induction of ATAD5. The cytotoxicity
of the compounds screened was tested in parallel by measuring the cell viability using CellTiter-Fluor assay in
the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

-------
1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496

Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
509

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.222

Neutral control median absolute deviation, by plate: nmad	8.199

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3541.01%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 683.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 783

TOX2 l_ERa_BLA_Agon ist_ch 1

1.	General Information

1.1	Assay Title: Tox21 HEK293T Estrogen Receptor-alpha (ESR1) Agonism Beta-lactamase Assay, Channel 1 Readout
of Uncleaved Substrate

1.2	Assay Summary: TOX21_ERa_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-er-
bla-agonist-p2. TOX21_ERa_BLA_Agonist_chl is an assay readout measuring reporter gene via receptor activity
and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter
gene. The signal is derived from the uncleaved reporter gene substrate and is used to calculate the ratio of
cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay
component TOX21_ERa_BLA_Agonist_chl was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_ERa_BLA_Agonist_chl, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene ESR1. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic ERalpha gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ERa_BLA_Agonist_chl was designed to measure uncleaved reporter gene substrate to
target nuclear receptor activity at the protein (receptor) level, specifically mapping to ESR1 gene(s) using a
positive control of 17b-estradiol

The Tox21 estrogen receptor-alpha agonism beta-lactamase assay screened a library of diverse environmental
compounds to probe for xenoestrogenic ligand-binding and potential to induce estrogen-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4

-------
system. The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with test
compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer ER alpha-UAS-bla GripTite cells contain the
ligand-binding domain of the human estrogen receptor alpha (ERalpha) fused to the DNA-binding domain of
GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Cell Thawing Method: 1ml frozen cells of ERalpha-bla were taken in pre-warmed 10ml of thaw
medium for centrifuging. 3ml of the thaw medium is taken to resuspend the pellet. The cells were seeded in T-
75 flask at 2 million cells. Cell Proliferation Method: The cells are detached using 0.05 percent Trypsin. Cells are
further passaged at a density of 4-5 million cells per T-225 flask. Assay Protocol: Rinse the cells twice with DPBS
and detach them using 0.05 percent Trypsin and centrifuge. Resuspend the pellet with assay buffer. Plate the
cells in black-clear bottom 1536 well plate at 5000/well/5uL through 8 tip plate dispenser (Multi drop). Incubate
at 37Cfor 5hrs. Transfer 23nLof the compounds from the library collection and positive control through Pintool.
Incubate at 37C for 18hrsAdd 1. uL of CCF4 dye using a single tip of a plate dispenser (BioRAPTR). Incubate at
room temperature for 2hrs. Read the fluorescence intensity through Envision plate reader ER-alpha-bla cells
were cultured in assay medium containing 2 percent charcoal stripped FBS overnight in the culture flasks before
the assay was performed.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

17b-Estradiol

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 2.925

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

-------
2.6	Response: Estrogen receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
17b-Estradiol was used as a positive ERa agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496

Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
478

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.166

Neutral control median absolute deviation, by plate: nmad	5.512

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3071.31%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-33.751

Positive control well median absolute deviation, by plate: pmad	4.546

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-4.814

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 637.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Huang R, Sakamuru S, Martin MT, Reif DM,
Judson RS, Houck KA, Casey W, Hsieh JH, Shockley KR, Ceger P, Fostel J, Witt KL, Tong W, Rotroff DM, Zhao T,
Shinn P, Simeonov A, Dix DJ, Austin CP, Kavlock RJ, Tice RR, Xia M. Profiling of the Tox2110K compound library
for agonists and antagonists of the estrogen receptor alpha signaling pathway. Sci Rep. 2014 Jul 11;4:5664. doi:
10.1038/srep05664. PMID: 25012808; PMCID: PMC4092345.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 784

TOX21_ERa_BLA_Agonist_ch2

1.	General Information

1.1	Assay Title: Tox21 HEK293T Estrogen Receptor-alpha (ESR1) Agonism Beta-lactamase Assay, Channel 2 Readout
of Cleaved Substrate

1.2	Assay Summary: TOX21_ERa_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-er-
bla-agonist-p2. TOX21_ERa_BLA_Agonist_ch2 is an assay readout measuring reporter gene via receptor activity
and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter
gene. The signal is derived from the cleaved reporter gene substrate and is used to calculate the ratio of cleaved
(ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay component
TOX21_ERa_BLA_Agonist_ch2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_ERa_BLA_Agonist_ch2, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, increased activity can be used to
understand changes in the reporter gene as they relate to the gene ESR1. Furthermore, this assay endpoint can
be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic ERalpha gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ERa_BLA_Agonist_ch2 was designed to measure cleaved reporter gene substrate to target
nuclear receptor (steroidal) activity at the protein (receptor) level, specifically mapping to ESR1 gene(s) using a
positive control of 17b-estradiol

The Tox21 estrogen receptor-alpha agonism beta-lactamase assay screened a library of diverse environmental
compounds to probe for xenoestrogenic ligand-binding and potential to induce estrogen-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4

-------
system. The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with test
compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer ER alpha-UAS-bla GripTite cells contain the
ligand-binding domain of the human estrogen receptor alpha (ERalpha) fused to the DNA-binding domain of
GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Cell Thawing Method: 1ml frozen cells of ERalpha-bla were taken in pre-warmed 10ml of thaw
medium for centrifuging. 3ml of the thaw medium is taken to resuspend the pellet. The cells were seeded in T-
75 flask at 2 million cells. Cell Proliferation Method: The cells are detached using 0.05 percent Trypsin. Cells are
further passaged at a density of 4-5 million cells per T-225 flask. Assay Protocol: Rinse the cells twice with DPBS
and detach them using 0.05 percent Trypsin and centrifuge. Resuspend the pellet with assay buffer. Plate the
cells in black-clear bottom 1536 well plate at 5000/well/5uL through 8 tip plate dispenser (Multi drop). Incubate
at 37Cfor 5hrs. Transfer 23nLof the compounds from the library collection and positive control through Pintool.
Incubate at 37C for 18hrsAdd 1. uL of CCF4 dye using a single tip of a plate dispenser (BioRAPTR). Incubate at
room temperature for 2hrs. Read the fluorescence intensity through Envision plate reader ER-alpha-bla cells
were cultured in assay medium containing 2 percent charcoal stripped FBS overnight in the culture flasks before
the assay was performed.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

17b-Estradiol

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.816

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

-------
2.6	Response: Estrogen receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
17b-Estradiol was used as a positive ERa agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496

Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
561

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.048

Neutral control median absolute deviation, by plate: nmad	1.582

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3207.74%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	99.56

Positive control well median absolute deviation, by plate: pmad	14.632

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	6.747

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 590.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Huang R, Sakamuru S, Martin MT, Reif DM,
Judson RS, Houck KA, Casey W, Hsieh JH, Shockley KR, Ceger P, Fostel J, Witt KL, Tong W, Rotroff DM, Zhao T,
Shinn P, Simeonov A, Dix DJ, Austin CP, Kavlock RJ, Tice RR, Xia M. Profiling of the Tox2110K compound library
for agonists and antagonists of the estrogen receptor alpha signaling pathway. Sci Rep. 2014 Jul 11;4:5664. doi:
10.1038/srep05664. PMID: 25012808; PMCID: PMC4092345.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 785

TOX21_ERa_BLA_Agonist_ratio

1.	General Information

1.1	Assay Title: Tox21 HEK293T Estrogen Receptor-alpha (ESR1) Agonism Beta-lactamase Assay, Ratio

1.2	Assay Summary: TOX21_ERa_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-er-
bla-agonist-p2. TOX21_ERa_BLA_Agonist_ratio is an assay readout measuring reporter gene via receptor activity
and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter
gene. The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of
target activity. Data from the assay component TOX21_ERa_BLA_Agonist_ratio was analyzed into 1 assay
endpoint. This assay endpoint, TOX21_ERa_BLA_Agonist_ratio, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter,
gain-of-signal activity can be used to understand changes in the reporter gene as they relate to the gene ESR1.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints and this ratio serves a reporter gene function to understand target activity. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the nuclear
receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic ERalpha gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ERa_BLA_Agonist_ratio was designed to target nuclear receptor activity at the protein
(receptor) level, specifically mapping to ESR1 gene(s) using a positive control of 17b-estradiol

The Tox21 estrogen receptor-alpha agonism beta-lactamase assay screened a library of diverse environmental
compounds to probe for xenoestrogenic ligand-binding and potential to induce estrogen-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with test

-------
compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Estrogen receptor (ER), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. Endocrine disrupting chemicals
(EDCs) and their interactions with steroid hormone receptors like ER causes disruption of normal endocrine
function. Therefore, it is important to understand the effect of environmental chemicals on the ER signaling
pathway. To identify ER agonists, GeneBLAzer ERalpha-UAS-bla GripTite cell line (Invitrogen) has been used to
screen the Tox21 library of diverse environmental compounds. ERalpha-UAS-bla cell line expresses a partial
ERalpha one-hybrid GAL4 system and is stably transfected with a -beta-lactamase reporter gene. The Tox21
ERalpha bla assays are qHTS format assays which measured the ability of a chemical to interact with estrogen
receptor alpha (ERalpha) by monitoring modulation of fluorescence reporter gene signals. This assay utilized a
human embryonic kidney cell line (HEK293T) which expresses a partial ERalpha and a one-hybrid GAL4 system
to quantify xenoestrogenic agonism. This assay is intended for use as a part of an integrated testing strategy, to
screen a large structurally diverse chemical library for compounds with the potential to interact with estrogen
receptor alpha mediated pathways and potentially affect endocrine systems in exposed populations. There is
strong evidence that estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse Outcome
Pathway (AOP) leading to reproductive dysfunction in oviparous vertebrates, and there is some evidence that
estrogen receptor activation is the MIE for putative AOPs leading to reduced survival due to renal failure and
leading to skewed sex ratios due to altered sexual differentiation in males (all AOPs currently under
development). Chemical-activity profiles derived from this assay can inform prioritization decisions for
compound selection in more resource intensive in vivo studies to further investigate the involvement of ER
agonism in pathways leading to hazardous outcomes in biological systems.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer ER alpha-UAS-bla GripTite cells contain the
ligand-binding domain of the human estrogen receptor alpha (ERalpha) fused to the DNA-binding domain of
GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

-------
2.5 Exposure Regime: Cell Thawing Method: 1ml frozen cells of ERalpha-bla were taken in pre-warmed 10ml of thaw
medium for centrifuging. 3ml of the thaw medium is taken to resuspend the pellet. The cells were seeded in T-
75 flask at 2 million cells. Cell Proliferation Method: The cells are detached using 0.05 percent Trypsin. Cells are
further passaged at a density of 4-5 million cells per T-225 flask. Assay Protocol: Rinse the cells twice with DPBS
and detach them using 0.05 percent Trypsin and centrifuge. Resuspend the pellet with assay buffer. Plate the
cells in black-clear bottom 1536 well plate at 5000/well/5uL through 8 tip plate dispenser (Multi drop). Incubate
at 37Cfor 5hrs. Transfer 23nLof the compounds from the library collection and positive control through Pintool.
Incubate at 37C for 18hrsAdd 1. uL of CCF4 dye using a single tip of a plate dispenser (BioRAPTR). Incubate at
room temperature for 2hrs. Read the fluorescence intensity through Envision plate reader ER-alpha-bla cells
were cultured in assay medium containing 2 percent charcoal stripped FBS overnight in the culture flasks before
the assay was performed.

Baseline median absolute deviation for the assay (bmad): 0.726

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Estrogen receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast ER Pathway Model: Estrogen receptor assays used in ToxCast ER Pathway model

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1 Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

17b-Estradiol

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

-------
3.2 Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
17b-Estradiol was used as a positive ERa agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active

-------
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: Oihitc 0.9	NA hit count: hitc^O

594	9832	70

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	462

gain-loss (gnls) model:	485

power(pow) model:	840

linear-polynomial (polyl) model:	4144

quadratic-polynomial(poly2) model:	944

exponential-2 (exp2) model:	487

exponential-3 (exp3) model:	74

exponential-4 (exp4) model:	2576

exponential-5 (exp5) model:	484

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

-------
3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-0.035

Neutral control median absolute deviation, by plate: nmad

1.123

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-2767.52%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

99.177

20.813

Z Prime Factor for median positive and neutral control across all plates:

NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

4.752

((pmed - nmed) /sqrt(pmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)

NA

Positive control signal-to-background: (pmed/nmed)

NA

-------
NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 484.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

-------
6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Huang R, Sakamuru S, Martin MT, Reif DM,
Judson RS, Houck KA, Casey W, Hsieh JH, Shockley KR, Ceger P, Fostel J, Witt KL, Tong W, Rotroff DM, Zhao T,
Shinn P, Simeonov A, Dix DJ, Austin CP, Kavlock RJ, Tice RR, Xia M. Profiling of the Tox2110K compound library
for agonists and antagonists of the estrogen receptor alpha signaling pathway. Sci Rep. 2014 Jul 11;4:5664. doi:
10.1038/srep05664. PMID: 25012808; PMCID: PMC4092345.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 786

TOX21_ERa_BLA_Antagonist_ratio

1.	General Information

1.1	Assay Title: Tox21 HEK293T Estrogen Receptor-alpha (ESR1) Antagonism Beta-lactamase Assay, Ratio

1.2	Assay Summary: TOX21 ERa BLA Antagonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-er-
bla-antagonist-pl. TOX21_ERa_BLA_Antagonist_ratio is an assay readout measuring reporter gene via receptor
activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase
reporter gene. The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) reporter gene substrate
used as the measure of target activity. Data from the assay component TOX21_ERa_BLA_Antagonist_ratio was
analyzed into 1 assay endpoint. This assay endpoint, TOX21_ERa_BLA_Antagonist_ratio, was analyzed in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of inducible reporter, loss-of-signal activity can be used to understand changes in the reporter gene as they
relate to the gene ESR1. Furthermore, this assay endpoint can be referred to as a primary readout, because this
assay has produced multiple assay endpoints and this ratio serves a reporter gene function to understand target
activity. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
nuclear receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic ERalpha gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ERa_BLA_Antagonist_ratio was designed to target nuclear receptor activity at the protein
(receptor) level, specifically mapping to ESR1 gene(s) using a positive control of 4-hydroxytamoxifen

The Tox21 estrogen receptor-alpha antagonism beta-lactamase assay screened a library of diverse
environmental compounds to probe for xenoestrogenic ligand-binding and potential to suppress estrogen-
dependent transcription, monitored through changes to bioluminescence signals produced from an enzymatic
reaction catalyzed by luciferase between the key substrate [CellTiter-Glo] and the target cofactor [ATP], Each

-------
well contained 0.5nM -beta-estradiol as an ER stimulator and measured the loss-of-signal as compared to
positive control of tetraoctylammonium bromide (100 percent inhibition). The assay is run in triplicate on 1536-
well microplates. Following 18 hour incubation of cells with test compounds, a membrane-permeable FRET-
based substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap
the negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of
blue (product) to green (substrate) fluorescence. Fluorescence signals are monitored using an Envision plate
reader and CellTiter-Glo assay reagent (Promega) is also incubated with test system for 30 minutes before
readout to detect cell viability. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Estrogen receptor (ER), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. Endocrine disrupting chemicals
(EDCs) and their interactions with steroid hormone receptors like ER causes disruption of normal endocrine
function. Therefore, it is important to understand the effect of environmental chemicals on the ER signaling
pathway. To identify ER agonists, GeneBLAzer ERalpha-UAS-bla GripTite cell line (Invitrogen) has been used to
screen the Tox21 library of diverse environmental compounds. ERalpha-UAS-bla cell line expresses a partial
ERalpha one-hybrid GAL4 system and is stably transfected with a -beta-lactamase reporter gene. The Tox21
ERalpha bla assays are qHTS format assays which measured the ability of a chemical to interact with estrogen
receptor alpha (ERalpha) by monitoring modulation of fluorescence reporter gene signals. This assay utilized a
human embryonic kidney cell line (HEK293T) which expresses a partial ERalpha and a one-hybrid GAL4 system
to quantify xenoestrogenic agonism. This assay is intended for use as a part of an integrated testing strategy, to
screen a large structurally diverse chemical library for compounds with the potential to interact with estrogen
receptor alpha mediated pathways and potentially affect endocrine systems in exposed populations. There is
strong evidence that estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse Outcome
Pathway (AOP) leading to reproductive dysfunction in oviparous vertebrates, and there is some evidence that
estrogen receptor activation is the MIE for putative AOPs leading to reduced survival due to renal failure and
leading to skewed sex ratios due to altered sexual differentiation in males (all AOPs currently under
development). Chemical-activity profiles derived from this assay can inform prioritization decisions for
compound selection in more resource intensive in vivo studies to further investigate the involvement of ER
agonism in pathways leading to hazardous outcomes in biological systems.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer ER alpha-UAS-bla GripTite cells contain the
ligand-binding domain of the human estrogen receptor alpha (ERalpha) fused to the DNA-binding domain of
GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

-------
2.5 Exposure Regime: Cell Thawing Method: 1ml frozen cells of ERalpha-bla were taken in pre-warmed 10ml of thaw
medium for centrifuging. 3ml of the thaw medium is taken to resuspend the pellet. The cells were seeded in T-
75 flask at 2 million cells. Cell Proliferation Method: The cells are detached using 0.05 percent Trypsin. Cells are
further passaged at a density of 4-5 million cells per T-225 flask. Assay Protocol: Rinse the cells twice with DPBS
and detach them using 0.05 percent Trypsin and centrifuge. Resuspend the pellet with assay buffer. Plate the
cells in black-clear bottom 1536 well plate at 5000/well/5uL through 8 tip plate dispenser (Multi drop). Incubate
at 37Cfor 5hrs. Transfer 23nLof the compounds from the library collection and positive control through Pintool.
Add luL of assay buffer with or without 0.5nM (final) Beta-estradiol. Incubate at 37Cfor 18hrs. Add luLof CCF4
dye using a single tip of a plate dispenser (BioRAPTR). Incubate at room temperature for 2hrs. Read the
fluorescence intensity through Envision plate reader. Add 4uLof CellTiter-Glo reagent using a single tip of a plate
dispenser (BioRAPTR). Incubate at room temperature for 30min. Read the luminescence through ViewLux plate
reader.

Baseline median absolute deviation for the assay (bmad): 3.329

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Estrogen receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast ER Pathway Model: Estrogen receptor assays used in ToxCast ER Pathway model

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

4-hydroxytamoxifen

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

-------
3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
4-hydroxytamoxifen was used as a positive ERa antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.

-------
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1315

Inactive hit count: 0
-------
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3 Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.861

Neutral control median absolute deviation, by plate: nmad	19.331

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-2106.99%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-100.09

Positive control well median absolute deviation, by plate: pmad	2.958

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-5.072

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

-------
Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 747.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

-------
• Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Huang R, Sakamuru S, Martin MT, Reif DM,
Judson RS, Houck KA, Casey W, Hsieh JH, Shockley KR, Ceger P, Fostel J, Witt KL, Tong W, Rotroff DM, Zhao T,
Shinn P, Simeonov A, Dix DJ, Austin CP, Kavlock RJ, Tice RR, Xia M. Profiling of the Tox2110K compound library
for agonists and antagonists of the estrogen receptor alpha signaling pathway. Sci Rep. 2014 Jul 11;4:5664. doi:
10.1038/srep05664. PMID: 25012808; PMCID: PMC4092345.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 787

TOX21_ERa_BLA_Antagonist_viability

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HEK293T Estrogen Receptor-alpha (ESR1) Antagonism Beta-
lactamase Assay

1.2	Assay Summary: TOX21 ERa BLA Antagonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-er-
bla-antagonist-pl. TOX21_ERa_BLA_Antagonist_viability is an assay readout measuring cellular ATP content and
detected with CellTiter-Glo Luciferase-coupled ATP quantitation. TOX21_ERa_BLA_Antagonist_viability used a
type of viability reporter where loss-of-signal activity can be used to understand changes in the cell viability.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a viability function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily
is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic ERalpha gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 estrogen receptor-alpha antagonism beta-lactamase assay screened a library of diverse
environmental compounds to probe for xenoestrogenic ligand-binding and potential to suppress estrogen-
dependent transcription, monitored through changes to bioluminescence signals produced from an enzymatic
reaction catalyzed by luciferase between the key substrate [CellTiter-Glo] and the target cofactor [ATP], Each
well contained 0.5nM -beta-estradiol as an ER stimulator and measured the loss-of-signal as compared to
positive control of tetraoctylammonium bromide (100 percent inhibition). The assay is run in triplicate on 1536-

-------
well microplates. Following 18 hour incubation of cells with test compounds, a membrane-permeable FRET-
based substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap
the negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of
blue (product) to green (substrate) fluorescence. Fluorescence signals are monitored using an Envision plate
reader and CellTiter-Glo assay reagent (Promega) is also incubated with test system for 30 minutes before
readout to detect cell viability. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Estrogen receptor (ER), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. Endocrine disrupting chemicals
(EDCs) and their interactions with steroid hormone receptors like ER causes disruption of normal endocrine
function. Therefore, it is important to understand the effect of environmental chemicals on the ER signaling
pathway. To identify ER agonists, GeneBLAzer ERalpha-UAS-bla GripTite cell line (Invitrogen) has been used to
screen the Tox21 library of diverse environmental compounds. ERalpha-UAS-bla cell line expresses a partial
ERalpha one-hybrid GAL4 system and is stably transfected with a -beta-lactamase reporter gene. The Tox21
ERalpha bla assays are qHTS format assays which measured the ability of a chemical to interact with estrogen
receptor alpha (ERalpha) by monitoring modulation of fluorescence reporter gene signals. This assay utilized a
human embryonic kidney cell line (HEK293T) which expresses a partial ERalpha and a one-hybrid GAL4 system
to quantify xenoestrogenic agonism. This assay is intended for use as a part of an integrated testing strategy, to
screen a large structurally diverse chemical library for compounds with the potential to interact with estrogen
receptor alpha mediated pathways and potentially affect endocrine systems in exposed populations. There is
strong evidence that estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse Outcome
Pathway (AOP) leading to reproductive dysfunction in oviparous vertebrates, and there is some evidence that
estrogen receptor activation is the MIE for putative AOPs leading to reduced survival due to renal failure and
leading to skewed sex ratios due to altered sexual differentiation in males (all AOPs currently under
development). Chemical-activity profiles derived from this assay can inform prioritization decisions for
compound selection in more resource intensive in vivo studies to further investigate the involvement of ER
agonism in pathways leading to hazardous outcomes in biological systems.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer ER alpha-UAS-bla GripTite cells contain the
ligand-binding domain of the human estrogen receptor alpha (ERalpha) fused to the DNA-binding domain of
GAL4 stably integrated into the cell line. These cells stably express a beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

-------
2.5 Exposure Regime: Cell Thawing Method: 1ml frozen cells of ERalpha-bla were taken in pre-warmed 10ml of thaw
medium for centrifuging. 3ml of the thaw medium is taken to resuspend the pellet. The cells were seeded in T-
75 flask at 2 million cells. Cell Proliferation Method: The cells are detached using 0.05 percent Trypsin. Cells are
further passaged at a density of 4-5 million cells per T-225 flask. Assay Protocol: Rinse the cells twice with DPBS
and detach them using 0.05 percent Trypsin and centrifuge. Resuspend the pellet with assay buffer. Plate the
cells in black-clear bottom 1536 well plate at 5000/well/5uL through 8 tip plate dispenser (Multi drop). Incubate
at 37Cfor 5hrs. Transfer 23nLof the compounds from the library collection and positive control through Pintool.
Add luL of assay buffer with or without 0.5nM (final) Beta-estradiol. Incubate at 37C for 18hrs. Add luLof CCF4
dye using a single tip of a plate dispenser (BioRAPTR). Incubate at room temperature for 2hrs. Read the
fluorescence intensity through Envision plate reader. Add 4uLof CellTiter-Glo reagent using a single tip of a plate
dispenser (BioRAPTR). Incubate at room temperature for 30min. Read the luminescence through ViewLux plate
reader.

Baseline median absolute deviation for the assay (bmad): 6.482
Response cutoff threshold used to determine hit calls: 38.891

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Estrogen receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region, ToxCast ER Pathway Model: Estrogen
receptor assays used in ToxCast ER Pathway model

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

-------
3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates

-------
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: Oihitc 0.9	NA hit count: hitc^O

380	6821	3295

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	233

gain-loss (gnls) model:	346

power(pow) model:	487

linear-polynomial (polyl) model:	5239

quadratic-polynomial(poly2) model:	816

exponential-2 (exp2) model:	331

exponential-3 (exp3) model:	23

exponential-4 (exp4) model:	2397

exponential-5 (exp5) model:	624

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3 Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In

-------
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.273

Neutral control median absolute deviation, by plate: nmad	8.826

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3442%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

-------
Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 624.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Huang R, Sakamuru S, Martin MT, Reif DM,
Judson RS, Houck KA, Casey W, Hsieh JH, Shockley KR, Ceger P, Fostel J, Witt KL, Tong W, Rotroff DM, Zhao T,
Shinn P, Simeonov A, Dix DJ, Austin CP, Kavlock RJ, Tice RR, Xia M. Profiling of the Tox2110K compound library

-------
for agonists and antagonists of the estrogen receptor alpha signaling pathway. Sci Rep. 2014 Jul 11;4:5664. doi:
10.1038/srep05664. PMID: 25012808; PMCID: PMC4092345.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 788

TOX21_ERa_LUC_VM7_Agonist

1.	General Information

1.1	Assay Title: Tox21 VM7 Estrogen Receptor-alpha (ESR1) Agonism Luciferase Assay

1.2	Assay Summary: TOX21 ERa LUC VM7 Agonist is a cell-based, single-readout assay that uses VM7, a human
breast tissue cell line, with measurements taken at 22 hours after chemical dosing in a 1536-well plate. See
tox21-er-luc-bgl-4e2-agonist-p2. TOX21_ERa_LUC_VM7_Agonist is one of one assay component(s) measured
or calculated from the TOX21_ERa_LUC_VM7_Agonist assay. It is designed to make measurements of luciferase
induction, a form of inducible reporter, as detected with bioluminescence signals by CellTiter-Glo Luciferase-
coupled ATP quantitation technology. Data from the assay component TOX21_ERa_LUC_VM7_Agonist was
analyzed into 1 assay endpoint. This assay endpoint, TOX21_ERa_LUC_VM7_Agonist, was analyzed in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of inducible reporter, gain-of-signal activity can be used to understand changes in the reporter gene as they
relate to the gene ESR1. Furthermore, this assay endpoint can be referred to as a primary readout, because the
performed assay has only produced 1 assay endpoint. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is
steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected VM7 cells are aliquoted into 1536-well microtiter plates
and incubated with test compounds for 24 hours prior to monitoring luminescence resulting from ER gene
expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ERa_LUC_VM7_Agonist was designed to measure changes to bioluminescence signals
produced from an enzymatic reaction involving the key substrate [One-Glo], Changes are indicative of changes
in transcriptional gene expression due to agonist activity regulated by the human estrogen receptor 1
[GeneSymbokESRl | GenelD:2099 | Uniprot_SwissProt_Accession:P03372],

The Tox21 VM7 estrogen receptor alpha agonism luciferase assay screened a library of diverse environmental
compounds to probe for xenoestrogenic ligand-binding and potential to inhibit estrogen-dependent

-------
transcription, as monitored through luciferase reporter gene signal activity using an endogenous full-length
ERalpha cell line of human breast tissue (VM7) that was stably transfected with an estrogen-responsive
luciferase reporter gene plasmid (pGudLuc7ere). The assay is run in triplicate on 1536-well microplate and
bioluminescence was measured following 24 hour incubation of cells with test compounds and 30 min
incubation of test system with ONE-GloTM luciferase assay reagent. The bioluminescent signal was monitored
using a Promega ViewLux plate reader to measure agonistic activity, this assay is performed with small amounts
of an ER alpha antagonist (ICI182780) added to each well and each compound is evaluated against a known ER
alpha agonist (beta-estradiol, E2) as a positive control (100 percent inhibition). Test compounds were assayed
for cytotoxicity by measuring protease activity with Promega CellTiter-Fluor with tetraoctylammonium bromide
as a positive control for cell death. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Estrogen receptor (ER), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. Endocrine disrupting chemicals
(EDCs) and their interactions with steroid hormone receptors like ER causes disruption of normal endocrine
function. Therefore, it is important to understand the effect of environmental chemicals on the ER signaling
pathway. To identify ER agonists, VM7-Luc-4E2 cell line (provided by Dr. Michael Denison from University of
California) has been used to screen the Tox21 library of diverse environmental compounds. VM7-Luc-4E2 cell
line endogenously expresses full-length ER-alpha and is stably transfected with a plasmid containing four
estrogen responsive elements (ERE) upstream of a luciferase reporter gene. The ERalpha_LUC_VM7 assays are
qHTS format assays which measure the ability of a chemical to interact with estrogen receptor alpha (ERalpha)
by monitoring modulation of fluorescence reporter gene signals. This assay utilized a human breast (VM7-Luc-
4E2) cell line which expresses endogenous full-length ERalpha and a luciferase reporter gene (ER-luc) to quantify
xenoestrogenic activity. This assay is intended for use as a part of an integrated testing strategy, to screen a
large structurally diverse chemical library for compounds with the potential to interact with estrogen receptor
alpha mediated pathways and potentially affect endocrine systems in exposed populations. There is strong
evidence that estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse Outcome Pathway
(AOP) leading to reproductive dysfunction in oviparous vertebrates, and there is some evidence that estrogen
receptor activation is the MIE for putative AOPs leading to reduced survival due to renal failure and leading to
skewed sex ratios due to altered sexual differentiation in males (all AOPs currently under development).
Chemical-activity profiles derived from this assay can inform prioritization decisions for compound selection in
more resource intensive in vivo studies to further investigate the involvement of ER agonism in pathways leading
to hazardous outcomes in biological systems.

2.3	Experimental System: adherent VM7 cell line used. Michigan Cancer Foundation-7 (MCF-7) cells are a breast
cancer cell line originating from a 69-year old woman in 1970 (Lee et al. 2015). This is an immortalized cell line
which endogenously expresses receptors for estrogen (ESR1 and ESR2) and progesterone (Geisinger et al. 1989)
as well as growth factors EGF and IGF-1 (Baldwin et al. 1998), and so provides an alternative to breast cell lines
for estrogen-sensitive proliferation assays. vM7-Luc-4E2 cells are vMCF7 cells which are stably transfected with
plasmid containing four estrogen responsive elements upstream of a luciferase reporter gene (Rogers and
Denison 2000) to measure transactivation activity occurring along estrogen signaling pathways. This was
previously the BGl-Luc estrogen receptor transactivation test method, but BGlLuc4E2 cells are being renamed
VM7Luc4E2 cells because recent DNA analysis (STR) revealed that the original cell line used to generate the
BGlLuc4E2 cells were not human ovarian carcinoma (BG-1) cells but a variant of human breast cancer (MCF7)
cells.

-------
2.4 Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5 Exposure Regime: Cell Thawing Method: Thaw a frozen vial of cells in 9ml of pre-warmed medium and seed
them in T175 flask at 2 million cells. Cell Proliferation Method: Trypsinize cells from the flask and centrifuge.
Add culture medium to the pellet and passage at 3-4 million perT-225 flask. Assay Protocol: Harvest from the
5-day culture in assay medium and re-suspend cells in assay medium. Dispense 4000 cells/5uL/well into 1536-
well tissue treated white/solid bottom plates. Incubate the plates for 24hrs (22-24hrs) at 37C and 5 percent
C02. Transfer 23nL of compounds from the library collection and positive control to the assay plates through
pintool. Incubate the plates for 22hrs (22-24hrs) at 37Cand 5 percent C02. Add 5ul of ONE-Glo reagent. Incubate
the plates at room temperature for 30min. Measure luminescence by ViewLux plate reader.

Baseline median absolute deviation for the assay (bmad): 2.775
Response cutoff threshold used to determine hit calls: 20

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Agonism of the estrogen receptor (ERa) signaling pathway is measured by bioluminescence activity
via an estrogen-responsive firefly luciferase reporter gene. Increased luciferase activity can be used to identify
the compounds that induce xenoestrogenic ligand-binding and ERalpha agonism. The cytotoxicity of the
compounds screened was tested in parallel by measuring the cell viability using CellTiter-Fluor assay in the same
wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast ER Pathway Model: Estrogen receptor assays used in ToxCast ER Pathway model

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1 Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

17b-Estradiol

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

-------
3.2 Data Analysis: Increased luminescence (gain-of-signal) was measured relative to 17beta-estradiol (positive
control) signal, using DMSO (neutral control) as a baseline for luciferase induction. Response was reported as a
percent of positive control activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain

-------
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inaclive hit count: 0
-------
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed

Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)

Positive control signal-to-background: (pmed/nmed)

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed

Negative control well median absolute deviation value, by plate: mmad

Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

-0.18
6.871
-3974.28%

99.579
8.914

NA

8.797

NA
NA

NA
NA
NA

NA

NA

-------
Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 793.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Judson RS, Magpantay FM, Chickarmane V, Haskell C, Tania N, Taylor J, Xia M, Huang R, Rotroff
DM, Filer DL, Houck KA, Martin MT, Sipes N, Richard AM, Mansouri K, Setzer RW, Knudsen TB, Crofton KM,
Thomas RS. Integrated Model of Chemical Perturbations of a Biological Pathway Using 18 In Vitro High-
Throughput Screening Assays for the Estrogen Receptor. Toxicol Sci. 2015 Nov;148(l):137-54. doi:
10.1093/toxsci/kfvl68. Epub 2015 Aug 13. PMID: 26272952; PMCID: PMC4635633., Huang R, Sakamuru S,
Martin MT, Reif DM, Judson RS, Houck KA, Casey W, Hsieh JH, Shockley KR, Ceger P, Fostel J, Witt KL, Tong W,
Rotroff DM, Zhao T, Shinn P, Simeonov A, Dix DJ, Austin CP, Kavlock RJ, Tice RR, Xia M. Profiling of the Tox21
10K compound library for agonists and antagonists of the estrogen receptor alpha signaling pathway. Sci Rep.
2014 Jul 11;4:5664. doi: 10.1038/srep05664. PMID: 25012808; PMCID: PMC4092345.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 789

TOX21_ERa_LUC_VM7_Antagonist_0.5nM_E2

1.	General Information

1.1	Assay Title: Tox21 VM7 Estrogen Receptor-alpha (ESR1) Antagonism (0.5nM E2) Luciferase Assay

1.2	Assay Summary: TOX21 ERa LUC VM7 Antagonist 0.5nM E2 is a cell-based, single-readout assay that uses
VM7, a human breast tissue cell line, with measurements taken at 22 hours after chemical dosing in a 1536-well
plate. This is a secondary assay for specificity to TOX21_ERa_LUC_VM7_Antagonist_0.1nM_E2. See tox21-er-
luc-bgl-4e2-antagonist-pl. TOX21_ERa_LUC_VM7_Antagonist_0.5nM_E2 is one of one assay component(s)
measured or calculated from the TOX21_ERa_LUC_VM7_Antagonist_0.5nM_E2 assay. It is designed to make
measurements of luciferase induction, a form of inducible reporter, as detected with bioluminescence signals
by CellTiter-Glo Luciferase-coupled ATP quantitation technology. Data from the assay component
TOX21_ERa_LUC_VM7_Antagonist_0.5nM_E2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_ERa_LUC_VM7_Antagonist_0.5nM_E2, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity
can be used to understand changes in the reporter gene as they relate to the gene ESR1. Furthermore, this assay
endpoint can be referred to as a primary readout, because the performed assay has only produced 1 assay
endpoint. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
nuclear receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected VM7 cells are aliquoted into 1536-well microtiter plates
and incubated with test compounds for 24 hours prior to monitoring luminescence resulting from ER gene
expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction involving the key
substrate [One-Glo] are indicative of changes in transcriptional gene expression due to antagonist activity
regulated by the human estrogen receptor 1 [GeneSymbokESRl | GenelD:2099 |
Uniprot_SwissProt_Accession:P03372] using a positive control of 4-hydroxytamoxifen

-------
The Tox21 VM7 estrogen receptor alpha antagonism luciferase assay screened a library of diverse
environmental compounds to probe for xenoestrogenic ligand-binding and potential to inhibit estrogen-
dependent transcription, as monitored through luciferase reporter gene signal activity using an endogenous
full-length ERalpha cell line of human breast tissue (VM7) that was stably transfected with an estrogen-
responsive luciferase reporter gene plasmid (pGudLuc7ere). The assay is run in triplicate on 1536-well
microplate and bioluminescence was measured following 24 hour incubation of cells with test compounds and
30 min incubation of test system with ONE-GloTM luciferase assay reagent. The bioluminescent signal was
monitored using a Promega ViewLux plate reader To measure antagonistic activity, this assay is performed with
small amounts of an ER alpha agonist (beta-estradiol, E2) added to each well and each compound is evaluated
against a known ER alpha antagonist (4-Hydroxytamoxifen) as a positive control (100 percent inhibition). Test
compounds were assayed for cytotoxicity by measuring protease activity with Promega CellTiter-Fluor with
tetraoctylammonium bromide as a positive control for cell death. Compound auto-fluorescence was monitored
in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Estrogen receptor (ER), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. Endocrine disrupting chemicals
(EDCs) and their interactions with steroid hormone receptors like ER causes disruption of normal endocrine
function. Therefore, it is important to understand the effect of environmental chemicals on the ER signaling
pathway. To identify ER agonists, VM7-Luc-4E2 cell line (provided by Dr. Michael Denison from University of
California) has been used to screen the Tox21 library of diverse environmental compounds. VM7-Luc-4E2 cell
line endogenously expresses full-length ER-alpha and is stably transfected with a plasmid containing four
estrogen responsive elements (ERE) upstream of a luciferase reporter gene. The ERalpha_LUC_VM7 assays are
qHTS format assays which measure the ability of a chemical to interact with estrogen receptor alpha (ERalpha)
by monitoring modulation of fluorescence reporter gene signals. This assay utilized a human breast (VM7-Luc-
4E2) cell line which expresses endogenous full-length ERalpha and a luciferase reporter gene (ER-luc) to quantify
xenoestrogenic activity. This assay is intended for use as a part of an integrated testing strategy, to screen a
large structurally diverse chemical library for compounds with the potential to interact with estrogen receptor
alpha mediated pathways and potentially affect endocrine systems in exposed populations. There is strong
evidence that estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse Outcome Pathway
(AOP) leading to reproductive dysfunction in oviparous vertebrates, and there is some evidence that estrogen
receptor activation is the MIE for putative AOPs leading to reduced survival due to renal failure and leading to
skewed sex ratios due to altered sexual differentiation in males (all AOPs currently under development).
Chemical-activity profiles derived from this assay can inform prioritization decisions for compound selection in
more resource intensive in vivo studies to further investigate the involvement of ER agonism in pathways leading
to hazardous outcomes in biological systems.

2.3	Experimental System: adherent VM7 cell line used. Michigan Cancer Foundation-7 (MCF-7) cells are a breast
cancer cell line originating from a 69-year old woman in 1970 (Lee et al. 2015). This is an immortalized cell line
which endogenously expresses receptors for estrogen (ESR1 and ESR2) and progesterone (Geisinger et al. 1989)
as well as growth factors EGF and IGF-1 (Baldwin et al. 1998), and so provides an alternative to breast cell lines
for estrogen-sensitive proliferation assays. vM7-Luc-4E2 cells are vMCF7 cells which are stably transfected with
plasmid containing four estrogen responsive elements upstream of a luciferase reporter gene (Rogers and
Denison 2000) to measure transactivation activity occurring along estrogen signaling pathways. This was
previously the BGl-Luc estrogen receptor transactivation test method, but BGlLuc4E2 cells are being renamed
VM7Luc4E2 cells because recent DNA analysis (STR) revealed that the original cell line used to generate the

-------
BGlLuc4E2 cells were not human ovarian carcinoma (BG-1) cells but a variant of human breast cancer (MCF7)
cells.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Maintain cells below 85-90% confluence in culture medium. For
assay purpose, the cells should be cultured in assay medium with 10% charcoal stripped FBS for 5 days with
alternate day medium changed to fresh medium. Especially while in assay culture, the cells should not reach
more than 85% confluence as they would become harder to detach if they reach over confluence. Cell Thawing
Method: Thaw a frozen vial of cells in 9ml of pre-warmed medium and seed them in T175 flask at 2 million cells.
Cell Proliferation Method: Trypsinize cells from the flask and centrifuge. Add culture medium to the pellet and
passage at 3-4 million per T-225 flask. Assay Protocol: Harvest cells from the 5-day culture in assay medium and
resuspend cells in assay medium. Dispense 4000 cells/4uL/well into 1536-well tissue treated white/solid bottom
plates using an 8 tip dispenser (Multidrop). Incubate the assay plates for 22hrs at 37C and 5% C02. First luL of
lO.OnM (final concentration) ICI-182,780 (ER-Antagonist) or assay buffer was added using two separate tips of
a dispenser (BioRAPTR). Then transfer 23nL of compounds from the library collection and positive control to the
assay plates by using a Pintool station. Incubate the assay plates for 22hrs at 37C and 5% C02. After 21hrs of
incubation, lul of CellTiter-Fluor(TM) Cell Viability Assay reagent was added using a single tip of a dispenser
(BioRAPTR). Incubate the assay plates at 37C and 5% C02 for lhr. Measure fluorescence signal by ViewLux plate
reader (Exposure time = lsec). Then add 4ul of ONE-Glo(TM) Luciferase reagent using a single tip of a dispenser
(BioRAPTR). Incubate the plates at room temperature for 30min.

Baseline median absolute deviation for the assay (bmad): 4.083
Response cutoff threshold used to determine hit calls: 24.498

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Antagonism of the estrogen receptor (ERa) signaling pathway is measured by bioluminescence
activity via an estrogen-responsive firefly luciferase reporter gene. Decreased luciferase activity can be used to
identify the compounds that inhibit xenoestrogenic ligand-binding and ERalpha antagonism. The cytotoxicity of
the compounds screened was tested in parallel by measuring the cell viability using CellTiter-Fluor assay in the
same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast ER Pathway Model: Estrogen receptor assays used in ToxCast ER Pathway model

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

4-hydroxytamoxifen

I arget (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

-------
Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Decreased luminescence (loss-of-signal) was measured relative to 4-hydroxytamoxifen (positive
control) signal, using DMSO (neutral control) as a baseline for luciferase induction. Response was reported as a
percent of positive control activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active

-------
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: Oihitc 0.9	NA hit count: hitc^O

1028	6069	3399

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	438

gain-loss (gnls) model:	609

power(pow) model:	701

linear-polynomial (polyl) model:	4229

quadratic-polynomial(poly2) model:	1062

exponential-2 (exp2) model:	434

exponential-3 (exp3) model:	120

exponential-4 (exp4) model:	2304

exponential-5 (exp5) model:	599

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

-------
3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-0.216

Neutral control median absolute deviation, by plate: nmad

7.325

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-3519.64%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

-100.054

0.902

Z Prime Factor for median positive and neutral control across all plates:

NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

-13.537

((pmed - nmed) /sqrt(pmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)

NA

Positive control signal-to-background: (pmed/nmed)

NA

-------
NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 599.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

-------
6.	Bibliography: Judson RS, Magpantay FM, Chickarmane V, Haskell C, Tania N, Taylor J, Xia M, Huang R, Rotroff
DM, Filer DL, Houck KA, Martin MT, Sipes N, Richard AM, Mansouri K, Setzer RW, Knudsen TB, Crofton KM,
Thomas RS. Integrated Model of Chemical Perturbations of a Biological Pathway Using 18 In Vitro High-
Throughput Screening Assays for the Estrogen Receptor. Toxicol Sci. 2015 Nov;148(l):137-54. doi:
10.1093/toxsci/kfvl68. Epub 2015 Aug 13. PMID: 26272952; PMCID: PMC4635633., Huang R, Sakamuru S,
Martin MT, Reif DM, Judson RS, Houck KA, Casey W, Hsieh JH, Shockley KR, Ceger P, Fostel J, Witt KL, Tong W,
Rotroff DM, Zhao T, Shinn P, Simeonov A, Dix DJ, Austin CP, Kavlock RJ, Tice RR, Xia M. Profiling of the Tox21
10K compound library for agonists and antagonists of the estrogen receptor alpha signaling pathway. Sci Rep.
2014 Jul 11;4:5664. doi: 10.1038/srep05664. PMID: 25012808; PMCID: PMC4092345.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 790

TOX21_ERa_LUC_VM7_Antagonist_0.5nM_E2_viability

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 VM7 Estrogen Receptor-alpha (ESR1) Antagonism (0.5nM E2)
Luciferase Assay

1.2	Assay Summary: TOX21 ERa LUC VM7 Antagonist 0.5nM E2 is a cell-based, single-readout assay that uses
VM7, a human breast tissue cell line, with measurements taken at 22 hours after chemical dosing in a 1536-well
plate. This is a secondary assay for specificity to TOX21_ERa_LUC_VM7_Antagonist_0.1nM_E2. See tox21-er-
luc-bgl-4e2-antagonist-pl. TOX21_ERa_LUC_VM7_Antagonist_0.5nM_E2_viability is an assay readout
measuring cellular ATP content and detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
TOX21_ERa_LUC_VM7_Antagonist_0.5nM_E2_viability used a type of viability reporter where loss-of-signal
activity can be used to understand changes in the cell viability. Furthermore, this assay endpoint can be referred
to as a secondary readout, because this assay has produced multiple assay endpoints where this one serves a
viability function. To generalize the intended target to other relatable targets, this assay endpoint is annotated
to the cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected VM7 cells are aliquoted into 1536-well microtiter plates
and incubated with test compounds for 24 hours prior to monitoring luminescence resulting from ER gene
expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 VM7 estrogen receptor alpha antagonism luciferase assay screened a library of diverse
environmental compounds to probe for xenoestrogenic ligand-binding and potential to inhibit estrogen-
dependent transcription, as monitored through luciferase reporter gene signal activity using an endogenous
full-length ERalpha cell line of human breast tissue (VM7) that was stably transfected with an estrogen-
responsive luciferase reporter gene plasmid (pGudLuc7ere). The assay is run in triplicate on 1536-well

-------
microplate and bioluminescence was measured following 24 hour incubation of cells with test compounds and
30 min incubation of test system with ONE-GloTM luciferase assay reagent. The bioluminescent signal was
monitored using a Promega ViewLux plate reader To measure antagonistic activity, this assay is performed with
small amounts of an ER alpha agonist (beta-estradiol, E2) added to each well and each compound is evaluated
against a known ER alpha antagonist (4-Hydroxytamoxifen) as a positive control (100 percent inhibition). Test
compounds were assayed for cytotoxicity by measuring protease activity with Promega CellTiter-Fluor with
tetraoctylammonium bromide as a positive control for cell death. Compound auto-fluorescence was monitored
in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Estrogen receptor (ER), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. Endocrine disrupting chemicals
(EDCs) and their interactions with steroid hormone receptors like ER causes disruption of normal endocrine
function. Therefore, it is important to understand the effect of environmental chemicals on the ER signaling
pathway. To identify ER agonists, VM7-Luc-4E2 cell line (provided by Dr. Michael Denison from University of
California) has been used to screen the Tox21 library of diverse environmental compounds. VM7-Luc-4E2 cell
line endogenously expresses full-length ER-alpha and is stably transfected with a plasmid containing four
estrogen responsive elements (ERE) upstream of a luciferase reporter gene. The ERalpha_LUC_VM7 assays are
qHTS format assays which measure the ability of a chemical to interact with estrogen receptor alpha (ERalpha)
by monitoring modulation of fluorescence reporter gene signals. This assay utilized a human breast (VM7-Luc-
4E2) cell line which expresses endogenous full-length ERalpha and a luciferase reporter gene (ER-luc) to quantify
xenoestrogenic activity. This assay is intended for use as a part of an integrated testing strategy, to screen a
large structurally diverse chemical library for compounds with the potential to interact with estrogen receptor
alpha mediated pathways and potentially affect endocrine systems in exposed populations. There is strong
evidence that estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse Outcome Pathway
(AOP) leading to reproductive dysfunction in oviparous vertebrates, and there is some evidence that estrogen
receptor activation is the MIE for putative AOPs leading to reduced survival due to renal failure and leading to
skewed sex ratios due to altered sexual differentiation in males (all AOPs currently under development).
Chemical-activity profiles derived from this assay can inform prioritization decisions for compound selection in
more resource intensive in vivo studies to further investigate the involvement of ER agonism in pathways leading
to hazardous outcomes in biological systems.

2.3	Experimental System: adherent VM7 cell line used. Michigan Cancer Foundation-7 (MCF-7) cells are a breast
cancer cell line originating from a 69-year old woman in 1970 (Lee et al. 2015). This is an immortalized cell line
which endogenously expresses receptors for estrogen (ESR1 and ESR2) and progesterone (Geisinger et al. 1989)
as well as growth factors EGF and IGF-1 (Baldwin et al. 1998), and so provides an alternative to breast cell lines
for estrogen-sensitive proliferation assays. vM7-Luc-4E2 cells are vMCF7 cells which are stably transfected with
plasmid containing four estrogen responsive elements upstream of a luciferase reporter gene (Rogers and
Denison 2000) to measure transactivation activity occurring along estrogen signaling pathways. This was
previously the BGl-Luc estrogen receptor transactivation test method, but BGlLuc4E2 cells are being renamed
VM7Luc4E2 cells because recent DNA analysis (STR) revealed that the original cell line used to generate the
BGlLuc4E2 cells were not human ovarian carcinoma (BG-1) cells but a variant of human breast cancer (MCF7)
cells.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

-------
2.5 Exposure Regime: Quality Control Precautions: Maintain cells below 85-90% confluence in culture medium. For
assay purpose, the cells should be cultured in assay medium with 10% charcoal stripped FBS for 5 days with
alternate day medium changed to fresh medium. Especially while in assay culture, the cells should not reach
more than 85% confluence as they would become harder to detach if they reach over confluence. Cell Thawing
Method: Thaw a frozen vial of cells in 9ml of pre-warmed medium and seed them in T175 flask at 2 million cells.
Cell Proliferation Method: Trypsinize cells from the flask and centrifuge. Add culture medium to the pellet and
passage at 3-4 million per T-225 flask. Assay Protocol: Harvest cells from the 5-day culture in assay medium and
resuspend cells in assay medium. Dispense 4000 cells/4uL/well into 1536-well tissue treated white/solid bottom
plates using an 8 tip dispenser (Multidrop). Incubate the assay plates for 22hrs at 37C and 5% C02. First luL of
lO.OnM (final concentration) ICI-182,780 (ER-Antagonist) or assay buffer was added using two separate tips of
a dispenser (BioRAPTR). Then transfer 23nL of compounds from the library collection and positive control to the
assay plates by using a Pintool station. Incubate the assay plates for 22hrs at 37C and 5% C02. After 21hrs of
incubation, lul of CellTiter-Fluor(TM) Cell Viability Assay reagent was added using a single tip of a dispenser
(BioRAPTR). Incubate the assay plates at 37C and 5% C02 for lhr. Measure fluorescence signal by ViewLux plate
reader (Exposure time = lsec). Then add 4ul of ONE-Glo(TM) Luciferase reagent using a single tip of a dispenser
(BioRAPTR). Incubate the plates at room temperature for 30min.

Baseline median absolute deviation for the assay (bmad): 4.094
Response cutoff threshold used to determine hit calls: 24.566

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Antagonism of the estrogen receptor (ERa) signaling pathway is measured by bioluminescence
activity via an estrogen-responsive firefly luciferase reporter gene. Decreased luciferase activity can be used to
identify the compounds that inhibit xenoestrogenic ligand-binding and ERalpha antagonism. The cytotoxicity of
the compounds screened was tested in parallel by measuring the cell viability using CellTiter-Fluor assay in the
same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region, ToxCast ER Pathway Model: Estrogen
receptor assays used in ToxCast ER Pathway model

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

3.

Additionally, this assay was annotated to the intended target family of cell cycle.
Data Interpretation

-------
The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:

-------
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.179

Neutral control median absolute deviation, by plate: nmad	6.415

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3530.47%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

-------
NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 702.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

-------
6.	Bibliography: Judson RS, Magpantay FM, Chickarmane V, Haskell C, Tania N, Taylor J, Xia M, Huang R, Rotroff
DM, Filer DL, Houck KA, Martin MT, Sipes N, Richard AM, Mansouri K, Setzer RW, Knudsen TB, Crofton KM,
Thomas RS. Integrated Model of Chemical Perturbations of a Biological Pathway Using 18 In Vitro High-
Throughput Screening Assays for the Estrogen Receptor. Toxicol Sci. 2015 Nov;148(l):137-54. doi:
10.1093/toxsci/kfvl68. Epub 2015 Aug 13. PMID: 26272952; PMCID: PMC4635633., Huang R, Sakamuru S,
Martin MT, Reif DM, Judson RS, Houck KA, Casey W, Hsieh JH, Shockley KR, Ceger P, Fostel J, Witt KL, Tong W,
Rotroff DM, Zhao T, Shinn P, Simeonov A, Dix DJ, Austin CP, Kavlock RJ, Tice RR, Xia M. Profiling of the Tox21
10K compound library for agonists and antagonists of the estrogen receptor alpha signaling pathway. Sci Rep.
2014 Jul 11;4:5664. doi: 10.1038/srep05664. PMID: 25012808; PMCID: PMC4092345.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 791

TOX21_GR_BLA_Agonist_chl

1.	General Information

1.1	Assay Title: Tox21 HeLa Glucocorticoid Receptor (GR) Agonism Beta-lactamase Assay, Channel 1 Readout of
Uncleaved Substrate

1.2	Assay Summary: TOX21 GR BLA Agonist is a cell-based, single-readout assay that uses HeLa, a human cervix
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-gr-hela-
bla-agonist-pl. TOX21_GR_BLA_Agonist_chl is an assay readout measuring reporter gene via receptor activity
and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter
gene. The signal is derived from the uncleaved reporter gene substrate and is used to calculate the ratio of
cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay
component TOX21_GR_BLA_Agonist_chl was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_GR_BLA_Agonist_chl, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene NR3C1. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HeLa cells are aliquoted into 1536-well microtiter plates
at a concentration of 1500 cells/6 uL and incubated with test compounds for 18 hours prior to monitoring
fluorescence emission resulting from xenobiotic glucocorticoid receptor expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_GR_BLA_Agonist_chl was designed to measure uncleaved reporter gene substrate to
target nuclear receptor activity at the protein (receptor) level, specifically mapping to NR3C1 gene(s) using a
positive control of Dexamethasone

The Tox21 glucocorticoid receptor (GR) beta-lactamase agonism assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to induce glucocorticoid
receptor signaling, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4

-------
system. The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with test
compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HeLa cell line used. GR-bla HeLa cells are engineered to express beta-lactamase
under the control of glucocorticoid receptor. These cells stably express a -beta-lactamase reporter gene under
the transcriptional control of an upstream activator sequence (UAS).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cells should not be grown more than 80-85% confluence. The
cell performance is affected is they are more confluent. Handle 1536 well plate black clear bottom plates
carefully by sides. Cell Thawing Method: Place 14 mL of pre-warmed thaw medium into a 15 ml conical tube.
Remove the vial of cells to be thawed from liquid nitrogen and thaw rapidly by placing at 37C in a water bath
with gentle agitation for 1-2 minutes. Do not submerge vial in water. Mix the entire content of the vial to 14 ml
of pre-warmed medium and centrifuge to remove DMSO. Discard the supernatant and transfer the precipitated
cells to T175 flask using 30 ml thawing medium. Cell Proliferation Method: Detach the cells from the flask using
TrypLExpress. The cells are re-seeded in T-175 flask at 1-1.3 million. Assay Protocol: Spin down the cells after
rinsing the cells with DPBS and trypsinizing. Resuspend the pellet with assay medium. Plate the cells in black-
clear bottom 1536 well plate at 1500/well/6uL through 8 tip Multidrop plate dispenser. Incubate for 4 hrs at
37C / 99% Humidity / 5% C02. Transfer 23nL of compounds from the library collection and positive control to
the assay plates through pintool. Incubate for 18 hrs at 379C / 99% Humidity / 5% C02. Add luL of CCF4 (FRET
Substrate) dye using a single tip plate dispenser (BioRAPTR). Incubate at room temperature for 2 hrs in dark.
Read the fluorescence intensity through Envision plate reader using Beta-Lactamase protocol optimized for this
cell type.

Baseline median absolute deviation for the assay (bmad): 3.641
Response cutoff threshold used to determine hit calls: 21.845
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6 Response: Glucocorticoid receptor agonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

Dexamethasone

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Dexamethasone was used as a positive GR agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
732

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.215

Neutral control median absolute deviation, by plate: nmad	6.65

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-2828.87%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-26.954

Positive control well median absolute deviation, by plate: pmad	6.492

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-2.779

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 708.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 792

TOX21_GR_BLA_Agonist_ch2

1.	General Information

1.1	Assay Title: Tox21 HeLa Glucocorticoid Receptor (GR) Agonism Beta-lactamase Assay, Channel 2 Readout of
Cleaved Substrate

1.2	Assay Summary: TOX21 GR BLA Agonist is a cell-based, single-readout assay that uses HeLa, a human cervix
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-gr-hela-
bla-agonist-pl. TOX21_GR_BLA_Agonist_ch2 is an assay readout measuring reporter gene via receptor activity
and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter
gene. The signal is derived from the cleaved reporter gene substrate and is used to calculate the ratio of cleaved
(ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay component
TOX21_GR_BLA_Agonist_ch2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_GR_BLA_Agonist_ch2, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, increased activity can be used to
understand changes in the reporter gene as they relate to the gene NR3C1. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HeLa cells are aliquoted into 1536-well microtiter plates
at a concentration of 1500 cells/6 uL and incubated with test compounds for 18 hours prior to monitoring
fluorescence emission resulting from xenobiotic glucocorticoid receptor expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_GR_BLA_Agonist_ch2 was designed to measure cleaved reporter gene substrate to target
nuclear receptor (non-steroidal) activity at the protein (receptor) level, specifically mapping to NR3C1 gene(s)
using a positive control of Dexamethasone

The Tox21 glucocorticoid receptor (GR) beta-lactamase agonism assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to induce glucocorticoid
receptor signaling, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4

-------
system. The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with test
compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HeLa cell line used. GR-bla HeLa cells are engineered to express beta-lactamase
under the control of glucocorticoid receptor. These cells stably express a -beta-lactamase reporter gene under
the transcriptional control of an upstream activator sequence (UAS).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cells should not be grown more than 80-85% confluence. The
cell performance is affected is they are more confluent. Handle 1536 well plate black clear bottom plates
carefully by sides. Cell Thawing Method: Place 14 mL of pre-warmed thaw medium into a 15 ml conical tube.
Remove the vial of cells to be thawed from liquid nitrogen and thaw rapidly by placing at 37C in a water bath
with gentle agitation for 1-2 minutes. Do not submerge vial in water. Mix the entire content of the vial to 14 ml
of pre-warmed medium and centrifuge to remove DMSO. Discard the supernatant and transfer the precipitated
cells to T175 flask using 30 ml thawing medium. Cell Proliferation Method: Detach the cells from the flask using
TrypLExpress. The cells are re-seeded in T-175 flask at 1-1.3 million. Assay Protocol: Spin down the cells after
rinsing the cells with DPBS and trypsinizing. Resuspend the pellet with assay medium. Plate the cells in black-
clear bottom 1536 well plate at 1500/well/6uL through 8 tip Multidrop plate dispenser. Incubate for 4 hrs at
37C / 99% Humidity / 5% C02. Transfer 23nL of compounds from the library collection and positive control to
the assay plates through pintool. Incubate for 18 hrs at 379C / 99% Humidity / 5% C02. Add luL of CCF4 (FRET
Substrate) dye using a single tip plate dispenser (BioRAPTR). Incubate at room temperature for 2 hrs in dark.
Read the fluorescence intensity through Envision plate reader using Beta-Lactamase protocol optimized for this
cell type.

Baseline median absolute deviation for the assay (bmad): 2.114

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6 Response: Glucocorticoid receptor agonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

Dexamethasone

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Dexamethasone was used as a positive GR agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
274

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.118

Neutral control median absolute deviation, by plate: nmad	4.025

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3347.54%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	99.454

Positive control well median absolute deviation, by plate: pmad	18.347

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	5.258

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 789.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 793

T0X21_G R_BLA_Agonist_ratio

1.	General Information

1.1	Assay Title: Tox21 HeLa Glucocorticoid Receptor (GR) Agonism Beta-lactamase Assay, Ratio

1.2	Assay Summary: TOX21 GR BLA Agonist is a cell-based, single-readout assay that uses HeLa, a human cervix
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-gr-hela-
bla-agonist-pl. TOX21_GR_BLA_Agonist_ratio is an assay readout measuring reporter gene via receptor activity
and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter
gene. The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of
target activity. Data from the assay component TOX21_GR_BLA_Agonist_ratio was analyzed into 1 assay
endpoint. This assay endpoint, TOX21_GR_BLA_Agonist_ratio, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter,
gain-of-signal activity can be used to understand changes in the reporter gene as they relate to the gene NR3C1.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this ratio serves a reporter gene function to understand target activity. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the nuclear
receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HeLa cells are aliquoted into 1536-well microtiter plates
at a concentration of 1500 cells/6 uL and incubated with test compounds for 18 hours prior to monitoring
fluorescence emission resulting from xenobiotic glucocorticoid receptor expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_GR_BLA_Agonist_ratio was designed to target nuclear receptor (non-steroidal) activity at
the protein (receptor) level, specifically mapping to NR3C1 gene(s) using a positive control of Dexamethasone

The Tox21 glucocorticoid receptor (GR) beta-lactamase agonism assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to induce glucocorticoid
receptor signaling, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with test

-------
compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. The Glucocorticoid receptor (GR) is one of the members of the nuclear
receptor family of ligand-dependent transcription factors. GR plays a critical role in carbohydrate, protein and
lipid metabolisms and programmed cell death. CellSensor GR-bla HeLa cell line (Invitrogen) contains a beta-
lactamase reporter gene under the control of the glucocorticoid response element stably integrated into HeLa
cells, a cervical cancer cell line that expresses glucocorticoid receptor. The activation of the reporter gene under
culture conditions can be detected by fluorescence intensity. This cell line has been used to screen the Tox21
10K compound library and to identify the compounds that activate the GR signaling.

2.3	Experimental System: adherent HeLa cell line used. GR-bla HeLa cells are engineered to express beta-lactamase
under the control of glucocorticoid receptor. These cells stably express a -beta-lactamase reporter gene under
the transcriptional control of an upstream activator sequence (UAS).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cells should not be grown more than 80-85% confluence. The
cell performance is affected is they are more confluent. Handle 1536 well plate black clear bottom plates
carefully by sides. Cell Thawing Method: Place 14 mL of pre-warmed thaw medium into a 15 ml conical tube.
Remove the vial of cells to be thawed from liquid nitrogen and thaw rapidly by placing at 37C in a water bath
with gentle agitation for 1-2 minutes. Do not submerge vial in water. Mix the entire content of the vial to 14 ml
of pre-warmed medium and centrifuge to remove DMSO. Discard the supernatant and transfer the precipitated
cells to T175 flask using 30 ml thawing medium. Cell Proliferation Method: Detach the cells from the flask using
TrypLExpress. The cells are re-seeded in T-175 flask at 1-1.3 million. Assay Protocol: Spin down the cells after
rinsing the cells with DPBS and trypsinizing. Resuspend the pellet with assay medium. Plate the cells in black-
clear bottom 1536 well plate at 1500/well/6uL through 8 tip Multidrop plate dispenser. Incubate for 4 hrs at
37C / 99% Humidity / 5% C02. Transfer 23nL of compounds from the library collection and positive control to
the assay plates through pintool. Incubate for 18 hrs at 379C / 99% Humidity / 5% C02. Add luL of CCF4 (FRET
Substrate) dye using a single tip plate dispenser (BioRAPTR). Incubate at room temperature for 2 hrs in dark.
Read the fluorescence intensity through Envision plate reader using Beta-Lactamase protocol optimized for this
cell type.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

-------
Dexamethasone	DMSO

Baseline median absolute deviation for the assay (bmad): 1.186
Response cutoff threshold used to determine hit calls: 20
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Glucocorticoid receptor agonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Dexamethasone was used as a positive GR agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median

-------
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.049

Neutral control median absolute deviation, by plate: nmad	1.881

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3204.27%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	99.775

Positive control well median absolute deviation, by plate: pmad	7.962

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	12.181

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 602.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 794

T0X21_G R_B LA_Antagon ist_ratio

1.	General Information

1.1	Assay Title: Tox21 HeLa Glucocorticoid Receptor (GR) Antagonism Beta-lactamase Assay, Ratio

1.2	Assay Summary: TOX21 GR BLA Antagonist is a cell-based, single-readout assay that uses HeLa, a human cervix
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-gr-hela-
bla-antagonist-pl. TOX21_GR_BLA_Antagonist_ratio is an assay readout measuring reporter gene via receptor
activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase
reporter gene. The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) reporter gene substrate
used as the measure of target activity. Data from the assay component TOX21_GR_BLA_Antagonist_ratio was
analyzed into 1 assay endpoint. This assay endpoint, TOX21_GR_BLA_Antagonist_ratio, was analyzed in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of inducible reporter, loss-of-signal activity can be used to understand changes in the reporter gene as they
relate to the gene NR3C1. Furthermore, this assay endpoint can be referred to as a primary readout, because
this assay has produced multiple assay endpoints and this ratio serves a reporter gene function to understand
gene activity. To generalize the intended target to other relatable targets, this assay endpoint is annotated to
the nuclear receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HeLa cells are aliquoted into 1536-well microtiter plates
at a concentration of 1500 cells/6 uL and incubated with test compounds for 18 hours prior to monitoring
fluorescence emission resulting from xenobiotic glucocorticoid receptor expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_GR_BLA_Antagonist_ratio was designed to target nuclear receptor activity at the protein
(receptor) level, specifically mapping to NR3C1 gene(s) using a positive control of Mifepristone

The Tox21 glucocorticoid receptor (GR) beta-lactamase antagonism assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to suppress glucocorticoid
receptor signaling, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with test

-------
compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. The Glucocorticoid receptor (GR) is one of the members of the nuclear
receptor family of ligand-dependent transcription factors. GR plays a critical role in carbohydrate, protein and
lipid metabolisms and programmed cell death. CellSensor GR-bla HeLa cell line (Invitrogen) contains a beta-
lactamase reporter gene under the control of the glucocorticoid response element stably integrated into HeLa
cells, a cervical cancer cell line that expresses glucocorticoid receptor. The activation of the reporter gene under
culture conditions can be detected by fluorescence intensity. This cell line has been used to screen the Tox21
10K compound library and to identify the compounds that inhibit the GR signaling.

2.3	Experimental System: adherent HeLa cell line used. GR-bla HeLa cells are engineered to express beta-lactamase
under the control of glucocorticoid receptor. These cells stably express a -beta-lactamase reporter gene under
the transcriptional control of an upstream activator sequence (UAS).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cells should not be grown more than 80-85% confluence. The
cell performance is affected is they are more confluent. Handle 1536 well plate black clear bottom plates
carefully by sides. Cell Thawing Method: Place 14 mL of pre-warmed thaw medium into a 15 ml conical tube.
Remove the vial of cells to be thawed from liquid nitrogen and thaw rapidly by placing at 37C in a water bath
with gentle agitation for 1-2 minutes. Do not submerge vial in water. Mix the entire content of the vial to 14 ml
of pre-warmed medium and centrifuge to remove DMSO. Discard the supernatant and transfer the precipitated
cells to T175 flask using 30 ml thawing medium. Cell Proliferation Method: Detach the cells from the flask using
TrypLExpress. The cells are re-seeded in T-175 flask at 1-1.3 million. Assay Protocol: Spin down the cells after
rinsing the cells with DPBS and trypsinizing. Resuspend the pellet with assay medium. Plate the cells in black-
clear bottom 1536 well plate at 1500/well/6uL through 8 tip Multidrop plate dispenser. Incubate for 4 hrs at
37C / 99% Humidity / 5% C02. Transfer 23nL of compounds from the library collection and positive control to
the assay plates through pintool. -Add 1 uL of buffer and luL of Agonist concentration to respective columns as
per plate map. Incubate for 18 hrs at 37C / 99% Humidity / 5% C02. Add luL of CCF4 (FRET Substrate) dye using
a single tip plate dispenser (BioRAPTR). Incubate at room temperature for 2 hrs in dark. Add 3 uL of Cell Titer
Glo and Incubate at room temperature for 0.5 hrs in dark. Read on ViewLux protocol optimized for this cell type.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

-------
Mifeprostone	DMSO

Baseline median absolute deviation for the assay (bmad): 6.344
Response cutoff threshold used to determine hit calls: 38.063
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Glucocorticoid receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Mifepristone was used as a positive GR antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median

-------
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.187

Neutral control median absolute deviation, by plate: nmad	11.76

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-5000.5%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-100.134

Positive control well median absolute deviation, by plate: pmad	4.583

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-7.876

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 765.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 795

T0X21_G R_BLA_Antagon ist_via bi I ity

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HeLa Glucocorticoid Receptor (GR) Antagonism Beta-lactamase
Assay

1.2	Assay Summary: TOX21 GR BLA Antagonist is a cell-based, single-readout assay that uses HeLa, a human cervix
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-gr-hela-
bla-antagonist-pl. TOX21_GR_BLA_Antagonist_viability is an assay readout measuring cellular ATP content and
detected with CellTiter-Glo Luciferase-coupled ATP quantitation. TOX21_GR_BLA_Antagonist_viability used a
type of viability reporter where loss-of-signal activity can be used to understand changes in the cell viability.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a viability function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily
is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HeLa cells are aliquoted into 1536-well microtiter plates
at a concentration of 1500 cells/6 uL and incubated with test compounds for 18 hours prior to monitoring
fluorescence emission resulting from xenobiotic glucocorticoid receptor expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 glucocorticoid receptor (GR) beta-lactamase antagonism assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to suppress glucocorticoid
receptor signaling, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with test
compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla

-------
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. The Glucocorticoid receptor (GR) is one of the members of the nuclear
receptor family of ligand-dependent transcription factors. GR plays a critical role in carbohydrate, protein and
lipid metabolisms and programmed cell death. CellSensor GR-bla HeLa cell line (Invitrogen) contains a beta-
lactamase reporter gene under the control of the glucocorticoid response element stably integrated into HeLa
cells, a cervical cancer cell line that expresses glucocorticoid receptor. The activation of the reporter gene under
culture conditions can be detected by fluorescence intensity. This cell line has been used to screen the Tox21
10K compound library and to identify the compounds that inhibit the GR signaling.

2.3	Experimental System: adherent HeLa cell line used. GR-bla HeLa cells are engineered to express beta-lactamase
under the control of glucocorticoid receptor. These cells stably express a -beta-lactamase reporter gene under
the transcriptional control of an upstream activator sequence (UAS).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cells should not be grown more than 80-85% confluence. The
cell performance is affected is they are more confluent. Handle 1536 well plate black clear bottom plates
carefully by sides. Cell Thawing Method: Place 14 mL of pre-warmed thaw medium into a 15 ml conical tube.
Remove the vial of cells to be thawed from liquid nitrogen and thaw rapidly by placing at 37C in a water bath
with gentle agitation for 1-2 minutes. Do not submerge vial in water. Mix the entire content of the vial to 14 ml
of pre-warmed medium and centrifuge to remove DMSO. Discard the supernatant and transfer the precipitated
cells to T175 flask using 30 ml thawing medium. Cell Proliferation Method: Detach the cells from the flask using
TrypLExpress. The cells are re-seeded in T-175 flask at 1-1.3 million. Assay Protocol: Spin down the cells after
rinsing the cells with DPBS and trypsinizing. Resuspend the pellet with assay medium. Plate the cells in black-
clear bottom 1536 well plate at 1500/well/6uL through 8 tip Multidrop plate dispenser. Incubate for 4 hrs at
37C / 99% Humidity / 5% C02. Transfer 23nL of compounds from the library collection and positive control to
the assay plates through pintool. -Add 1 uL of buffer and luL of Agonist concentration to respective columns as
per plate map. Incubate for 18 hrs at 37C / 99% Humidity / 5% C02. Add luL of CCF4 (FRET Substrate) dye using
a single tip plate dispenser (BioRAPTR). Incubate at room temperature for 2 hrs in dark. Add 3 uL of Cell Titer
Glo and Incubate at room temperature for 0.5 hrs in dark. Read on ViewLux protocol optimized for this cell type.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

tetraoctylammonium bromide
Baseline median absolute deviation for the assay (bmad): 6.485

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

-------
Response cutoff threshold used to determine hit calls: 38.908

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Glucocorticoid receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

-------
Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
674

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.182

Neutral control median absolute deviation, by plate: nmad	10.116

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-6300.31%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 537.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 800

TOX2 l_PPARg_BLA_Agonist_ch 1

1.	General Information

1.1	Assay Title: Tox21 HEK293 Peroxisome Proliferator-activated Receptor-gamma (PPARg) Agonism Beta-
lactamase Assay, Channel 1 Readout of Uncleaved Substrate

1.2	Assay Summary: TOX21_PPARg_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-
pparg-bla-agonist-pl. TOX21_PPARg_BLA_Agonist_chl is an assay readout measuring reporter gene via
receptor activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-
lactamase reporter gene. The signal is derived from the uncleaved reporter gene substrate and is used to
calculate the ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data
from the assay component TOX21_PPARg_BLA_Agonist_chl was analyzed into 1 assay endpoint. This assay
endpoint, TOX21_PPARg_BLA_Agonist_chl, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity
can be used to understand changes in the reporter gene as they relate to the gene PPARG. Furthermore, this
assay endpoint can be referred to as a secondary readout, because this assay has produced multiple assay
endpoints where this one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 17-hours prior to monitoring fluorescence emission resulting
from PPAR-delta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_PPARg_BLA_Agonist_chl was designed to measure uncleaved reporter gene substrate to
target nuclear receptor activity at the protein (receptor) level, specifically mapping to PPARG gene(s) using a
positive control of Rosiglitazone.

The Tox21 peroxisome proliferator-activated receptor gamma agonism beta-lactamase assay screened a library
of diverse environmental compounds to probe for xenobiotic ligand-binding and potential to induce PPAR-
gamma-dependent transcription, monitored through bla reporter gene signal activation using a mammalian

-------
one-hybrid GAL4 system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the
day of the assay and following 17-hour incubation of cells with test compounds a membrane-permeable FRET-
based substrate CCF4-AM is introduced and incubated for an additional hour, in the dark. Once in the cell,
cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla expression is quantified
by measuring the ratio of blue (460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals
are monitored using an Envision plate reader. Compound auto-fluorescence was monitored in various
"TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer PPAR-gamma 293H DA (Division-arrested)
cells and PPARgamma-UAS-bla 293H cells contain a peroxisome proliferator-activated receptor gamma (PPARg)
ligand-binding domain/Gal4 DNA-binding domain chimera, stably integrated into the CellSensor UAS-bla 293H
cell line. CellSensor UAS-bla 293H contains a beta-lactamase reporter gene under control of a UAS response
element stably integrated into 293H cells. These cells stably express a beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4
(DBD)-PPARg (LBD) fusion protein, the protein binds to the UAS, resulting in expression of beta-lactamase. The
HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 1. The transformation incorporated approximately 4.5 kilobases
from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid 2. HEK293 cells are popular for their ease of
growth and transfection cells and are frequently used to produce exogenous proteins or viruses for
pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: The cells should not be grown more than 80-85% confluence.
The cell performance is affected is they are more confluent. Do not leave cells in Trypsin for more than 5 min at
RT. Handle 1536 well plate black clear bottom plates carefully by sides. Cell Thawing Method: Place 14 mL of
pre-warmed thaw medium into a 15 ml conical tube. Remove the vial of cells to be thawed from liquid nitrogen
and thaw rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial
in water. Mix the entire content of the vial to 14 ml of pre-warmed medium and centrifuge to remove DMSO.
Discard the supernatant and transfer the precipitated cells to T175 flask using 30ml thawing medium. Cell
Proliferation Method: Detach the cells from the flask using TrypLExpress. The cells are re-seeded in T-175 flask
at 3-4 million. Assay Protocol: Spin down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend
the pellet with assay medium followed by filtering through cell strainer and adjust the required cell density.
Plate the cells in black-clear bottom 1536 well plate at 3000/well/6uL for agonist mode and 3000cells/well/5 ul
for antagonist mode through 8 tip Multidrop plate dispenser. Incubate for 5 hrs at 37C / 99% Humidity / 5%
C02. Transfer 23nL of compounds from the library collection and positive control to the assay plates through
pintool. Add 1 uL of buffer and luL of Agonist concentration to respective columns as per plate map for
antagonist mode. Incubate for 17 hrs at 37C / 99% Humidity / 5% C02. Add luL of CCF4 (FRET Substrate) dye
using a single tip plate dispenser (BioRAPTR). Incubate at room temperature for 1 hrs in dark. Read the
fluorescence intensity through Envision plate reader using Beta-Lactamase protocol optimized for this cell type.
Add 3 uL of Cell Titer Glo and Incubate at room temperature for 0.5 hrs in dark for antagonist mode. Read on
ViewLux protocol optimized for this cell type for antagonist mode.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:
0.00979371647509579 nM

Target (nominal) number of replicates:

3

Standard maximum concentration tested:
765.134099616858 nM

-------
Key positive control:	Neutral vehicle control:

Rosiglitazone	DMSO

Baseline median absolute deviation for the assay (bmad): 4.598
Response cutoff threshold used to determine hit calls: 27.59
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: PPAR-gamma agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Rosiglitazone was used as a positive PPARg agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series

-------
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.201

Neutral control median absolute deviation, by plate: nmad	7.368

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3646.69%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-21.504

Positive control well median absolute deviation, by plate: pmad	5.309

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-2.23

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 961.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 801

TOX21_PPARg_BLA_Agonist_ch2

1.	General Information

1.1	Assay Title: Tox21 HEK293 Peroxisome Proliferator-activated Receptor-gamma (PPARg) Agonism Beta-
lactamase Assay, Channel 2 Readout of Cleaved Substrate

1.2	Assay Summary: TOX21_PPARg_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-
pparg-bla-agonist-pl. TOX21_PPARg_BLA_Agonist_ch2 is an assay readout measuring reporter gene via
receptor activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-
lactamase reporter gene. The signal is derived from the cleaved reporter gene substrate and is used to calculate
the ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the
assay component TOX21_PPARg_BLA_Agonist_ch2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_PPARg_BLA_Agonist_ch2, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, increased activity can be used to
understand changes in the reporter gene as they relate to the gene PPARG. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 17-hours prior to monitoring fluorescence emission resulting
from PPAR-delta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_PPARg_BLA_Agonist_ch2 was designed to measure cleaved reporter gene substrate to
target nuclear receptor (non-steroidal) activity at the protein (receptor) level, specifically mapping to PPARG
gene(s) using a positive control of Rosiglitazone

The Tox21 peroxisome proliferator-activated receptor gamma agonism beta-lactamase assay screened a library
of diverse environmental compounds to probe for xenobiotic ligand-binding and potential to induce PPAR-
gamma-dependent transcription, monitored through bla reporter gene signal activation using a mammalian

-------
one-hybrid GAL4 system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the
day of the assay and following 17-hour incubation of cells with test compounds a membrane-permeable FRET-
based substrate CCF4-AM is introduced and incubated for an additional hour, in the dark. Once in the cell,
cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla expression is quantified
by measuring the ratio of blue (460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals
are monitored using an Envision plate reader. Compound auto-fluorescence was monitored in various
"TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer PPAR-gamma 293H DA (Division-arrested)
cells and PPARgamma-UAS-bla 293H cells contain a peroxisome proliferator-activated receptor gamma (PPARg)
ligand-binding domain/Gal4 DNA-binding domain chimera, stably integrated into the CellSensor UAS-bla 293H
cell line. CellSensor UAS-bla 293H contains a beta-lactamase reporter gene under control of a UAS response
element stably integrated into 293H cells. These cells stably express a beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4
(DBD)-PPARg (LBD) fusion protein, the protein binds to the UAS, resulting in expression of beta-lactamase. The
HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 1. The transformation incorporated approximately 4.5 kilobases
from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid 2. HEK293 cells are popular for their ease of
growth and transfection cells and are frequently used to produce exogenous proteins or viruses for
pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: The cells should not be grown more than 80-85% confluence.
The cell performance is affected is they are more confluent. Do not leave cells in Trypsin for more than 5 min at
RT. Handle 1536 well plate black clear bottom plates carefully by sides. Cell Thawing Method: Place 14 mL of
pre-warmed thaw medium into a 15 ml conical tube. Remove the vial of cells to be thawed from liquid nitrogen
and thaw rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial
in water. Mix the entire content of the vial to 14 ml of pre-warmed medium and centrifuge to remove DMSO.
Discard the supernatant and transfer the precipitated cells to T175 flask using 30ml thawing medium. Cell
Proliferation Method: Detach the cells from the flask using TrypLExpress. The cells are re-seeded in T-175 flask
at 3-4 million. Assay Protocol: Spin down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend
the pellet with assay medium followed by filtering through cell strainer and adjust the required cell density.
Plate the cells in black-clear bottom 1536 well plate at 3000/well/6uL for agonist mode and 3000cells/well/5 ul
for antagonist mode through 8 tip Multidrop plate dispenser. Incubate for 5 hrs at 37C / 99% Humidity / 5%
C02. Transfer 23nL of compounds from the library collection and positive control to the assay plates through
pintool. Add 1 uL of buffer and luL of Agonist concentration to respective columns as per plate map for
antagonist mode. Incubate for 17 hrs at 37C / 99% Humidity / 5% C02. Add luL of CCF4 (FRET Substrate) dye
using a single tip plate dispenser (BioRAPTR). Incubate at room temperature for 1 hrs in dark. Read the
fluorescence intensity through Envision plate reader using Beta-Lactamase protocol optimized for this cell type.
Add 3 uL of Cell Titer Glo and Incubate at room temperature for 0.5 hrs in dark for antagonist mode. Read on
ViewLux protocol optimized for this cell type for antagonist mode.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:
0.00979371647509579 nM

Target (nominal) number of replicates:

3

Standard maximum concentration tested:
765.134099616858 nM

-------
Key positive control:	Neutral vehicle control:

Rosiglitazone	DMSO

Baseline median absolute deviation for the assay (bmad): 3.583
Response cutoff threshold used to determine hit calls: 21.496
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: PPAR-gamma agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Rosiglitazone was used as a positive PPARg agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series

-------
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.168

Neutral control median absolute deviation, by plate: nmad	6.242

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3612.32%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	99.572

Positive control well median absolute deviation, by plate: pmad	14.524

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	6.268

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 862.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 802

TOX2 l_PPARg_B LA_Agon ist_ratio

1.	General Information

1.1	Assay Title: Tox21 HEK293 Peroxisome Proliferator-activated Receptor-gamma (PPARg) Agonism Beta-
lactamase Assay, Ratio

1.2	Assay Summary: TOX21_PPARg_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-
pparg-bla-agonist-pl. TOX21_PPARg_BLA_Agonist_ratio is an assay readout measuring reporter gene via
receptor activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-
lactamase reporter gene. The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) reporter gene
substrate used as the measure of target activity.. Data from the assay component
TOX21_PPARg_BLA_Agonist_ratio was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_PPARg_BLA_Agonist_ratio, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene PPARG. Furthermore, this assay endpoint
can be referred to as a primary readout, because this assay has produced multiple assay endpoints and this ratio
serves a reporter gene function to understand target activity. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the
subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 17-hours prior to monitoring fluorescence emission resulting
from PPAR-delta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_PPARg_BLA_Agonist_ratio was designed to target nuclear receptor (non-steroidal) activity
at the protein (receptor) level, specifically mapping to PPARG gene(s) using a positive control of Rosiglitazone

The Tox21 peroxisome proliferator-activated receptor gamma agonism beta-lactamase assay screened a library
of diverse environmental compounds to probe for xenobiotic ligand-binding and potential to induce PPAR-

-------
gamma-dependent transcription, monitored through bla reporter gene signal activation using a mammalian
one-hybrid GAL4 system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the
day of the assay and following 17-hour incubation of cells with test compounds a membrane-permeable FRET-
based substrate CCF4-AM is introduced and incubated for an additional hour, in the dark. Once in the cell,
cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla expression is quantified
by measuring the ratio of blue (460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals
are monitored using an Envision plate reader. Compound auto-fluorescence was monitored in various
"TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Peroxisome proliferator-activated receptor gamma (PPAR-gamma) is
a ligand-activated nuclear receptor which regulates the expression of genes involved in fatty acid-oxidation and
is a major regulator of energy homeostasis. PPAR-gamma is primarily expressed in adipose tissue, macrophages
and in the colon where it controls adipocyte differentiation, lipid storage and inflammatory responses. PPAR-
gamma agonists, the thiazolidinediones (TZDs), improve insulin sensitivity, lower glucose levels, and lower
plasma triglycerides and free fatty acid (FFA) levels by enhancing their uptake into adipocytes. The
PPARg_BLA_Agonist assay used Fluorescence Resonance Energy Transfer (FRET) substrate to generate a
ratiometric reporter response to receptor ligand-binding to allow monitoring of PPAR-gamma activity relative
to a known receptor agonist. This assay is designed to help identify environmental compounds with a capacity
for PPAR-gamma ligand-binding activity. The Tox21 PPAR-gamma bla assays are qHTS format assays which
measured the ability of a chemical to interact with PPAR-gamma by monitoring modulation of fluorescence
reporter gene signals. This assay utilized a human embryonic kidney cell line (HEK293T) which expresses PPAR-
gamma and a one-hybrid GAL4 system to quantify xenobiotic PPAR-gamma agonism.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer PPAR-gamma 293H DA (Division-arrested)
cells and PPARgamma-UAS-bla 293H cells contain a peroxisome proliferator-activated receptor gamma (PPARg)
ligand-binding domain/Gal4 DNA-binding domain chimera, stably integrated into the CellSensor UAS-bla 293H
cell line. CellSensor UAS-bla 293H contains a beta-lactamase reporter gene under control of a UAS response
element stably integrated into 293H cells. These cells stably express a beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4
(DBD)-PPARg (LBD) fusion protein, the protein binds to the UAS, resulting in expression of beta-lactamase. The
HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 1. The transformation incorporated approximately 4.5 kilobases
from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid 2. HEK293 cells are popular for their ease of
growth and transfection cells and are frequently used to produce exogenous proteins or viruses for
pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: The cells should not be grown more than 80-85% confluence.
The cell performance is affected is they are more confluent. Do not leave cells in Trypsin for more than 5 min at
RT. Handle 1536 well plate black clear bottom plates carefully by sides. Cell Thawing Method: Place 14 mL of
pre-warmed thaw medium into a 15 ml conical tube. Remove the vial of cells to be thawed from liquid nitrogen
and thaw rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial

-------
in water. Mix the entire content of the vial to 14 ml of pre-warmed medium and centrifuge to remove DMSO.
Discard the supernatant and transfer the precipitated cells to T175 flask using 30ml thawing medium. Cell
Proliferation Method: Detach the cells from the flask using TrypLExpress. The cells are re-seeded in T-175 flask
at 3-4 million. Assay Protocol: Spin down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend
the pellet with assay medium followed by filtering through cell strainer and adjust the required cell density.
Plate the cells in black-clear bottom 1536 well plate at 3000/well/6uL for agonist mode and 3000cells/well/5 ul
for antagonist mode through 8 tip Multidrop plate dispenser. Incubate for 5 hrs at 37C / 99% Humidity / 5%
C02. Transfer 23nL of compounds from the library collection and positive control to the assay plates through
pintool. Add 1 uL of buffer and luL of Agonist concentration to respective columns as per plate map for
antagonist mode. Incubate for 17 hrs at 37C / 99% Humidity / 5% C02. Add luL of CCF4 (FRET Substrate) dye
using a single tip plate dispenser (BioRAPTR). Incubate at room temperature for 1 hrs in dark. Read the
fluorescence intensity through Envision plate reader using Beta-Lactamase protocol optimized for this cell type.
Add 3 uL of Cell Titer Glo and Incubate at room temperature for 0.5 hrs in dark for antagonist mode. Read on
ViewLux protocol optimized for this cell type for antagonist mode.

Baseline median absolute deviation for the assay (bmad): 2.084

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: PPAR-gamma agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

Rosiglitazone

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

NA

-------
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Rosiglitazone was used as a positive PPARg agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag

-------
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
296

Inactive hit count: 0
-------
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-0.093

Neutral control median absolute deviation, by plate: nmad

3.487

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-3436.94%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

99.805

6.461

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

((pmed - nmed) /sqrtfpmad2 + nmad2)

13.516

NA

-------
Positive control signal-to-noise: ((pmed-nmed)/nmad)

NA

Positive control signal-to-background: (pmed/nmed)

NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed

NA

Negative control well median absolute deviation value, by plate: mmad

NA

Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

NA

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrtfmmad2 + nmad2)

NA

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

NA

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 812.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

-------
•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7.	Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 806

TOX21_AhR_LUC_Agonist

1. General Information

1.1	Assay Title: Tox21 HepG2 Aryl Hydrocarbon Receptor (AhR) Agonism Luciferase Assay

1.2	Assay Summary: TOX21 AhR LUC Agonist is a cell-based, single-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-ahr-pl.
TOX21_AhR_LUC_Agonist is one of one assay component(s) measured or calculated from the
TOX21_AhR_LUC_Agonist assay. It is designed to make measurements of luciferase induction, a form of
inducible reporter, as detected with bioluminescence signals by CALUX luciferase quantitation technology. Data
from the assay component TOX21_AhR_LUC_Agonist was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_AhR_LUC_Agonist, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene AHR. Furthermore, this assay endpoint
can be referred to as a primary readout, because the performed assay has only produced 1 assay endpoint. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the DNA binding
intended target family, where the subfamily is basic helix-loop-helix protein.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. 4000 cells in 5 uL/well were dispensed into white wall/solid bottom 1536-
well plates using a Multidrop Combi (Thermo Fisher Scientific Inc., Waltham, MA) dispenser. After the assay
plates were incubated at 37C for 5 h, 23 nL of compound or DMSO vehicle was transferred to the assay plates
by a pintool station (Kalypsys, San Diego, CA). The plates were incubated at 37C for 24 hr. After 1 ul/well of
CellTiter-Fluor reagent (Promega, Madison, Wl) was added into the assay plates using a Flying Reagent
Dispenser (FRD) (Aurora Discovery, CA), the assay plates were incubated at 37C for 30 min. The fluorescence
intensity in the plates was measured using a ViewLux plate reader (PerkinElmer, Shelton, CT), followed by the
addition of 4 uLof ONE-Glo Luciferase Assay reagent (Promega) using an FRD (Aurora Discovery). After the assay
plates were incubated at room temperature for 30 min, luminescence intensity was measured using a ViewLux
plate reader (Perkin Elmer).

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Changes to bioluminescence signals produced from the luciferase induction are indicative of
agonist activity regulated by the receptor function and kinetics for the human aryl hydrocarbon receptor
[GeneSymbokAHR | GenelD:196 | Uniprot_SwissProt_Accession:P35869],

The Tox21 aryl hydrocarbon receptor agonism luciferase assay screened a library of divers environmental
compounds to probe for xenobiotic ligand-binding and potential to induce AhR activation, monitored through
luciferase reporter gene signal activation. HepG2 (human hepatocellular carcinoma) cells were stably
transfected with an Ah receptor-responsive firefly luciferase reporter gene plasmid containing 20 dioxin
responsive elements and luciferase reporter gene. Increased luciferase activity can be used to identify the
compounds that induce AhR activation. The assay is run in triplicate on 1536-well microplate and
bioluminescence was measured following 24-hour incubation of cells with test compounds and 30 min
incubation of test system with ONE-GloTM luciferase assay reagent. The bioluminescent signal was monitored
using a Promega ViewLux plate reader. Following the incubation period, the cell culture was screened for
bioluminescent signals in agonist mode using luciferase detection technology. Compound auto-fluorescence
was monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background
artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity ortransfection efficiency. The Aryl hydrocarbon Receptor (AhR), a member of the family
of basic helix-loop-helix transcription factors, is crucial to adaptive responses to environmental changes. AhR
mediates cellular responses to environmental pollutants such as aromatic hydrocarbons through induction of
phase I and II enzymes but also interacts with other nuclear receptor signaling pathways.

2.3	Experimental System: adherent HepG2 cell line used. A cell based HepG2-AhR-luc assay (HG2L7.5cl cell line,
developed by Dr. Michael S. Denison, University of California at Davis) was used to assess the activation of AhR.
HepG2 (human hepatocellular carcinoma) cells were stably transfected with an Ah receptor-responsive firefly
luciferase reporter gene plasmid containing 20 dioxin responsive elements and luciferase reporter gene.

2.4	Metabolic Competence: The parental HepG2 cell line has been shown by others to retain the potential for Phase
I and Phase II metabolic responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A,
2E1, and 3A4/5 with CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human
hepatocytes although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have
even been observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2
cells include SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7).
In addition, HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53
protein (Boehme et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant
response element (ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a
number of ATP-binding cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound
proteins also regulated in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Cell Media Required: Growth Medium (90% Alpha MEM, 10% Premium FBS, 400mg/mL
G418), Assay (90% Alpha MEM, 10% Premium FBS), Thaw (90% Alpha MEM, 10% Premium FBS), Freezing (100%
Recovery cell culture freezing medium). Thawing method: Place 14mLof pre-warmed thaw media intoT75 flask.
Remove vial of cells to be thawed from -140 and thaw rapidly by placing in water bath with gentle agitation for
l-2min. Wipe vial with 70% ethanol before opening in biological safety cabinet. Transfer vial contents dropwise
into lOmLof thaw medium in 15mL conical tube. Centrifuge cells at 1000 rpm for4min. Resuspend and transfer
contents into T75 flask containing thaw medium and transfer flask into incubator. Switch to growth media at

-------
first passage. Cell Proliferation Method: Aspirate media, rinse once with dPBS, add 0.25% trypsin/EDTA and
swirl to coat flask evenly. Add equal volume of growth medium to inactivate trypsin after 2-3 minutes
incubation. Centrifuge cells at 1000 RPM for 4min and resuspend in growth medium before adding to new flask.
Cells should be passaged or fed at least twice per week. Assay Protocol: Harvest cells from culture in growth
medium and resuspend in assay medium. Dispense 4000 cells per well into 1536-well tissue culture treated
white solid bottom plate using multidrop dispenser. Incubate cells 5hr, then dispense 23nL of compound,
positive control, or DMSO control using pintool. Positive and control compounds are located in the first four
columns according to the plate map, and library compounds located in columns 5-48. Incubate plates for 19hr
at 37C. Add luL of CellTiter fluor to each well using BioRAPTR dispenser. Incubate 30min at 37C. Measure
fluorescence using ViewLux. Add 5uLof Onego using BioRAPTR dispenser. Incubate 30min at room temperature.
Read luminescence on ViewLux.

Baseline median absolute deviation for the assay (bmad): 1.21

Response cutoff threshold used to determine hit calls: 20

Detection technology used: CALUX luciferase quantitation (Luminescence)

2.6	Response: Activation of the aryl hydrocarbon receptor (Ahr) signaling pathway is measured by bioluminescence
activity via an Ahr receptor-responsive firefly luciferase reporter gene plasmid containing 20 dioxin responsive
elements and luciferase reporter gene. Increased luciferase activity can be used to identify the compounds that
induce AhR activation. The cytotoxicity of the compounds screened against the HepG2-AhR-luc cell line was
tested in parallel by measuring the cell viability using CellTiter-Fluor assay in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

Omeprazole

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

NA

-------
3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Increased luminescence (gain-of-signal) was measured relative to omeprazole (positive control)
signal, using DMSO (neutral control) as a baseline for luciferase induction. Response was reported as a percent
of positive control activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were

-------
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.035

Neutral control median absolute deviation, by plate: nmad	1.783

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-4878.73%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	99.468

Positive control well median absolute deviation, by plate: pmad	20.862

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	4.703

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

-------
((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 593.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: He G, Tsutsumi T, Zhao B, Baston DS, Zhao J, Heath-Pagliuso S, Denison MS. Third-generation Ah
receptor-responsive luciferase reporter plasmids: amplification of dioxin-responsive elements dramatically
increases CALUX bioassay sensitivity and responsiveness. Toxicol Sci. 2011 Oct;123(2):511-22. doi:
10.1093/toxsci/kfrl89. Epub 2011 Jul 20. PubMed PMID: 21775728; PubMed Central PMCID: PMC3179681.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is

-------
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 807

T0X2 l_Ah R_LU C_Ago n ist_vi a b i I ity

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HepG2 Aryl Hydrocarbon Receptor (AhR) Agonism Luciferase Assay

1.2	Assay Summary: TOX21 AhR LUC Agonist is a cell-based, single-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-ahr-pl.
TOX21_AhR_LUC_Agonist_viability is an assay readout measuring cellular ATP content and detected with
CellTiter-Glo Luciferase-coupled ATP quantitation. TOX21_AhR_LUC_Agonist_viability used a type of viability
reporter where loss-of-signal activity can be used to understand changes in the cell viability. Furthermore, this
assay endpoint can be referred to as a secondary readout, because this assay has produced multiple assay
endpoints where this one serves a viability function. To generalize the intended target to other relatable targets,
this assay endpoint is annotated to the cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. 4000 cells in 5 uL/well were dispensed into white wall/solid bottom 1536-
well plates using a Multidrop Combi (Thermo Fisher Scientific Inc., Waltham, MA) dispenser. After the assay
plates were incubated at 37C for 5 h, 23 nL of compound or DMSO vehicle was transferred to the assay plates
by a pintool station (Kalypsys, San Diego, CA). The plates were incubated at 37C for 24 hr. For cell viability
measurement, 1 ul/well of CellTiter-Fluor reagent (Promega, Madison, Wl) was added into the assay plates using
a Flying Reagent Dispenser (FRD) (Aurora Discovery, CA),and then the assay plates were incubated at 37C for 30
min. The fluorescence intensity in the plates was measured using a ViewLux plate reader (PerkinElmer, Shelton,
CT), followed by the addition of 4 uL of ONE-Glo Luciferase Assay reagent (Promega) using an FRD (Aurora
Discovery). After the assay plates were incubated at room temperature for 30 min, luminescence intensity was
measured using a ViewLux plate reader (Perkin Elmer).

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

-------
The Tox21 aryl hydrocarbon receptor agonism luciferase assay screened a library of divers environmental
compounds to probe for xenobiotic ligand-binding and potential to induce AhR activation, monitored through
luciferase reporter gene signal activation. HepG2 (human hepatocellular carcinoma) cells were stably
transfected with an Ah receptor-responsive firefly luciferase reporter gene plasmid containing 20 dioxin
responsive elements and luciferase reporter gene. Increased luciferase activity can be used to identify the
compounds that induce AhR activation. The assay is run in triplicate on 1536-well microplate and
bioluminescence was measured following 24-hour incubation of cells with test compounds and 30 min
incubation of test system with ONE-GloTM luciferase assay reagent. The bioluminescent signal was monitored
using a Promega ViewLux plate reader. Following the incubation period, the cell culture was screened for
bioluminescent signals in agonist mode using luciferase detection technology. Compound auto-fluorescence
was monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background
artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. The Aryl hydrocarbon Receptor (AhR), a member of the family
of basic helix-loop-helix transcription factors, is crucial to adaptive responses to environmental changes. AhR
mediates cellular responses to environmental pollutants such as aromatic hydrocarbons through induction of
phase I and II enzymes but also interacts with other nuclear receptor signaling pathways.

2.3	Experimental System: adherent HepG2 cell line used. A cell based HepG2-AhR-luc assay (HG2L7.5cl cell line,
developed by Dr. Michael S. Denison, University of California at Davis) was used to assess the activation of AhR.
HepG2 (human hepatocellular carcinoma) cells were stably transfected with an Ah receptor-responsive firefly
luciferase reporter gene plasmid containing 20 dioxin responsive elements and luciferase reporter gene.

2.4	Metabolic Competence: The parental HepG2 cell line has been shown by others to retain the potential for Phase
I and Phase II metabolic responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A,
2E1, and 3A4/5 with CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human
hepatocytes although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have
even been observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2
cells include SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7).
In addition, HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53
protein (Boehme et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant
response element (ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a
number of ATP-binding cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound
proteins also regulated in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Cell Media Required: Growth Medium (90% Alpha MEM, 10% Premium FBS, 400mg/mL
G418), Assay (90% Alpha MEM, 10% Premium FBS), Thaw (90% Alpha MEM, 10% Premium FBS), Freezing (100%
Recovery cell culture freezing medium). Thawing method: Place 14mLof pre-warmed thaw media intoT75 flask.
Remove vial of cells to be thawed from -140 and thaw rapidly by placing in water bath with gentle agitation for
l-2min. Wipe vial with 70% ethanol before opening in biological safety cabinet. Transfer vial contents dropwise
into lOmLof thaw medium in 15mL conical tube. Centrifuge cells at 1000 rpm for4min. Resuspend and transfer
contents into T75 flask containing thaw medium and transfer flask into incubator. Switch to growth media at
first passage. Cell Proliferation Method: Aspirate media, rinse once with dPBS, add 0.25% trypsin/EDTA and
swirl to coat flask evenly. Add equal volume of growth medium to inactivate trypsin after 2-3 minutes
incubation. Centrifuge cells at 1000 RPM for 4min and resuspend in growth medium before adding to new flask.
Cells should be passaged or fed at least twice per week. Assay Protocol: Harvest cells from culture in growth

-------
medium and resuspend in assay medium. Dispense 4000 cells per well into 1536-well tissue culture treated
white solid bottom plate using multidrop dispenser. Incubate cells 5hr, then dispense 23nL of compound,
positive control, or DMSO control using pintool. Positive and control compounds are located in the first four
columns according to the plate map, and library compounds located in columns 5-48. Incubate plates for 19hr
at 37C. Add luL of CellTiter fluor to each well using BioRAPTR dispenser. Incubate 30min at 37C. Measure
fluorescence using ViewLux. Add 5uLof Onego using BioRAPTR dispenser. Incubate 30min at room temperature.
Read luminescence on ViewLux.

Baseline median absolute deviation for the assay (bmad): 4.672
Response cutoff threshold used to determine hit calls: 28.033

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Activation of the aryl hydrocarbon receptor (Ahr) signaling pathway is measured by bioluminescence
activity via an Ahr receptor-responsive firefly luciferase reporter gene plasmid containing 20 dioxin responsive
elements and luciferase reporter gene. Increased luciferase activity can be used to identify the compounds that
induce AhR activation. The cytotoxicity of the compounds screened against the HepG2-AhR-luc cell line was
tested in parallel by measuring the cell viability using CellTiter-Fluor assay in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

-------
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested

-------
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.147

Neutral control median absolute deviation, by plate: nmad	7.392

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-5029.82%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-18.316

Positive control well median absolute deviation, by plate: pmad	10.367

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.324

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

-------
((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 825.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: He G, Tsutsumi T, Zhao B, Baston DS, Zhao J, Heath-Pagliuso S, Denison MS. Third-generation Ah
receptor-responsive luciferase reporter plasmids: amplification of dioxin-responsive elements dramatically
increases CALUX bioassay sensitivity and responsiveness. Toxicol Sci. 2011 Oct;123(2):511-22. doi:
10.1093/toxsci/kfrl89. Epub 2011 Jul 20. PubMed PMID: 21775728; PubMed Central PMCID: PMC3179681.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is

-------
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1108

TOX21_ARE_BLA_Agonist_chl

1.	General Information

1.1	Assay Title: Tox21 HepG2 Nuclear Erythroid 2-Related factor 2/Antioxidant Response Element (Nrf2/ARE)
Agonism Beta-lactamase Assa, Channel 1 Readout of Uncleaved Substrate

1.2	Assay Summary: TOX21_ARE_BLA_Agonist is a cell-based, single-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-are-bla-pl.
TOX21_ARE_BLA_Agonist_chl is an assay readout measuring reporter gene via transcription factor activity and
designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter gene.
The signal is derived from the uncleaved reporter gene substrate and is used to calculate the ratio of cleaved
(ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay component
TOX21_ARE_BLA_Agonist_chl was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_ARE_BLA_Agonist_chl, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene NFE2L2. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. ARE-bla HepG2 cells are dispensed at 2000 cells/5 uL well in 1536-well
black, clear-bottom plates and incubated for 6 hours at 37C. Compounds are plated using a Wako Pintool station
and incubated for 16 hours. After a 16 hour incubation, 1 uLof LiveBLAzer (Life Technologies) detection mix was
added to each well and the plates are subsequently incubated at RT for 2 h.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ARE_BLA_Agonist_chl was designed to measure uncleaved reporter gene substrate to
target transcription factor activity, specifically mapping to NFE2L2 gene(s) using a positive control of Beta-
Naphthoflavone

The Tox21 antioxidant response element agonism beta-lactamase assay screened a library of diverse
environmental compounds to identify agonists that induce oxidative stress, monitored through bla reporter

-------
gene signal activation using a mammalian one-hybrid GAL4 system. The assay is run in triplicate on 1536-well
microplates. HepG2 cells are plated the day of the assay and following 16 hour incubation of cells with test
compounds a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (460nm, product) to green (530nm, substrate)
fluorescence. Fluorescence signals are monitored using an Envision plate reader. Each compound was tested in
a concentration-response format, using 11 concentrations ranging from 1.6 nM to 92 uM. Concentration-
response relationships were determined by monitoring FRET signals relative to DMSO-only exposures which
provided a signal baseline, and to a known antioxidant response element agonist (Beta-Naphthoflavone) as a
positive control which provided a reference for 100 percent androgen receptor inhibition. Compound auto-
fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for
background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice.

2.4	Metabolic Competence: The parental HepG2 cell line has been shown by others to retain the potential for Phase
I and Phase II metabolic responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A,
2E1, and 3A4/5 with CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human
hepatocytes although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have
even been observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2
cells include SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7).
In addition, HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53
protein (Boehme et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant
response element (ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a
number of ATP-binding cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound
proteins also regulated in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Quality Control Precautions: Cells should be grown to reach 60 to 75% confluence. Handle
the 1536-well, black-wall, clear-bottom assay plate by the sides; do not touch the clear bottom of the assay
plate. Cell Media Required: Growth (90% DMEM with GlutaMAX, 10% Dialyzed FBS, 0.1 mM NEAA, 25mM
HEPES, lOOU/mL/lOOug/mL Penicillin/Streptomycin, 5 ug/mL Blasticidin), Assay (99% DMEM with GlutaMAX,
10% Dialyzed FBS, 0.1 mM NEAA, 25mM HEPES, lOOU/mL/lOOug/mL Penicillin/Streptomycin), Thaw (90%
DMEM with GlutaMAX, 1% Dialyzed FBS, 0.1 mM NEAA, 25mM HEPES, lOOU/mL/lOOug/mL
Penicillin/Streptomycin), Freezing (100% Recovery Cell freezing medium) Cell Thawing Method: Place 14 mL of
pre-warmed thaw medium into a T75 flask. Remove the vial of cells to be thawed from liquid nitrogen and thaw
rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial in water.
Decontaminate the vial by wiping with 70% ethanol before opening in a biological safety cabinet. Transfer the
vial contents drop-wise into 10 mL of Thaw Medium in a sterile 15-mL conical tub. Centrifuge cells at 900 rpm
for 4 minutes and resuspend in thaw medium. Transfer contents to the T75 tissue culture flask containing Thaw
Medium and place flask in a humidified 37C/5% C02 incubator. Switch to growth medium at first passage. Cell
Proliferation Method: Aspirate medium, rinse once in DPBS, add 0.25% Trypsin/EDTA and swirl to coat the cell
evenly. Add an equal volume of Growth Medium to inactivate Trypsin after 2-3 minutes incubation at 37C.
Centrifuge cells at 900 rpm for 4 minutes and resuspend in growth medium. Cell should be passage at least twice
a week. Assay Protocol: Harvest cells from culture in growth medium and resuspend in assay medium. Dispense
2000 cells/5uL/well into 1536-well tissue treated black/clear bottom plates using a BioRAPTR dispenser. After
the cells were incubated at 37C for 5 hours, 23 nL of positive controls or compounds were transferred to the

-------
assay plate by a PinTool resulting in a 217-fold dilution. The final compound concentration in the 5 ul assay
volume ranged from 1.2 nM to 92 uM in 15 concentrations. Incubate the plates for 16 hours at 37C. Add 1 uL of
6X LiveBLAzer-FRET B/G (CCF4-AM) Substrate Mixture to each well using a BioRAPTR dispenser and incubate
the plate at room temperature for 2 hours. Measure fluorescence intensity at 460 and 530 nm emission and
405 nm excitation by an Envision detector. Data is expressed as the ratio of 460nm/530nm emissions.

Baseline median absolute deviation for the assay (bmad): 2.855

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Antioxidant response element (ARE) signaling pathway agonism is monitored by FRET emission
resulting from GAL4/-beta-lactamase gene expression. The cytotoxicity of the compounds screened was
measured in parallel by measuring the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation
in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

Beta-Naphthoflavone

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

NA

-------
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Beta-Naphthoflavone was used as a positive ARE agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root

-------
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.167

Neutral control median absolute deviation, by plate: nmad	4.865

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-2860.47%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-25.044

Positive control well median absolute deviation, by plate: pmad	3.756

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-3.922

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

-------
(1 - ((3 * (mmad + nmad)) / abs(mmed - rimed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 665.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Xia M, Huang R, Shi Q, Boyd WA, Zhao J, Sun N, Rice J R, Dunlap PE, Hackstadt AJ, Bridge MF, Smith
MV, Dai S, Zheng W, Chu PH, Gerhold D, Witt KL, DeVito M, Freedman JH, Austin CP, Houck KA, Thomas RS,
Paules RS, Tice RR, Simeonov A. Comprehensive Analyses and Prioritization of Tox21 10K Chemicals Affecting
Mitochondrial Function by in-Depth Mechanistic Studies. Environ Health Perspect. 2018 Jul 26;126(7):077010.
doi: 10.1289/EHP2589. PMID: 30059008; PMCID: PMC6112376.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1109

TOX21_ARE_BLA_Agonist_ch2

1.	General Information

1.1	Assay Title: Tox21 HepG2 Nuclear Erythroid 2-Related factor 2/Antioxidant Response Element (Nrf2/ARE)
Agonism Beta-lactamase Assay, Channel 2 Readout of Cleaved Substrate

1.2	Assay Summary: TOX21_ARE_BLA_Agonist is a cell-based, single-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-are-bla-pl.
TOX21_ARE_BLA_Agonist_ch2 is an assay readout measuring reporter gene via receptor activity and designed
using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter gene. The signal
is derived from the cleaved reporter gene substrate and is used to calculate the ratio of cleaved (ch2) to
uncleaved (chl) substrate used as the measure of target activity. Data from the assay component
TOX21_ARE_BLA_Agonist_ch2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_ARE_BLA_Agonist_ch2, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, increased activity can be used to
understand changes in the reporter gene as they relate to the gene NFE2L2. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. ARE-bla HepG2 cells are dispensed at 2000 cells/5 uL well in 1536-well
black, clear-bottom plates and incubated for 6 hours at 37C. Compounds are plated using a Wako Pintool station
and incubated for 16 hours. After a 16 hour incubation, 1 uLof LiveBLAzer (Life Technologies) detection mix was
added to each well and the plates are subsequently incubated at RT for 2 h.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ARE_BLA_Agonist_ch2 was designed to measure cleaved reporter gene substrate to target
transcription factor activity, specifically mapping to NFE2L2 gene(s) using a positive control of Beta-
Naphthoflavone

The Tox21 antioxidant response element agonism beta-lactamase assay screened a library of diverse
environmental compounds to identify agonists that induce oxidative stress, monitored through bla reporter

-------
gene signal activation using a mammalian one-hybrid GAL4 system. The assay is run in triplicate on 1536-well
microplates. HepG2 cells are plated the day of the assay and following 16 hour incubation of cells with test
compounds a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (460nm, product) to green (530nm, substrate)
fluorescence. Fluorescence signals are monitored using an Envision plate reader. Each compound was tested in
a concentration-response format, using 11 concentrations ranging from 1.6 nM to 92 uM. Concentration-
response relationships were determined by monitoring FRET signals relative to DMSO-only exposures which
provided a signal baseline, and to a known antioxidant response element agonist (Beta-Naphthoflavone) as a
positive control which provided a reference for 100 percent androgen receptor inhibition. Compound auto-
fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for
background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice.

2.4	Metabolic Competence: The parental HepG2 cell line has been shown by others to retain the potential for Phase
I and Phase II metabolic responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A,
2E1, and 3A4/5 with CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human
hepatocytes although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have
even been observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2
cells include SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7).
In addition, HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53
protein (Boehme et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant
response element (ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a
number of ATP-binding cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound
proteins also regulated in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Quality Control Precautions: Cells should be grown to reach 60 to 75% confluence. Handle
the 1536-well, black-wall, clear-bottom assay plate by the sides; do not touch the clear bottom of the assay
plate. Cell Media Required: Growth (90% DMEM with GlutaMAX, 10% Dialyzed FBS, 0.1 mM NEAA, 25mM
HEPES, lOOU/mL/lOOug/mL Penicillin/Streptomycin, 5 ug/mL Blasticidin), Assay (99% DMEM with GlutaMAX,
10% Dialyzed FBS, 0.1 mM NEAA, 25mM HEPES, lOOU/mL/lOOug/mL Penicillin/Streptomycin), Thaw (90%
DMEM with GlutaMAX, 1% Dialyzed FBS, 0.1 mM NEAA, 25mM HEPES, lOOU/mL/lOOug/mL
Penicillin/Streptomycin), Freezing (100% Recovery Cell freezing medium) Cell Thawing Method: Place 14 mL of
pre-warmed thaw medium into a T75 flask. Remove the vial of cells to be thawed from liquid nitrogen and thaw
rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial in water.
Decontaminate the vial by wiping with 70% ethanol before opening in a biological safety cabinet. Transfer the
vial contents drop-wise into 10 mL of Thaw Medium in a sterile 15-mL conical tub. Centrifuge cells at 900 rpm
for 4 minutes and resuspend in thaw medium. Transfer contents to the T75 tissue culture flask containing Thaw
Medium and place flask in a humidified 37C/5% C02 incubator. Switch to growth medium at first passage. Cell
Proliferation Method: Aspirate medium, rinse once in DPBS, add 0.25% Trypsin/EDTA and swirl to coat the cell
evenly. Add an equal volume of Growth Medium to inactivate Trypsin after 2-3 minutes incubation at 37C.
Centrifuge cells at 900 rpm for 4 minutes and resuspend in growth medium. Cell should be passage at least twice
a week. Assay Protocol: Harvest cells from culture in growth medium and resuspend in assay medium. Dispense
2000 cells/5uL/well into 1536-well tissue treated black/clear bottom plates using a BioRAPTR dispenser. After
the cells were incubated at 37C for 5 hours, 23 nL of positive controls or compounds were transferred to the

-------
assay plate by a PinTool resulting in a 217-fold dilution. The final compound concentration in the 5 ul assay
volume ranged from 1.2 nM to 92 uM in 15 concentrations. Incubate the plates for 16 hours at 37C. Add 1 uL of
6X LiveBLAzer-FRET B/G (CCF4-AM) Substrate Mixture to each well using a BioRAPTR dispenser and incubate
the plate at room temperature for 2 hours. Measure fluorescence intensity at 460 and 530 nm emission and
405 nm excitation by an Envision detector. Data is expressed as the ratio of 460nm/530nm emissions.

Baseline median absolute deviation for the assay (bmad): 5.635
Response cutoff threshold used to determine hit calls: 33.812
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Antioxidant response element (ARE) signaling pathway agonism is monitored by FRET emission
resulting from GAL4/-beta-lactamase gene expression. The cytotoxicity of the compounds screened was
measured in parallel by measuring the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation
in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

Beta-Naphthoflavone

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

NA

-------
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Beta-Naphthoflavone was used as a positive ARE agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root

-------
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.339

Neutral control median absolute deviation, by plate: nmad	9.609

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-2880.01%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	99.471

Positive control well median absolute deviation, by plate: pmad	18.433

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	4.657

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

-------
(1 - ((3 * (mmad + nmad)) / abs(mmed - rimed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 744.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Xia M, Huang R, Shi Q, Boyd WA, Zhao J, Sun N, Rice J R, Dunlap PE, Hackstadt AJ, Bridge MF, Smith
MV, Dai S, Zheng W, Chu PH, Gerhold D, Witt KL, DeVito M, Freedman JH, Austin CP, Houck KA, Thomas RS,
Paules RS, Tice RR, Simeonov A. Comprehensive Analyses and Prioritization of Tox21 10K Chemicals Affecting
Mitochondrial Function by in-Depth Mechanistic Studies. Environ Health Perspect. 2018 Jul 26;126(7):077010.
doi: 10.1289/EHP2589. PMID: 30059008; PMCID: PMC6112376.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1110

TOX21_ARE_BLA_Agonist_ratio

1.	General Information

1.1	Assay Title: Tox21 HepG2 Nuclear Erythroid 2-Related factor 2/Antioxidant Response Element (Nrf2/ARE)
Agonism Beta-lactamase Assay, Ratio

1.2	Assay Summary: TOX21_ARE_BLA_Agonist is a cell-based, single-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-are-bla-pl.
TOX21_ARE_BLA_Agonist_ratio is an assay readout measuring reporter gene via receptor activity and designed
using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter gene. The signal
is derived from the ratio of cleaved (ch2) to uncleaved (chl) reporter gene substrate used as the measure of
target activity. Data from the assay component TOX21_ARE_BLA_Agonist_ratio was analyzed into 1 assay
endpoint. This assay endpoint, TOX21_ARE_BLA_Agonist_ratio, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter,
gain-of-signal activity can be used to understand changes in the reporter gene as they relate to the gene NFE2L2.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints and this ratio serves a reporter gene function to understand target activity. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the DNA binding
intended target family, where the subfamily is basic leucine zipper.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. ARE-bla HepG2 cells are dispensed at 2000 cells/5 uL well in 1536-well
black, clear-bottom plates and incubated for 6 hours at 37C. Compounds are plated using a Wako Pintool station
and incubated for 16 hours. After a 16 hour incubation, 1 uLof LiveBLAzer (Life Technologies) detection mix was
added to each well and the plates are subsequently incubated at RT for 2 h.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ARE_BLA_Agonist_ratio was designed to target transcription factor activity, specifically
mapping to NFE2L2 gene(s) using a positive control of Beta-Naphthoflavone

The Tox21 antioxidant response element agonism beta-lactamase assay screened a library of diverse
environmental compounds to identify agonists that induce oxidative stress, monitored through bla reporter

-------
gene signal activation using a mammalian one-hybrid GAL4 system. The assay is run in triplicate on 1536-well
microplates. HepG2 cells are plated the day of the assay and following 16 hour incubation of cells with test
compounds a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (460nm, product) to green (530nm, substrate)
fluorescence. Fluorescence signals are monitored using an Envision plate reader. Each compound was tested in
a concentration-response format, using 11 concentrations ranging from 1.6 nM to 92 uM. Concentration-
response relationships were determined by monitoring FRET signals relative to DMSO-only exposures which
provided a signal baseline, and to a known antioxidant response element agonist (Beta-Naphthoflavone) as a
positive control which provided a reference for 100 percent androgen receptor inhibition. Compound auto-
fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for
background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Oxidative stress has been implicated in the pathogenesis of a variety
of diseases ranging from cancer to neurodegeneration. The antioxidant response element (ARE) signaling
pathway plays an important role in the amelioration of oxidative stress. The CellSensor ARE-bla HepG2 cell line
(Invitrogen) can be used for analyzing the Nrf2/antioxidant response signaling pathway. Nrf2 (NF-E2-related
factor 2) and Nrfl are transcription factors that bind to AREs and activate these genes.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice.

2.4	Metabolic Competence: The parental HepG2 cell line has been shown by others to retain the potential for Phase
I and Phase II metabolic responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A,
2E1, and 3A4/5 with CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human
hepatocytes although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have
even been observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2
cells include SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7).
In addition, HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53
protein (Boehme et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant
response element (ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a
number of ATP-binding cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound
proteins also regulated in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Quality Control Precautions: Cells should be grown to reach 60 to 75% confluence. Handle
the 1536-well, black-wall, clear-bottom assay plate by the sides; do not touch the clear bottom of the assay
plate. Cell Media Required: Growth (90% DMEM with GlutaMAX, 10% Dialyzed FBS, 0.1 mM NEAA, 25mM
HEPES, lOOU/mL/lOOug/mL Penicillin/Streptomycin, 5 ug/mL Blasticidin), Assay (99% DMEM with GlutaMAX,
10% Dialyzed FBS, 0.1 mM NEAA, 25mM HEPES, lOOU/mL/lOOug/mL Penicillin/Streptomycin), Thaw (90%
DMEM with GlutaMAX, 1% Dialyzed FBS, 0.1 mM NEAA, 25mM HEPES, lOOU/mL/lOOug/mL
Penicillin/Streptomycin), Freezing (100% Recovery Cell freezing medium) Cell Thawing Method: Place 14 mL of
pre-warmed thaw medium into a T75 flask. Remove the vial of cells to be thawed from liquid nitrogen and thaw

-------
rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial in water.
Decontaminate the vial by wiping with 70% ethanol before opening in a biological safety cabinet. Transfer the
vial contents drop-wise into 10 mL of Thaw Medium in a sterile 15-mL conical tub. Centrifuge cells at 900 rpm
for 4 minutes and resuspend in thaw medium. Transfer contents to the T75 tissue culture flask containing Thaw
Medium and place flask in a humidified 37C/5% C02 incubator. Switch to growth medium at first passage. Cell
Proliferation Method: Aspirate medium, rinse once in DPBS, add 0.25% Trypsin/EDTA and swirl to coat the cell
evenly. Add an equal volume of Growth Medium to inactivate Trypsin after 2-3 minutes incubation at 37C.
Centrifuge cells at 900 rpm for 4 minutes and resuspend in growth medium. Cell should be passage at least twice
a week. Assay Protocol: Harvest cells from culture in growth medium and resuspend in assay medium. Dispense
2000 cells/5uL/well into 1536-well tissue treated black/clear bottom plates using a BioRAPTR dispenser. After
the cells were incubated at 37C for 5 hours, 23 nL of positive controls or compounds were transferred to the
assay plate by a PinTool resulting in a 217-fold dilution. The final compound concentration in the 5 ul assay
volume ranged from 1.2 nM to 92 uM in 15 concentrations. Incubate the plates for 16 hours at 37C. Add 1 uL of
6X LiveBLAzer-FRET B/G (CCF4-AM) Substrate Mixture to each well using a BioRAPTR dispenser and incubate
the plate at room temperature for 2 hours. Measure fluorescence intensity at 460 and 530 nm emission and
405 nm excitation by an Envision detector. Data is expressed as the ratio of 460nm/530nm emissions.

Baseline median absolute deviation for the assay (bmad): 2.51

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Antioxidant response element (ARE) signaling pathway agonism is monitored by FRET emission
resulting from GAL4/-beta-lactamase gene expression. The cytotoxicity of the compounds screened was
measured in parallel by measuring the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation
in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

Beta-Naphthoflavone

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

NA

-------
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Beta-Naphthoflavone was used as a positive ARE agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning

-------
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1673

Inactive hit count: 0
-------
For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-0.16

Neutral control median absolute deviation, by plate: nmad

4.932

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-2728.91%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

99.845

5.663

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

NA

-------
Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	13.078

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 793.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

• Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

-------
•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Xia M, Huang R, Shi Q, Boyd WA, Zhao J, Sun N, Rice J R, Dunlap PE, Hackstadt AJ, Bridge MF, Smith
MV, Dai S, Zheng W, Chu PH, Gerhold D, Witt KL, DeVito M, Freedman JH, Austin CP, Houck KA, Thomas RS,
Paules RS, Tice RR, Simeonov A. Comprehensive Analyses and Prioritization of Tox21 10K Chemicals Affecting
Mitochondrial Function by in-Depth Mechanistic Studies. Environ Health Perspect. 2018 Jul 26;126(7):077010.
doi: 10.1289/EHP2589. PMID: 30059008; PMCID: PMC6112376.

7.	Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1117

TOX2 l_FXR_BLA_Agonist_ch 1

1.	General Information

1.1	Assay Title: Tox21 HEK293T Farnesoid-X-Receptor (FXR) Agonism Beta-lactamase Assay, Channel 1 Readout of
Uncleaved Substrate

1.2	Assay Summary: TOX21_FXR_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-fxr-
bla-agonist-p2. TOX21_FXR_BLA_Agonist_chl is an assay readout measuring reporter gene via receptor activity
and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter
gene. The signal is derived from the uncleaved reporter gene substrate and is used to calculate the ratio of
cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay
component TOX21_FXR_BLA_Agonist_chl was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_FXR_BLA_Agonist_chl, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene NR1H4. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic FXR target gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_FXR_BLA_Agonist_chl was designed to measure uncleaved reporter gene substrate to
target nuclear receptor activity at the protein (receptor) level, specifically mapping to NR1H4 gene(s) using a
positive control of Chenodeoxycholic acid

The Tox21 farnesoid-X-receptor (FXR) beta-lactamase agonism assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to induce farnesoid-X-receptor
signaling, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4 system.

-------
The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with test
compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. The FXR-HEK293 line is a stably transfected cell line
containing an intact FXR signaling pathway. FXR multiple hormone response element (MHRE) reporters were
introduced into HEK293 cells that express endogenous FXR. The HEK-293 cell line is a human embryonic kidney
cell line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973 (Graham
et al. 1977). The transformation incorporated approximately 4.5 kilobases from the viral genome into human
chromosome 19 of the HEK cells, and subsequent cytogenetic characterization established that the 293 line is
pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of growth and transfection cells and
are frequently used to produce exogenous proteins or viruses for pharmaceutical and biomedical research
purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cells should be grown to reach 80 to 90% confluence. Cell
Thawing Method: Place 14 mL of pre-warmed thaw medium into a 15 mL of conical tub. Remove the vial of cells
to be thawed from liquid nitrogen and thaw rapidly by placing at 37C in a water bath with gentle agitation for
1-2 minutes. Decontaminate the vial by wiping with 70% ethanol before opening in a biological safety cabinet.
Transfer the vial contents drop-wise into 14 mL of thaw medium in a sterile 15-mL conical tub. Centrifuge cells
at 900 rpm for 4 minutes and resuspend in thaw medium. Transfer contents to the T75 tissue culture flask
containing Thaw Medium and place flask in a humidified 37C/5% C02 incubator. Switch to growth medium at
first passage. Cell Proliferation Method: Aspirate medium, rinse once in DPBS, add 0.05% Trypsin/EDTA and
swirl to coat the cell evenly. Add an equal volume of Growth Medium to inactivate Trypsin after 2-3 minutes
incubation at 37C. Centrifuge cells at 900 rpm for 4 minutes and resuspend in growth medium. Cell should be
passage at least twice a week. Assay Protocol: Harvest cells from culture and resuspend in assay medium.
Dispense 5000 cells/5uL/well into 1536-well tissue treated black, clear-bottom plates using a Multi-drop
dispenser. After the cells were incubated at 37C for 5 hours, 23 nL of control or compounds dissolved in DMSO
were transferred to the assay plate by a PinTool resulting in a 217-fold dilution. Incubate the plates for 16 hours
at 37C. Add 1 uL of 6X LiveBLAzer-FRET B/G (CCF4-AM). Substrate Mixture to each well using a BioRAPTR
dispenser. After two hours incubation at room temperature, measure fluorescence intensity at 460 and 530 nm
emission and 405 nm excitation by an Envision detector. Data is expressed as the ratio of 460nm/530nm
emissions.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

Chenodeoxycholic acid

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 5.347
Response cutoff threshold used to determine hit calls: 32.085

-------
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Farnesoid-X-Receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Chenodeoxycholic acid was used as a positive FXR agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must

-------
occur through the database and should come from the available list of methods for each processing level.

Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315

Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
150

Inactive hit count: 0
-------
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.393

Neutral control median absolute deviation, by plate: nmad	10.674

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-2704.06%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-10.701

Positive control well median absolute deviation, by plate: pmad	8.417

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.703

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 436.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Hsu CW, Zhao J, Huang R, Hsieh JH, Hamm J,
Chang X, Houck K, Xia M. Quantitative high-throughput profiling of environmental chemicals and drugs that
modulate farnesoid X receptor. Sci Rep. 2014 Sep 26;4:6437. doi: 10.1038/srep06437. PMID: 25257666; PMCID:
PMC4894417.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1118

TOX21_FXR_BlA_Agonist_ch2

1.	General Information

1.1	Assay Title: Tox21 HEK293T Farnesoid-X-Receptor (FXR) Agonism Beta-lactamase Assay, Channel 2 Readout of
Cleaved Substrate

1.2	Assay Summary: TOX21_FXR_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-fxr-
bla-agonist-p2. TOX21_FXR_BLA_Agonist_ch2 is an assay readout measuring reporter gene via receptor activity
and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter
gene. The signal is derived from the cleaved reporter gene substrate and is used to calculate the ratio of cleaved
(ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay component
TOX21_FXR_BLA_Agonist_ch2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_FXR_BLA_Agonist_ch2, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene NR1H4. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic FXR target gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_FXR_BLA_Agonist_ch2 was designed to measure cleaved reporter gene substrate to target
nuclear receptor activity at the protein (receptor) level, specifically mapping to NR1H4 gene(s) using a positive
control of Chenodeoxycholic acid

The Tox21 farnesoid-X-receptor (FXR) beta-lactamase agonism assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to induce farnesoid-X-receptor
signaling, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4 system.

-------
The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with test
compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. The FXR-HEK293 line is a stably transfected cell line
containing an intact FXR signaling pathway. FXR multiple hormone response element (MHRE) reporters were
introduced into HEK293 cells that express endogenous FXR. The HEK-293 cell line is a human embryonic kidney
cell line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973 (Graham
et al. 1977). The transformation incorporated approximately 4.5 kilobases from the viral genome into human
chromosome 19 of the HEK cells, and subsequent cytogenetic characterization established that the 293 line is
pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of growth and transfection cells and
are frequently used to produce exogenous proteins or viruses for pharmaceutical and biomedical research
purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cells should be grown to reach 80 to 90% confluence. Cell
Thawing Method: Place 14 mL of pre-warmed thaw medium into a 15 mL of conical tub. Remove the vial of cells
to be thawed from liquid nitrogen and thaw rapidly by placing at 37C in a water bath with gentle agitation for
1-2 minutes. Decontaminate the vial by wiping with 70% ethanol before opening in a biological safety cabinet.
Transfer the vial contents drop-wise into 14 mL of thaw medium in a sterile 15-mL conical tub. Centrifuge cells
at 900 rpm for 4 minutes and resuspend in thaw medium. Transfer contents to the T75 tissue culture flask
containing Thaw Medium and place flask in a humidified 37C/5% C02 incubator. Switch to growth medium at
first passage. Cell Proliferation Method: Aspirate medium, rinse once in DPBS, add 0.05% Trypsin/EDTA and
swirl to coat the cell evenly. Add an equal volume of Growth Medium to inactivate Trypsin after 2-3 minutes
incubation at 37C. Centrifuge cells at 900 rpm for 4 minutes and resuspend in growth medium. Cell should be
passage at least twice a week. Assay Protocol: Harvest cells from culture and resuspend in assay medium.
Dispense 5000 cells/5uL/well into 1536-well tissue treated black, clear-bottom plates using a Multi-drop
dispenser. After the cells were incubated at 37C for 5 hours, 23 nL of control or compounds dissolved in DMSO
were transferred to the assay plate by a PinTool resulting in a 217-fold dilution. Incubate the plates for 16 hours
at 37C. Add 1 uL of 6X LiveBLAzer-FRET B/G (CCF4-AM). Substrate Mixture to each well using a BioRAPTR
dispenser. After two hours incubation at room temperature, measure fluorescence intensity at 460 and 530 nm
emission and 405 nm excitation by an Envision detector. Data is expressed as the ratio of 460nm/530nm
emissions.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

Chenodeoxycholic acid

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 1.081
Response cutoff threshold used to determine hit calls: 20

-------
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Farnesoid-X-Receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Chenodeoxycholic acid was used as a positive FXR agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must

-------
occur through the database and should come from the available list of methods for each processing level.

Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315

Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
97

Inactive hit count: 0
-------
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.064

Neutral control median absolute deviation, by plate: nmad	1.784

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-2882.35%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	99.451

Positive control well median absolute deviation, by plate: pmad	17.608

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	5.604

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 443.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Hsu CW, Zhao J, Huang R, Hsieh JH, Hamm J,
Chang X, Houck K, Xia M. Quantitative high-throughput profiling of environmental chemicals and drugs that
modulate farnesoid X receptor. Sci Rep. 2014 Sep 26;4:6437. doi: 10.1038/srep06437. PMID: 25257666; PMCID:
PMC4894417.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1119

T0X21_FXR_B LA_Agon ist_ratio

1.	General Information

1.1	Assay Title: Tox21 HEK293T Farnesoid-X-Receptor (FXR) Agonism Beta-lactamase Assay, Ratio

1.2	Assay Summary: TOX21_FXR_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-fxr-
bla-agonist-p2. TOX21_FXR_BLA_Agonist_ratio is an assay readout measuring reporter gene via receptor
activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase
reporter gene. The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) reporter gene substrate
used as the measure of target activity. Data from the assay component TOX21_FXR_BLA_Agonist_ratio was
analyzed into 1 assay endpoint. This assay endpoint, TOX21_FXR_BLA_Agonist_ratio, was analyzed in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of inducible reporter, measures of receptor for gain-of-signal activity can be used to understand the reporter
gene at the pathway-level as they relate to the gene NR1H4. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this ratio serves a
reporter gene function to understand target activity.To generalize the intended target to other relatable targets,
this assay endpoint is annotated to the nuclear receptor intended target family, where the subfamily is non-
steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic FXR target gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_FXR_BLA_Agonist_ratio was designed to target nuclear receptor activity at the protein
(receptor) level, specifically mapping to NR1H4 gene(s) using a positive control of Chenodeoxycholic acid

The Tox21 farnesoid-X-receptor (FXR) beta-lactamase agonism assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to induce farnesoid-X-receptor
signaling, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4 system.

-------
The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with test
compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. The farnesoid-X-receptor (FXR), a nuclear hormone receptor, is highly
expressed in liver, intestine, kidney and adrenal cortex. Natural ligands of FXR are the bile acids (Cholic acid,
Chenodeoxy cholic acid etc). FXR is an important regulator of diverse metabolic pathways, including the bile acid
homeostasis, lipid and glucose metabolisms. FXR regulates the expression of target genes by binding either as a
monomer or as a heterodimer with the retinoid X receptor (RXR). Numerous studies have reported that FXR
exerts protective function during cholestasis, diabetes, liver regeneration, and cancer. To identify compounds
that activate FXR signaling, GeneBLAzer FXR-UAS-bla HEK 293T cell line (Invitrogen, Carlsbad, CA) containing a
beta-lactamase reporter gene under the control of a UAS response element was used to screen the Tox2110K
compound library.

2.3	Experimental System: adherent HEK293T cell line used. The FXR-HEK293 line is a stably transfected cell line
containing an intact FXR signaling pathway. FXR multiple hormone response element (MHRE) reporters were
introduced into HEK293 cells that express endogenous FXR. The HEK-293 cell line is a human embryonic kidney
cell line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973 (Graham
et al. 1977). The transformation incorporated approximately 4.5 kilobases from the viral genome into human
chromosome 19 of the HEK cells, and subsequent cytogenetic characterization established that the 293 line is
pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of growth and transfection cells and
are frequently used to produce exogenous proteins or viruses for pharmaceutical and biomedical research
purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cells should be grown to reach 80 to 90% confluence. Cell
Thawing Method: Place 14 mL of pre-warmed thaw medium into a 15 mL of conical tub. Remove the vial of cells
to be thawed from liquid nitrogen and thaw rapidly by placing at 37C in a water bath with gentle agitation for
1-2 minutes. Decontaminate the vial by wiping with 70% ethanol before opening in a biological safety cabinet.
Transfer the vial contents drop-wise into 14 mL of thaw medium in a sterile 15-mL conical tub. Centrifuge cells
at 900 rpm for 4 minutes and resuspend in thaw medium. Transfer contents to the T75 tissue culture flask
containing Thaw Medium and place flask in a humidified 37C/5% C02 incubator. Switch to growth medium at
first passage. Cell Proliferation Method: Aspirate medium, rinse once in DPBS, add 0.05% Trypsin/EDTA and
swirl to coat the cell evenly. Add an equal volume of Growth Medium to inactivate Trypsin after 2-3 minutes
incubation at 37C. Centrifuge cells at 900 rpm for 4 minutes and resuspend in growth medium. Cell should be
passage at least twice a week. Assay Protocol: Harvest cells from culture and resuspend in assay medium.
Dispense 5000 cells/5uL/well into 1536-well tissue treated black, clear-bottom plates using a Multi-drop
dispenser. After the cells were incubated at 37C for 5 hours, 23 nL of control or compounds dissolved in DMSO

-------
were transferred to the assay plate by a PinTool resulting in a 217-fold dilution. Incubate the plates for 16 hours
at 37C. Add 1 uL of 6X LiveBLAzer-FRET B/G (CCF4-AM). Substrate Mixture to each well using a BioRAPTR
dispenser. After two hours incubation at room temperature, measure fluorescence intensity at 460 and 530 nm
emission and 405 nm excitation by an Envision detector. Data is expressed as the ratio of 460nm/530nm
emissions.

Baseline median absolute deviation for the assay (bmad): 1.72

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Farnesoid-X-Receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

Chenodeoxycholic acid

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

NA

-------
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Chenodeoxycholic acid was used as a positive FXR agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag

-------
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: Oihitc 0.9	NA hit count: hitc^O

185	8154	976

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	215

gain-loss (gnls) model:	331

power(pow) model:	363

linear-polynomial (polyl) model:	4306

quadratic-polynomial(poly2) model:	543

exponential-2 (exp2) model:	210

exponential-3 (exp3) model:	14

exponential-4 (exp4) model:	2979

exponential-5 (exp5) model:	354

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3 Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In

-------
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.089

Neutral control median absolute deviation, by plate: nmad	2.638

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-2768.8%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	99.489

Positive control well median absolute deviation, by plate: pmad	16.562

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	5.915

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

-------
Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 354.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Hsu CW, Zhao J, Huang R, Hsieh JH, Hamm J,
Chang X, Houck K, Xia M. Quantitative high-throughput profiling of environmental chemicals and drugs that

-------
modulate farnesoid X receptor. Sci Rep. 2014 Sep 26;4:6437. doi: 10.1038/srep06437. PMID: 25257666; PMCID:
PMC4894417.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1120

TOX2 l_FXR_BLA_Antagon ist_ratio

1.	General Information

1.1	Assay Title: Tox21 HEK293T Farnesoid-X-Receptor (FXR) Antagonism Beta-lactamase Assay, Ratio

1.2	Assay Summary: TOX21 FXR BLA Antagonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-fxr-
bla-antagonist-pl. TOX21_FXR_BLA_Antagonist_ratio is an assay readout measuring reporter gene via receptor
activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase
reporter gene. The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) reporter gene substrate
used as the measure of target activity. Data from the assay component TOX21_FXR_BLA_Antagonist_ratio was
analyzed into 1 assay endpoint. This assay endpoint, TOX21_FXR_BLA_Antagonist_ratio, was analyzed in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of inducible reporter, measures of receptor for gain-of-signal activity can be used to understand the reporter
gene at the pathway-level as they relate to the gene NR1H4. Furthermore, this assay endpoint can be referred
to as a primary readout, because this assay has produced multiple assay endpoints where this one serves a
reporter gene function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the nuclear receptor intended target family, where the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic FXR target gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_FXR_BLA_Antagonist_ratio was designed to target nuclear receptor activity at the protein
(receptor) level, specifically mapping to NR1H4 gene(s) using a positive control of Guggulsterone

The Tox21 farnesoid-X-receptor (FXR) beta-lactamase antagonism assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to suppress farnesoid-X-
receptor signaling, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with test

-------
compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. The farnesoid-X-receptor (FXR), a nuclear hormone receptor, is highly
expressed in liver, intestine, kidney and adrenal cortex. Natural ligands of FXR are the bile acids (Cholic acid,
Chenodeoxy cholic acid etc). FXR is an important regulator of diverse metabolic pathways, including the bile acid
homeostasis, lipid and glucose metabolisms. FXR regulates the expression of target genes by binding either as a
monomer or as a heterodimer with the retinoid X receptor (RXR). Numerous studies have reported that FXR
exerts protective function during cholestasis, diabetes, liver regeneration, and cancer. To identify compounds
that inhibit FXR signaling, GeneBLAzer FXR-UAS-bla HEK 293T cell line (Invitrogen, Carlsbad, CA) containing a
beta-lactamase reporter gene under the control of a UAS response element was used to screen the Tox2110K
compound library.

2.3	Experimental System: adherent HEK293T cell line used. The FXR-HEK293 line is a stably transfected cell line
containing an intact FXR signaling pathway. FXR multiple hormone response element (MHRE) reporters were
introduced into HEK293 cells that express endogenous FXR. The HEK-293 cell line is a human embryonic kidney
cell line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973 (Graham
et al. 1977). The transformation incorporated approximately 4.5 kilobases from the viral genome into human
chromosome 19 of the HEK cells, and subsequent cytogenetic characterization established that the 293 line is
pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of growth and transfection cells and
are frequently used to produce exogenous proteins or viruses for pharmaceutical and biomedical research
purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cells should be grown to reach 80 to 90% confluence. Cell
Thawing Method: Place 14 mL of pre-warmed thaw medium into a 15 mL of conical tub. Remove the vial of cells
to be thawed from liquid nitrogen and thaw rapidly by placing at 37C in a water bath with gentle agitation for
1-2 minutes. Decontaminate the vial by wiping with 70% ethanol before opening in a biological safety cabinet.
Transfer the vial contents drop-wise into 14 mL of thaw medium in a sterile 15-mL conical tub. Centrifuge cells
at 900 rpm for 4 minutes and resuspend in thaw medium. Transfer contents to the T75 tissue culture flask
containing Thaw Medium and place flask in a humidified 37C/5% C02 incubator. Switch to growth medium at
first passage. Cell Proliferation Method: Aspirate medium, rinse once in DPBS, add 0.05% Trypsin/EDTA and
swirl to coat the cell evenly. Add an equal volume of Growth Medium to inactivate Trypsin after 2-3 minutes
incubation at 37C. Centrifuge cells at 900 rpm for 4 minutes and resuspend in growth medium. Cell should be
passage at least twice a week. Assay Protocol: Harvest cells from culture in growth medium and resuspend in
assay medium. Dispense 5000 cells/5uL/well into 1536-well tissue treated black, clear-bottom plates using a
Multi-drop dispenser. After the cells were incubated at 37Cfor 5 hours, 23 nLof control or compounds dissolved
in DMSO were transferred to the assay plate by a PinTool resulting in a 217-fold dilution. Add 1 uL of agonist

-------
(CDCA) at 300 nM in assay medium to the column 1-2 and column. Add 1 uL of assay medium to the column 3-
4. Incubate the plates for 16 hours at 37C. Add 1 uL of 6X LiveBLAzer-FRET B/G (CCF4-AM) Substrate Mixture to
each well using a BioRAPTR dispenser. After two hours incubation at room temperature, measure fluorescence
intensity at 460 and 530 nm emission and 405 nm excitation by an Envision detector. Data is expressed as the
ratio of 460nm/530nm emissions.

Baseline median absolute deviation for the assay (bmad): 2.789

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Farnesoid-X-Receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

Guggulsterone

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

NA

-------
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Guggulsterone was used as a positive FXR antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag

-------
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: Oihitc 0.9	NA hit count: hitc^O

1241	5697	2377

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	466

gain-loss (gnls) model:	498

power(pow) model:	686

linear-polynomial (polyl) model:	3302

quadratic-polynomial(poly2) model:	822

exponential-2 (exp2) model:	408

exponential-3 (exp3) model:	82

exponential-4 (exp4) model:	2431

exponential-5 (exp5) model:	620

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3 Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In

-------
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.127

Neutral control median absolute deviation, by plate: nmad	4.829

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3707.09%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-100.376

Positive control well median absolute deviation, by plate: pmad	6.094

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-12.619

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

-------
Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 620.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Hsu CW, Zhao J, Huang R, Hsieh JH, Hamm J,
Chang X, Houck K, Xia M. Quantitative high-throughput profiling of environmental chemicals and drugs that

-------
modulate farnesoid X receptor. Sci Rep. 2014 Sep 26;4:6437. doi: 10.1038/srep06437. PMID: 25257666; PMCID:
PMC4894417.

7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1121

T0X21_FXR_B LA_Antagon ist_via bi lity

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HEK293T Farnesoid-X-Receptor (FXR) Antagonism Beta-lactamase
Assay

1.2	Assay Summary: TOX21 FXR BLA Antagonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-fxr-
bla-antagonist-pl. TOX21_FXR_BLA_Antagonist_viability is an assay readout measuring cellular ATP content and
detected with CellTiter-Glo Luciferase-coupled ATP quantitation. TOX21_FXR_BLA_Antagonist_viability used a
type of viability reporter where loss-of-signal activity can be used to understand changes in the cell viability.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a viability function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily
is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic FXR target gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 farnesoid-X-receptor (FXR) beta-lactamase antagonism assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to suppress farnesoid-X-
receptor signaling, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with test
compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla

-------
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. The farnesoid-X-receptor (FXR), a nuclear hormone receptor, is highly
expressed in liver, intestine, kidney and adrenal cortex. Natural ligands of FXR are the bile acids (Cholic acid,
Chenodeoxy cholic acid etc). FXR is an important regulator of diverse metabolic pathways, including the bile acid
homeostasis, lipid and glucose metabolisms. FXR regulates the expression of target genes by binding either as a
monomer or as a heterodimer with the retinoid X receptor (RXR). Numerous studies have reported that FXR
exerts protective function during cholestasis, diabetes, liver regeneration, and cancer. To identify compounds
that inhibit FXR signaling, GeneBLAzer FXR-UAS-bla HEK 293T cell line (Invitrogen, Carlsbad, CA) containing a
beta-lactamase reporter gene under the control of a UAS response element was used to screen the Tox2110K
compound library.

2.3	Experimental System: adherent HEK293T cell line used. The FXR-HEK293 line is a stably transfected cell line
containing an intact FXR signaling pathway. FXR multiple hormone response element (MHRE) reporters were
introduced into HEK293 cells that express endogenous FXR. The HEK-293 cell line is a human embryonic kidney
cell line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973 (Graham
et al. 1977). The transformation incorporated approximately 4.5 kilobases from the viral genome into human
chromosome 19 of the HEK cells, and subsequent cytogenetic characterization established that the 293 line is
pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of growth and transfection cells and
are frequently used to produce exogenous proteins or viruses for pharmaceutical and biomedical research
purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cells should be grown to reach 80 to 90% confluence. Cell
Thawing Method: Place 14 mL of pre-warmed thaw medium into a 15 mL of conical tub. Remove the vial of cells
to be thawed from liquid nitrogen and thaw rapidly by placing at 37C in a water bath with gentle agitation for
1-2 minutes. Decontaminate the vial by wiping with 70% ethanol before opening in a biological safety cabinet.
Transfer the vial contents drop-wise into 14 mL of thaw medium in a sterile 15-mL conical tub. Centrifuge cells
at 900 rpm for 4 minutes and resuspend in thaw medium. Transfer contents to the T75 tissue culture flask
containing Thaw Medium and place flask in a humidified 37C/5% C02 incubator. Switch to growth medium at
first passage. Cell Proliferation Method: Aspirate medium, rinse once in DPBS, add 0.05% Trypsin/EDTA and
swirl to coat the cell evenly. Add an equal volume of Growth Medium to inactivate Trypsin after 2-3 minutes
incubation at 37C. Centrifuge cells at 900 rpm for 4 minutes and resuspend in growth medium. Cell should be
passage at least twice a week. Assay Protocol: Harvest cells from culture in growth medium and resuspend in
assay medium. Dispense 5000 cells/5uL/well into 1536-well tissue treated black, clear-bottom plates using a
Multi-drop dispenser. After the cells were incubated at 37Cfor 5 hours, 23 nL of control or compounds dissolved
in DMSO were transferred to the assay plate by a PinTool resulting in a 217-fold dilution. Add 1 uL of agonist
(CDCA) at 300 nM in assay medium to the column 1-2 and column. Add 1 uL of assay medium to the column 3-
4. Incubate the plates for 16 hours at 37C. Add 1 uL of 6X LiveBLAzer-FRET B/G (CCF4-AM) Substrate Mixture to

-------
each well using a BioRAPTR dispenser. After two hours incubation at room temperature, measure fluorescence
intensity at 460 and 530 nm emission and 405 nm excitation by an Envision detector. Data is expressed as the
ratio of 460nm/530nm emissions.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 6.819
Response cutoff threshold used to determine hit calls: 40.911

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Farnesoid-X-Receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with

-------
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.449

Neutral control median absolute deviation, by plate: nmad	14.691

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3159.02%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 607.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Hsu CW, Zhao J, Huang R, Hsieh JH, Hamm J,
Chang X, Houck K, Xia M. Quantitative high-throughput profiling of environmental chemicals and drugs that
modulate farnesoid X receptor. Sci Rep. 2014 Sep 26;4:6437. doi: 10.1038/srep06437. PMID: 25257666; PMCID:
PMC4894417.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1122

TOX21_PPARd_BLA_Agonist_chl

1.	General Information

1.1	Assay Title: Tox21 HEK293 Peroxisome Proliferator-activated Receptor-delta (PPARd) Agonism Beta-lactamase
Assay, Channel 1 Readout of Uncleaved Substrate

1.2	Assay Summary: TOX21_PPARd_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-
ppard-bla-agonist-pl. TOX21_PPARd_BLA_Agonist_chl is an assay readout measuring reporter gene via
receptor activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-
lactamase reporter gene. The signal is derived from the uncleaved reporter gene substrate and is used to
calculate the ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data
from the assay component TOX21_PPARd_BLA_Agonist_chl was analyzed into 1 assay endpoint. This assay
endpoint, TOX21_PPARd_BLA_Agonist_chl, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity
can be used to understand changes in the reporter gene as they relate to the gene PPARD. Furthermore, this
assay endpoint can be referred to as a secondary readout, because this assay has produced multiple assay
endpoints where this one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 17-hours prior to monitoring fluorescence emission resulting
from PPAR-delta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_PPARD_BLA_Agonist_chl was designed to measure uncleaved reporter gene substrate to
target nuclear receptor activity at the protein (receptor) level, specifically mapping to PPARD gene(s) using a
positive control of L-165,041

The Tox21 peroxisome proliferator-activated receptor delta agonism beta-lactamase assay screened a library of
diverse environmental compounds to probe for xenobiotic ligand-binding and potential to induce PPAR-delta-
dependent transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid

-------
GAL4 system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the
assay and following 17-hour incubation of cells with test compounds a membrane-permeable FRET-based
substrate CCF4-AM is introduced and incubated for an additional 2 hours in the dark. Once in the cell,
cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla expression is quantified
by measuring the ratio of blue (460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals
are monitored using an Envision plate reader. Following CCF4 incubation and detection, 3uL of CellTiter-Glo
reagent is added to each well and incubated for 30 minutes before cytotoxicity readout is measured on a
ViewLux microtiter plate reader. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer PPAR-delta 293T DA (Division-arrested)
cells and PPAR delta-UAS-bla 293T cells contain a peroxisome proliferator-activated receptor delta (PPARd)
ligand-binding domain/Gal4 DNA-binding domain chimera, stably integrated into the CellSensor UAS-bla 293T
cell line. CellSensor UAS-bla 293T contains a beta-lactamase reporter gene under control of a UAS response
element stably integrated into 293T cells. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4
(DBD)-PPAR delta (LBD) fusion protein, the protein binds to the UAS, resulting in expression of -beta-lactamase.
The HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 (Dohr et al. 1995). The transformation incorporated approximately
4.5 kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: The cells should not be grown more than 80-85% confluence.
The cell performance is affected is they are more confluent. Do not leave cells in Trypsin for more than 5 min at
RT. Handle 1536 well plate black clear bottom plates carefully by sides. Cell Thawing Method: Place 14 mL of
pre-warmed thaw medium into a 15 ml conical tube. Remove the vial of cells to be thawed from liquid nitrogen
and thaw rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial
in water. Mix the entire content of the vial to 14 ml of pre-warmed medium and centrifuge to remove DMSO.
Discard the supernatant and transfer the precipitated cells to T175 flask using 30 ml thawing medium. Cell
Proliferation Method: Detach the cells from the flask using TrypLExpress. The cells are re-seeded in T-175 flask
at 2.5-4 million. Assay Protocol: Spin down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend
the pellet with assay medium followed by filtering through cell strainer and adjust the required cell density.
Plate the cells in black-clear bottom 1536 well plate at 3000/well/6uL through 8 tip Multidrop plate dispenser.
Incubate for 5hrs at 37C / 99 percent Humidity / 5 percent C02. Transfer 23nL of compounds from the library
collection and positive control to the assay plates through Pintool. Incubate for 17hrs at 37C / 99 percent
Humidity / 5 percent C02. Add 1 uL of CCF4 (FRET Substrate) dye using a single tip plate dispenser (BioRAPTR).
Incubate at room temperature for lhrs in the dark. Read the fluorescence intensity through Envision plate
reader using Beta-Lactamase protocol optimized for this cell type. Add 3 uL of Cell Titer Glo and Incubate at
room temperature for 0.5 hrs in dark for both agonist and antagonist mode. Read on ViewLux protocol
optimized for this cell type for both agonist and antagonist mode PPARd-bla cells were dispensed at 3000
cells/6uL/well in 1536-well black wall/clear bottom plates using a Multidrop Combi (Thermo Fisher Scientific,
Waltham, MA) dispenser. After the assay plates were incubated at 37C and 5 percent C02 for 5 h, 23 nL of
compounds dissolved in DMSO, positive controls or DMSO only was transferred to the assay plate by a Pintool
station (Kalypsys, San Diego, CA). The assay plates were incubated at 37C for 17 h. After 1 uL of LiveBLAzer B/G
FRET substrate was added using a BioRAPTR Flying Reagent Dispenser (FRD) workstation (Beckman Coulter,

-------
Indianapolis, IN, USA), the plates were incubated at room temperature for 2 h, and fluorescence intensity was
measured by an Envision plate reader (PerkinElmer, Shelton, CT). For cell viability readout that measures
cytotoxicity, 3 uL/well of CellTiter-Glo reagent was added into the assay plates using a BioRAPTR FRD. After 30
min incubation at room temperature, the luminescence intensity in the plates was measured using a ViewLux
plate reader (PerkinElmer).

Baseline median absolute deviation for the assay (bmad): 5.839
Response cutoff threshold used to determine hit calls: 35.036
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: PPAR-delta agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

L-165,041

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

NA

-------
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
L-165,041 was used as a positive PPARd agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag

-------
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: Oihitc 0.9	NA hit count: hitc^O

82	7086	2147

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	323

gain-loss (gnls) model:	592

power(pow) model:	436

linear-polynomial (polyl) model:	2950

quadratic-polynomial(poly2) model:	567

exponential-2 (exp2) model:	180

exponential-3 (exp3) model:	26

exponential-4 (exp4) model:	3549

exponential-5 (exp5) model:	692

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3 Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In

-------
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.263

Neutral control median absolute deviation, by plate: nmad	7.311

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-2832.13%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-15.487

Positive control well median absolute deviation, by plate: pmad	4.854

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.717

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

-------
Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 692.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1123

TOX21_PPARd_BLA_Agonist_ch2

1.	General Information

1.1	Assay Title: Tox21 HEK293 Peroxisome Proliferator-activated Receptor-delta (PPARd) Agonism Beta-lactamase
Assay, Channel 2 Readout of Cleaved Substrate

1.2	Assay Summary: TOX21_PPARd_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-
ppard-bla-agonist-pl. TOX21_PPARd_BLA_Agonist_ch2 is an assay readout measuring reporter gene via
receptor activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-
lactamase reporter gene. The signal is derived from the cleaved reporter gene substrate and is used to calculate
the ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the
assay component TOX21_PPARd_BLA_Agonist_ch2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_PPARd_BLA_Agonist_ch2, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, increased activity can be used to
understand changes in the reporter gene as they relate to the gene PPARD. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 17-hours prior to monitoring fluorescence emission resulting
from PPAR-delta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_PPARD_BLA_Agonist_ch2 was designed to measure cleaved reporter gene substrate to
target nuclear receptor activity at the protein (receptor) level, specifically mapping to PPARD gene(s) using a
positive control of L-165,041

The Tox21 peroxisome proliferator-activated receptor delta agonism beta-lactamase assay screened a library of
diverse environmental compounds to probe for xenobiotic ligand-binding and potential to induce PPAR-delta-
dependent transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid

-------
GAL4 system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the
assay and following 17-hour incubation of cells with test compounds a membrane-permeable FRET-based
substrate CCF4-AM is introduced and incubated for an additional 2 hours in the dark. Once in the cell,
cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla expression is quantified
by measuring the ratio of blue (460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals
are monitored using an Envision plate reader. Following CCF4 incubation and detection, 3uL of CellTiter-Glo
reagent is added to each well and incubated for 30 minutes before cytotoxicity readout is measured on a
ViewLux microtiter plate reader. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer PPAR-delta 293T DA (Division-arrested)
cells and PPAR delta-UAS-bla 293T cells contain a peroxisome proliferator-activated receptor delta (PPARd)
ligand-binding domain/Gal4 DNA-binding domain chimera, stably integrated into the CellSensor UAS-bla 293T
cell line. CellSensor UAS-bla 293T contains a beta-lactamase reporter gene under control of a UAS response
element stably integrated into 293T cells. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4
(DBD)-PPAR delta (LBD) fusion protein, the protein binds to the UAS, resulting in expression of -beta-lactamase.
The HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 (Dohr et al. 1995). The transformation incorporated approximately
4.5 kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: The cells should not be grown more than 80-85% confluence.
The cell performance is affected is they are more confluent. Do not leave cells in Trypsin for more than 5 min at
RT. Handle 1536 well plate black clear bottom plates carefully by sides. Cell Thawing Method: Place 14 mL of
pre-warmed thaw medium into a 15 ml conical tube. Remove the vial of cells to be thawed from liquid nitrogen
and thaw rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial
in water. Mix the entire content of the vial to 14 ml of pre-warmed medium and centrifuge to remove DMSO.
Discard the supernatant and transfer the precipitated cells to T175 flask using 30 ml thawing medium. Cell
Proliferation Method: Detach the cells from the flask using TrypLExpress. The cells are re-seeded in T-175 flask
at 2.5-4 million. Assay Protocol: Spin down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend
the pellet with assay medium followed by filtering through cell strainer and adjust the required cell density.
Plate the cells in black-clear bottom 1536 well plate at 3000/well/6uL through 8 tip Multidrop plate dispenser.
Incubate for 5hrs at 37C / 99 percent Humidity / 5 percent C02. Transfer 23nL of compounds from the library
collection and positive control to the assay plates through Pintool. Incubate for 17hrs at 37C / 99 percent
Humidity / 5 percent C02. Add 1 uL of CCF4 (FRET Substrate) dye using a single tip plate dispenser (BioRAPTR).
Incubate at room temperature for lhrs in the dark. Read the fluorescence intensity through Envision plate
reader using Beta-Lactamase protocol optimized for this cell type. Add 3 uL of Cell Titer Glo and Incubate at
room temperature for 0.5 hrs in dark for both agonist and antagonist mode. Read on ViewLux protocol
optimized for this cell type for both agonist and antagonist mode PPARd-bla cells were dispensed at 3000
cells/6uL/well in 1536-well black wall/clear bottom plates using a Multidrop Combi (Thermo Fisher Scientific,
Waltham, MA) dispenser. After the assay plates were incubated at 37C and 5 percent C02 for 5 h, 23 nL of
compounds dissolved in DMSO, positive controls or DMSO only was transferred to the assay plate by a Pintool
station (Kalypsys, San Diego, CA). The assay plates were incubated at 37C for 17 h. After 1 uL of LiveBLAzer B/G
FRET substrate was added using a BioRAPTR Flying Reagent Dispenser (FRD) workstation (Beckman Coulter,

-------
Indianapolis, IN, USA), the plates were incubated at room temperature for 2 h, and fluorescence intensity was
measured by an Envision plate reader (PerkinElmer, Shelton, CT). For cell viability readout that measures
cytotoxicity, 3 uL/well of CellTiter-Glo reagent was added into the assay plates using a BioRAPTR FRD. After 30
min incubation at room temperature, the luminescence intensity in the plates was measured using a ViewLux
plate reader (PerkinElmer).

Baseline median absolute deviation for the assay (bmad): 2.086

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: PPAR-delta agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

L-165,041

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

NA

-------
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
L-165,041 was used as a positive PPARd agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag

-------
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: Oihitc 0.9	NA hit count: hitc^O

179	8236	900

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	336

gain-loss (gnls) model:	480

power(pow) model:	471

linear-polynomial (polyl) model:	3107

quadratic-polynomial(poly2) model:	538

exponential-2 (exp2) model:	221

exponential-3 (exp3) model:	31

exponential-4 (exp4) model:	3567

exponential-5 (exp5) model:	564

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3 Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In

-------
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.117

Neutral control median absolute deviation, by plate: nmad	3.179

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-2675.46%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	99.512

Positive control well median absolute deviation, by plate: pmad	13.642

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	7.075

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

-------
Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 564.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1124

TOX21_PPARd_BLA_Agonist_ratio

1.	General Information

1.1	Assay Title: Tox21 HEK293 Peroxisome Proliferator-activated Receptor-delta (PPARd) Agonism Beta-lactamase
Assay, Ratio

1.2	Assay Summary: TOX21_PPARd_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-
ppard-bla-agonist-pl. TOX21_PPARd_BLA_Agonist_ratio is an assay readout measuring reporter gene via
receptor activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-
lactamase reporter gene. The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) reporter gene
substrate used as the measure of target activity. Data from the assay component
TOX21_PPARd_BLA_Agonist_ratio was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_PPARd_BLA_Agonist_ratio, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene PPARD. Furthermore, this assay endpoint
can be referred to as a primary readout, because this assay has produced multiple assay endpoints where this
ratio serves a reporter gene function to understand target activity. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the
subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 17-hours prior to monitoring fluorescence emission resulting
from PPAR-delta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_PPARD_BLA_Agonist_ratio was designed to target nuclear receptor activity at the protein
(receptor) level, specifically mapping to PPARD gene(s) using a positive control of L-165,041

The Tox21 peroxisome proliferator-activated receptor delta agonism beta-lactamase assay screened a library of
diverse environmental compounds to probe for xenobiotic ligand-binding and potential to induce PPAR-delta-

-------
dependent transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid
GAL4 system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the
assay and following 17-hour incubation of cells with test compounds a membrane-permeable FRET-based
substrate CCF4-AM is introduced and incubated for an additional 2 hours in the dark. Once in the cell,
cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla expression is quantified
by measuring the ratio of blue (460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals
are monitored using an Envision plate reader. Following CCF4 incubation and detection, 3uL of CellTiter-Glo
reagent is added to each well and incubated for 30 minutes before cytotoxicity readout is measured on a
ViewLux microtiter plate reader. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Peroxisome proliferator-activated receptor delta is a ligand-activated
nuclear receptor which is expressed ubiquitously and may have a role in regulating the differentiation of
adipocytes, in keratinocyte differentiation and in the regulation of cholesterol and lipid metabolism (Schmuth
et al. 2004, Seimandi et al. 2005). The PPAR-delta_BLA_Agonist assay used Fluorescence Resonance Energy
Transfer (FRET) substrate to generate a ratiometric reporter response to receptor ligand-binding to allow
monitoring of PPAR-delta activity relative to a known receptor agonist. This assay is designed to help identify
environmental compounds with a capacity for PPAR-delta ligand-binding activity. The Tox21 PPAR-delta bla
assays are qHTS format assays which measured the ability of a chemical to interact with PPAR-delta by
monitoring modulation of fluorescence reporter gene signals. This assay utilized a human embryonic kidney cell
line (HEK293T) which expresses PPAR-delta and a one-hybrid GAL4 system to quantify xenobiotic PPAR-delta
agonism.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer PPAR-delta 293T DA (Division-arrested)
cells and PPAR delta-UAS-bla 293T cells contain a peroxisome proliferator-activated receptor delta (PPARd)
ligand-binding domain/Gal4 DNA-binding domain chimera, stably integrated into the CellSensor UAS-bla 293T
cell line. CellSensor UAS-bla 293T contains a beta-lactamase reporter gene under control of a UAS response
element stably integrated into 293T cells. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4
(DBD)-PPAR delta (LBD) fusion protein, the protein binds to the UAS, resulting in expression of -beta-lactamase.
The HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 (Dohr et al. 1995). The transformation incorporated approximately
4.5 kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: The cells should not be grown more than 80-85% confluence.
The cell performance is affected is they are more confluent. Do not leave cells in Trypsin for more than 5 min at
RT. Handle 1536 well plate black clear bottom plates carefully by sides. Cell Thawing Method: Place 14 mL of
pre-warmed thaw medium into a 15 ml conical tube. Remove the vial of cells to be thawed from liquid nitrogen
and thaw rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial

-------
in water. Mix the entire content of the vial to 14 ml of pre-warmed medium and centrifuge to remove DMSO.
Discard the supernatant and transfer the precipitated cells to T175 flask using 30 ml thawing medium. Cell
Proliferation Method: Detach the cells from the flask using TrypLExpress. The cells are re-seeded in T-175 flask
at 2.5-4 million. Assay Protocol: Spin down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend
the pellet with assay medium followed by filtering through cell strainer and adjust the required cell density.
Plate the cells in black-clear bottom 1536 well plate at 3000/well/6uL through 8 tip Multidrop plate dispenser.
Incubate for 5hrs at 37C / 99 percent Humidity / 5 percent C02. Transfer 23nL of compounds from the library
collection and positive control to the assay plates through Pintool. Incubate for 17hrs at 37C / 99 percent
Humidity / 5 percent C02. Add 1 uL of CCF4 (FRET Substrate) dye using a single tip plate dispenser (BioRAPTR).
Incubate at room temperature for lhrs in the dark. Read the fluorescence intensity through Envision plate
reader using Beta-Lactamase protocol optimized for this cell type. Add 3 uL of Cell Titer Glo and Incubate at
room temperature for 0.5 hrs in dark for both agonist and antagonist mode. Read on ViewLux protocol
optimized for this cell type for both agonist and antagonist mode PPARd-bla cells were dispensed at 3000
cells/6uL/well in 1536-well black wall/clear bottom plates using a Multidrop Combi (Thermo Fisher Scientific,
Waltham, MA) dispenser. After the assay plates were incubated at 37C and 5 percent C02 for 5 h, 23 nL of
compounds dissolved in DMSO, positive controls or DMSO only was transferred to the assay plate by a Pintool
station (Kalypsys, San Diego, CA). The assay plates were incubated at 37C for 17 h. After 1 uL of LiveBLAzer B/G
FRET substrate was added using a BioRAPTR Flying Reagent Dispenser (FRD) workstation (Beckman Coulter,
Indianapolis, IN, USA), the plates were incubated at room temperature for 2 h, and fluorescence intensity was
measured by an Envision plate reader (PerkinElmer, Shelton, CT). For cell viability readout that measures
cytotoxicity, 3 uL/well of CellTiter-Glo reagent was added into the assay plates using a BioRAPTR FRD. After 30
min incubation at room temperature, the luminescence intensity in the plates was measured using a ViewLux
plate reader (PerkinElmer).

Baseline median absolute deviation for the assay (bmad): 2.916

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: PPAR-delta agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

L-165,041

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

NA

-------
Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso)) x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
L-165,041 was used as a positive PPARd agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test

-------
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
255

Inactive hit count: 0
-------
exponentials (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

21

3421

524

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-0.144

Neutral control median absolute deviation, by plate: nmad

4.064

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-2668.68%

POSITIVE CONTROL (well type = "p")

-------
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

99.74
7.632

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	11.168

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 524.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

-------
5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1125

TOX21_PPARd_BLA_Antagonist_ratio

1.	General Information

1.1	Assay Title: Tox21 HEK293 Peroxisome Proliferator-activated Receptor-delta (PPARd) Antagonism Beta-
lactamase Assay, Ratio

1.2	Assay Summary: TOX21_PPARd_BLA_Antagonist is a cell-based, single-readout assay that uses HEK293T, a
human kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See
tox21-ppard-bla-antagonist-pl. TOX21_PPARd_BLA_Antagonist_ratio is an assay readout measuring reporter
gene via receptor activity and designed using inducible reporter (beta lactamase induction) detected with GAL4
b-lactamase reportergene. The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) reportergene
substrate used as the measure of target activity. Data from the assay component
TOX21_PPARd_BLA_Antagonist_ratio was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_PPARd_BLA_Antagonist_ratio, was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be
used to understand changes in the reporter gene as they relate to the gene PPARD. Furthermore, this assay
endpoint can be referred to as a primary readout, because this assay has produced multiple assay endpoints
and this ratio serves a reporter gene function to understand target activity. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 17-hours prior to monitoring fluorescence emission resulting
from PPAR-delta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_PPARD_BLA_Antagonist_ratio was designed to target nuclear receptor activity at the
protein (receptor) level, specifically mapping to PPARD gene(s) using a positive control of MK886

The Tox21 peroxisome proliferator-activated receptor delta antagonism beta-lactamase assay screened a library
of diverse environmental compounds to probe for xenobiotic ligand-binding and potential to suppress PPAR-

-------
delta-dependent transcription, monitored through bla reporter gene signal activation using a mammalian one-
hybrid GAL4 system. Each well contained 0.3 uM L-165,041 to stimulate receptor activity and MK886 (a
leukotriene inhibitor) served as a positive control. The assay is run in triplicate on 1536-well microplates.
HEK293T cells are plated the day of the assay and following 17-hour incubation of cells with test compounds a
membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for an additional hour. Once
in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla expression
is quantified by measuring the ratio of blue (460nm, product) to green (530nm, substrate) fluorescence.
Fluorescence signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is
also incubated with test system for 30 minutes before readout to detect cell viability. Compound auto-
fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for
background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Peroxisome proliferator-activated receptor delta is a ligand-activated
nuclear receptor which is expressed ubiquitously and may have a role in regulating the differentiation of
adipocytes, in keratinocyte differentiation and in the regulation of cholesterol and lipid metabolism (Schmuth
et al. 2004, Seimandi et al. 2005). The PPAR-delta_BLA_Antagonist assay used Fluorescence Resonance Energy
Transfer (FRET) substrate to generate a ratiometric reporter response to receptor ligand-binding to allow
monitoring of PPAR-delta activity relative to a known receptor antagonist. This assay is designed to help identify
environmental compounds with a capacity for PPAR-delta interfering activity. The Tox21 PPAR-delta bla assays
are qHTS format assays which measured the ability of a chemical to interact with PPAR-delta by monitoring
modulation of fluorescence reporter gene signals. This assay utilized a human embryonic kidney cell line
(HEK293T) which expresses PPAR-delta and a one-hybrid GAL4 system to quantify xenobiotic PPAR-delta
antagonism.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer PPAR-delta 293T DA (Division-arrested)
cells and PPAR delta-UAS-bla 293T cells contain a peroxisome proliferator-activated receptor delta (PPARd)
ligand-binding domain/Gal4 DNA-binding domain chimera, stably integrated into the CellSensor UAS-bla 293T
cell line. CellSensor UAS-bla 293T contains a beta-lactamase reporter gene under control of a UAS response
element stably integrated into 293T cells. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4
(DBD)-PPAR delta (LBD) fusion protein, the protein binds to the UAS, resulting in expression of -beta-lactamase.
The HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 (Dohr et al. 1995). The transformation incorporated approximately
4.5 kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: The cells should not be grown more than 80-85% confluence.
The cell performance is affected is they are more confluent. Do not leave cells in Trypsin for more than 5 min at
RT. Handle 1536 well plate black clear bottom plates carefully by sides. Cell Thawing Method: Place 14 mL of
pre-warmed thaw medium into a 15 ml conical tube. Remove the vial of cells to be thawed from liquid nitrogen

-------
and thaw rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial
in water. Mix the entire content of the vial to 14 ml of pre-warmed medium and centrifuge to remove DMSO.
Discard the supernatant and transfer the precipitated cells to T175 flask using 30 ml thawing medium. Cell
Proliferation Method: Detach the cells from the flask using TrypLExpress. The cells are re-seeded in T-175 flask
at 2.5-4 million. Assay Protocol: Spin down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend
the pellet with assay medium followed by filtering through cell strainer and adjust the required cell density.
Plate the cells in black-clear bottom 1536 well plate at 3000/well/6uL through 8 tip Multidrop plate dispenser.
Incubate for 5hrs at 37C / 99 percent Humidity / 5 percent C02. Transfer 23nL of compounds from the library
collection and positive control to the assay plates through Pintool. Incubate for 17hrs at 37C / 99 percent
Humidity / 5 percent C02. Add 1 uL of CCF4 (FRET Substrate) dye using a single tip plate dispenser (BioRAPTR).
Incubate at room temperature for lhrs in the dark. Read the fluorescence intensity through Envision plate
reader using Beta-Lactamase protocol optimized for this cell type. Add 3 uL of Cell Titer Glo and Incubate at
room temperature for 0.5 hrs in dark for both agonist and antagonist mode. Read on ViewLux protocol
optimized for this cell type for both agonist and antagonist mode PPARd-bla cells were dispensed at 3000
cells/6uL/well in 1536-well black wall/clear bottom plates using a Multidrop Combi (Thermo Fisher Scientific,
Waltham, MA) dispenser. After the assay plates were incubated at 37C and 5 percent C02 for 5 h, 23 nL of
compounds dissolved in DMSO, positive controls or DMSO only was transferred to the assay plate by a Pintool
station (Kalypsys, San Diego, CA). The assay plates were incubated at 37C for 17 h. After 1 uL of LiveBLAzer B/G
FRET substrate was added using a BioRAPTR Flying Reagent Dispenser (FRD) workstation (Beckman Coulter,
Indianapolis, IN, USA), the plates were incubated at room temperature for 2 h, and fluorescence intensity was
measured by an Envision plate reader (PerkinElmer, Shelton, CT). For cell viability readout that measures
cytotoxicity, 3 uL/well of CellTiter-Glo reagent was added into the assay plates using a BioRAPTR FRD. After 30
min incubation at room temperature, the luminescence intensity in the plates was measured using a ViewLux
plate reader (PerkinElmer).

Baseline median absolute deviation for the assay (bmad): 6.048
Response cutoff threshold used to determine hit calls: 36.289
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: PPAR-delta antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

MK886

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

-------
NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
MK886 was used as a positive PPARd antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

-------
2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
636

Inactive hit count: 0
-------
quadratic-polynomialfpoly2) model: 594

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

45

225

3844

905

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-0.136

Neutral control median absolute deviation, by plate: nmad

8.803

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-6091.39%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-100.269

Positive control well median absolute deviation, by plate: pmad	7.013

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-8.697

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 905.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1126

T0X21_P PARd_B LA_Antago n ist_vi a bi I ity

1.	General Information

1.1	Assay Title: Viability Assessment in theTox21 HEK293 Peroxisome Proliferator-activated Receptor-delta (PPARd)
Antagonism Beta-lactamase Assay

1.2	Assay Summary: TOX21_PPARd_BLA_Antagonist is a cell-based, single-readout assay that uses HEK293T, a
human kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See
tox21-ppard-bla-antagonist-pl. TOX21_PPARd_BLA_Antagonist_viability is an assay readout measuring cellular
ATP content and detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
TOX21_PPARd_BLA_Antagonist_viability used a type of viability reporter where loss-of-signal activity can be
used to understand changes in the cell viability. Furthermore, this assay endpoint can be referred to as a
secondary readout, because this assay has produced multiple assay endpoints where this one serves a viability
function. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 17-hours prior to monitoring fluorescence emission resulting
from PPAR-delta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 peroxisome proliferator-activated receptor delta antagonism beta-lactamase assay screened a library
of diverse environmental compounds to probe for xenobiotic ligand-binding and potential to suppress PPAR-
delta-dependent transcription, monitored through bla reporter gene signal activation using a mammalian one-
hybrid GAL4 system. Each well contained 0.3 uM L-165,041 to stimulate receptor activity and MK886 (a
leukotriene inhibitor) served as a positive control. The assay is run in triplicate on 1536-well microplates.
HEK293T cells are plated the day of the assay and following 17-hour incubation of cells with test compounds a

-------
membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for an additional hour. Once
in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla expression
is quantified by measuring the ratio of blue (460nm, product) to green (530nm, substrate) fluorescence.
Fluorescence signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is
also incubated with test system for 30 minutes before readout to detect cell viability. Compound auto-
fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for
background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Peroxisome proliferator-activated receptor delta is a ligand-activated
nuclear receptor which is expressed ubiquitously and may have a role in regulating the differentiation of
adipocytes, in keratinocyte differentiation and in the regulation of cholesterol and lipid metabolism (Schmuth
et al. 2004, Seimandi et al. 2005). The PPAR-delta_BLA_Antagonist assay used Fluorescence Resonance Energy
Transfer (FRET) substrate to generate a ratiometric reporter response to receptor ligand-binding to allow
monitoring of PPAR-delta activity relative to a known receptor antagonist. This assay is designed to help identify
environmental compounds with a capacity for PPAR-delta interfering activity. The Tox21 PPAR-delta bla assays
are qHTS format assays which measured the ability of a chemical to interact with PPAR-delta by monitoring
modulation of fluorescence reporter gene signals. This assay utilized a human embryonic kidney cell line
(HEK293T) which expresses PPAR-delta and a one-hybrid GAL4 system to quantify xenobiotic PPAR-delta
antagonism.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer PPAR-delta 293T DA (Division-arrested)
cells and PPAR delta-UAS-bla 293T cells contain a peroxisome proliferator-activated receptor delta (PPARd)
ligand-binding domain/Gal4 DNA-binding domain chimera, stably integrated into the CellSensor UAS-bla 293T
cell line. CellSensor UAS-bla 293T contains a beta-lactamase reporter gene under control of a UAS response
element stably integrated into 293T cells. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4
(DBD)-PPAR delta (LBD) fusion protein, the protein binds to the UAS, resulting in expression of -beta-lactamase.
The HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 (Dohr et al. 1995). The transformation incorporated approximately
4.5 kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: The cells should not be grown more than 80-85% confluence.
The cell performance is affected is they are more confluent. Do not leave cells in Trypsin for more than 5 min at
RT. Handle 1536 well plate black clear bottom plates carefully by sides. Cell Thawing Method: Place 14 mL of
pre-warmed thaw medium into a 15 ml conical tube. Remove the vial of cells to be thawed from liquid nitrogen
and thaw rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial
in water. Mix the entire content of the vial to 14 ml of pre-warmed medium and centrifuge to remove DMSO.
Discard the supernatant and transfer the precipitated cells to T175 flask using 30 ml thawing medium. Cell
Proliferation Method: Detach the cells from the flask using TrypLExpress. The cells are re-seeded in T-175 flask

-------
at 2.5-4 million. Assay Protocol: Spin down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend
the pellet with assay medium followed by filtering through cell strainer and adjust the required cell density.
Plate the cells in black-clear bottom 1536 well plate at 3000/well/6uL through 8 tip Multidrop plate dispenser.
Incubate for 5hrs at 37C / 99 percent Humidity / 5 percent C02. Transfer 23nL of compounds from the library
collection and positive control to the assay plates through Pintool. Incubate for 17hrs at 37C / 99 percent
Humidity / 5 percent C02. Add 1 uL of CCF4 (FRET Substrate) dye using a single tip plate dispenser (BioRAPTR).
Incubate at room temperature for lhrs in the dark. Read the fluorescence intensity through Envision plate
reader using Beta-Lactamase protocol optimized for this cell type. Add 3 uL of Cell Titer Glo and Incubate at
room temperature for 0.5 hrs in dark for both agonist and antagonist mode. Read on ViewLux protocol
optimized for this cell type for both agonist and antagonist mode PPARd-bla cells were dispensed at 3000
cells/6uL/well in 1536-well black wall/clear bottom plates using a Multidrop Combi (Thermo Fisher Scientific,
Waltham, MA) dispenser. After the assay plates were incubated at 37C and 5 percent C02 for 5 h, 23 nL of
compounds dissolved in DMSO, positive controls or DMSO only was transferred to the assay plate by a Pintool
station (Kalypsys, San Diego, CA). The assay plates were incubated at 37C for 17 h. After 1 uL of LiveBLAzer B/G
FRET substrate was added using a BioRAPTR Flying Reagent Dispenser (FRD) workstation (Beckman Coulter,
Indianapolis, IN, USA), the plates were incubated at room temperature for 2 h, and fluorescence intensity was
measured by an Envision plate reader (PerkinElmer, Shelton, CT). For cell viability readout that measures
cytotoxicity, 3 uL/well of CellTiter-Glo reagent was added into the assay plates using a BioRAPTR FRD. After 30
min incubation at room temperature, the luminescence intensity in the plates was measured using a ViewLux
plate reader (PerkinElmer).

Baseline median absolute deviation for the assay (bmad): 7.986
Response cutoff threshold used to determine hit calls: 47.918

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: PPAR-delta antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

Additionally, this assay was annotated to the intended target family of cell cycle.

-------
3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where

-------
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: Oihitc 0.9	NA hit count: hitc^O

567	6397	2351

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	303

gain-loss (gnls) model:	409

power(pow) model:	543

linear-polynomial (polyl) model:	4312

quadratic-polynomial(poly2) model:	667

exponential-2 (exp2) model:	206

exponential-3 (exp3) model:	24

exponential-4 (exp4) model:	2284

exponential-5 (exp5) model:	567

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

-------
3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.229

Neutral control median absolute deviation, by plate: nmad	13.133

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-5705.46%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

-------
NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 567.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

-------
6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7.	Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1127

TOX21_PRARg_BLA_Antagonist_ratio

1.	General Information

1.1	Assay Title: Tox21 HEK293 Peroxisome Proliferator-activated Receptor-gamma (PPARg) Antagonism Beta-
lactamase Assay, Ratio

1.2	Assay Summary: TOX21 PPARg BLA Antagonist is a cell-based, single-readout assay that uses HEK293, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-
pparg-bla-antagonist-pl. TOX21_PPARg_BLA_Antagonist_ratio is an assay readout measuring reporter gene via
receptor activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-
lactamase reporter gene. The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) reporter gene
substrate used as the measure of target activity. Data from the assay component
TOX21_PPARg_BLA_Antagonist_ratio was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_PPARg_BLA_Antagonist_ratio, was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be
used to understand changes in the reporter gene as they relate to the gene PPARG. Furthermore, this assay
endpoint can be referred to as a primary readout, because this assay has produced multiple assay endpoints
and this ratio serves a reporter gene function to understand target activity. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 17-hours prior to monitoring fluorescence emission resulting
from PPAR-delta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_PPARg_BLA_Antagonist_ratio was designed to target nuclear receptor activity at the
protein (receptor) level, specifically mapping to PPARG gene(s) using a positive control of GW9662

The Tox21 peroxisome proliferator-activated receptor gamma antagonism beta-lactamase assay screened a
library of diverse environmental compounds to probe for xenobiotic ligand-binding and potential to suppress

-------
PPAR-gamma-dependent transcription, monitored through bla reporter gene signal activation using a
mammalian one-hybrid GAL4 system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are
plated the day of the assay and following 17-hour incubation of cells with test compounds a membrane-
permeable FRET-based substrate CCF4-AM is introduced and incubated for an additional hour. Once in the cell,
cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla expression is quantified
by measuring the ratio of blue (460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals
are monitored using an Envision plate reader. Following CCF4 incubation and detection, 3uL of CellTiter-Glo
reagent is added to each well and incubated for 30 minutes before cytotoxicity readout is measured on a
ViewLux microtiter plate reader. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Peroxisome proliferator-activated receptor gamma (PPAR-gamma) is
a ligand-activated nuclear receptor which regulates the expression of genes involved in fatty acid-oxidation and
is a major regulator of energy homeostasis. PPAR-gamma is primarily expressed in adipose tissue, macrophages
and in the colon where it controls adipocyte differentiation, lipid storage and inflammatory responses. PPAR-
gamma agonists, the thiazolidinediones (TZDs), improve insulin sensitivity, lower glucose levels, and lower
plasma triglycerides and free fatty acid (FFA) levels by enhancing their uptake into adipocytes. The
PPARg_BLA_Antagonist assay used Fluorescence Resonance Energy Transfer (FRET) substrate to generate a
ratiometric reporter response to receptor ligand-binding to allow monitoring of PPAR-gamma activity relative
to a known receptor antagonist. This assay is designed to help identify environmental compounds with a
capacity for PPAR-gamma ligand-binding activity. The Tox21 PPAR-gamma bla assays are qHTS format assays
which measured the ability of a chemical to interact with PPAR-gamma by monitoring modulation of
fluorescence reporter gene signals. This assay utilized a human embryonic kidney cell line (HEK293T) which
expresses PPAR-gamma and a one-hybrid GAL4 system to quantify xenobiotic PPAR-gamma antagonism.

2.3	Experimental System: adherent HEK293 cell line used. GeneBLAzer PPAR-gamma 293H DA (Division-arrested)
cells and PPARgamma-UAS-bla 293H cells contain a peroxisome proliferator-activated receptor gamma (PPARg)
ligand-binding domain/Gal4 DNA-binding domain chimera, stably integrated into the CellSensor UAS-bla 293H
cell line. CellSensor UAS-bla 293H contains a beta-lactamase reporter gene under control of a UAS response
element stably integrated into 293H cells. These cells stably express a beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4
(DBD)-PPARg (LBD) fusion protein, the protein binds to the UAS, resulting in expression of beta-lactamase. The
HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 1. The transformation incorporated approximately 4.5 kilobases
from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid 2. HEK293 cells are popular for their ease of
growth and transfection cells and are frequently used to produce exogenous proteins or viruses for
pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: The cells should not be grown more than 80-85% confluence.
The cell performance is affected is they are more confluent. Do not leave cells in Trypsin for more than 5 min at
RT. Handle 1536 well plate black clear bottom plates carefully by sides. Cell Thawing Method: Place 14 mL of

-------
pre-warmed thaw medium into a 15 ml conical tube. Remove the vial of cells to be thawed from liquid nitrogen
and thaw rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial
in water. Mix the entire content of the vial to 14 ml of pre-warmed medium and centrifuge to remove DMSO.
Discard the supernatant and transfer the precipitated cells to T175 flask using 30ml thawing medium. Cell
Proliferation Method: Detach the cells from the flask using TrypLExpress. The cells are re-seeded in T-175 flask
at 3-4 million. Assay Protocol: Spin down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend
the pellet with assay medium followed by filtering through cell strainer and adjust the required cell density.
Plate the cells in black-clear bottom 1536 well plate at 3000/well/6uL for agonist mode and 3000cells/well/5 ul
for antagonist mode through 8 tip Multidrop plate dispenser. Incubate for 5 hrs at 37C / 99% Humidity / 5%
C02. Transfer 23nL of compounds from the library collection and positive control to the assay plates through
pintool. Add 1 uL of buffer and luL of Agonist concentration to respective columns as per plate map for
antagonist mode. Incubate for 17 hrs at 37C / 99% Humidity / 5% C02. Add luL of CCF4 (FRET Substrate) dye
using a single tip plate dispenser (BioRAPTR). Incubate at room temperature for 1 hrs in dark. Read the
fluorescence intensity through Envision plate reader using Beta-Lactamase protocol optimized for this cell type.
Add 3 uL of Cell Titer Glo and Incubate at room temperature for 0.5 hrs in dark for antagonist mode. Read on
ViewLux protocol optimized for this cell type for antagonist mode.

Baseline median absolute deviation for the assay (bmad): 5.997
Response cutoff threshold used to determine hit calls: 35.982
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: PPAR-gamma antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

GW9662

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

NA

-------
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
GW9662 was used as a positive PPARg antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning

-------
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
858

Inactive hit count: Oihitc 0.9
5616

WINING MODEL SELECTION

NA hit count: hitc^O
2841

Number of sample-assay endpoints with winning hill model:

415
704

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

759

3452

quadratic-polynomialfpoly2) model: 890

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

289

64

2127

615

-------
For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-0.154

Neutral control median absolute deviation, by plate: nmad

10.059

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-6124.28%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

-100.233

5.506

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

NA

-------
Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-8.6

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 615.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

• Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

-------
•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7.	Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1128

T0X21_P PARg_B LA_Antago n ist_vi a bi I ity

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HEK293 Peroxisome Proliferator-activated Receptor-gamma
(PPARg), Antagonism Beta-lactamase Assay

1.2	Assay Summary: TOX21 PPARg BLA Antagonist is a cell-based, single-readout assay that uses HEK293, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-
pparg-bla-antagonist-pl. TOX21_PPARg_BLA_Antagonist_viability is an assay readout measuring cellular ATP
content and detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
TOX21_PPARg_BLA_Antagonist_viability used a type of viability reporter where loss-of-signal activity can be
used to understand changes in the cell viability. Furthermore, this assay endpoint can be referred to as a
secondary readout, because this assay has produced multiple assay endpoints where this one serves a viability
function. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publicly available
through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 17-hours prior to monitoring fluorescence emission resulting
from PPAR-delta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 peroxisome proliferator-activated receptor gamma antagonism beta-lactamase assay screened a
library of diverse environmental compounds to probe for xenobiotic ligand-binding and potential to suppress
PPAR-gamma-dependent transcription, monitored through bla reporter gene signal activation using a
mammalian one-hybrid GAL4 system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are
plated the day of the assay and following 17-hour incubation of cells with test compounds a membrane-
permeable FRET-based substrate CCF4-AM is introduced and incubated for an additional hour. Once in the cell,

-------
cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla expression is quantified
by measuring the ratio of blue (460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals
are monitored using an Envision plate reader. Following CCF4 incubation and detection, 3uL of CellTiter-Glo
reagent is added to each well and incubated for 30 minutes before cytotoxicity readout is measured on a
ViewLux microtiter plate reader. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Peroxisome proliferator-activated receptor gamma (PPAR-gamma) is
a ligand-activated nuclear receptor which regulates the expression of genes involved in fatty acid-oxidation and
is a major regulator of energy homeostasis. PPAR-gamma is primarily expressed in adipose tissue, macrophages
and in the colon where it controls adipocyte differentiation, lipid storage and inflammatory responses. PPAR-
gamma agonists, the thiazolidinediones (TZDs), improve insulin sensitivity, lower glucose levels, and lower
plasma triglycerides and free fatty acid (FFA) levels by enhancing their uptake into adipocytes. The
PPARg_BLA_Antagonist assay used Fluorescence Resonance Energy Transfer (FRET) substrate to generate a
ratiometric reporter response to receptor ligand-binding to allow monitoring of PPAR-gamma activity relative
to a known receptor antagonist. This assay is designed to help identify environmental compounds with a
capacity for PPAR-gamma ligand-binding activity. The Tox21 PPAR-gamma bla assays are qHTS format assays
which measured the ability of a chemical to interact with PPAR-gamma by monitoring modulation of
fluorescence reporter gene signals. This assay utilized a human embryonic kidney cell line (HEK293T) which
expresses PPAR-gamma and a one-hybrid GAL4 system to quantify xenobiotic PPAR-gamma antagonism.

2.3	Experimental System: adherent HEK293 cell line used. GeneBLAzer PPAR-gamma 293H DA (Division-arrested)
cells and PPARgamma-UAS-bla 293H cells contain a peroxisome proliferator-activated receptor gamma (PPARg)
ligand-binding domain/Gal4 DNA-binding domain chimera, stably integrated into the CellSensor UAS-bla 293H
cell line. CellSensor UAS-bla 293H contains a beta-lactamase reporter gene under control of a UAS response
element stably integrated into 293H cells. These cells stably express a beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4
(DBD)-PPARg (LBD) fusion protein, the protein binds to the UAS, resulting in expression of beta-lactamase. The
HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 1. The transformation incorporated approximately 4.5 kilobases
from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid 2. HEK293 cells are popular for their ease of
growth and transfection cells and are frequently used to produce exogenous proteins or viruses for
pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: The cells should not be grown more than 80-85% confluence.
The cell performance is affected is they are more confluent. Do not leave cells in Trypsin for more than 5 min at
RT. Handle 1536 well plate black clear bottom plates carefully by sides. Cell Thawing Method: Place 14 mL of
pre-warmed thaw medium into a 15 ml conical tube. Remove the vial of cells to be thawed from liquid nitrogen
and thaw rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial
in water. Mix the entire content of the vial to 14 ml of pre-warmed medium and centrifuge to remove DMSO.
Discard the supernatant and transfer the precipitated cells to T175 flask using 30ml thawing medium. Cell

-------
Proliferation Method: Detach the cells from the flask using TrypLExpress. The cells are re-seeded in T-175 flask
at 3-4 million. Assay Protocol: Spin down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend
the pellet with assay medium followed by filtering through cell strainer and adjust the required cell density.
Plate the cells in black-clear bottom 1536 well plate at 3000/well/6uL for agonist mode and 3000cells/well/5 ul
for antagonist mode through 8 tip Multidrop plate dispenser. Incubate for 5 hrs at 37C / 99% Humidity / 5%
C02. Transfer 23nL of compounds from the library collection and positive control to the assay plates through
pintool. Add 1 ul of buffer and luL of Agonist concentration to respective columns as per plate map for
antagonist mode. Incubate for 17 hrs at 37C / 99% Humidity / 5% C02. Add luL of CCF4 (FRET Substrate) dye
using a single tip plate dispenser (BioRAPTR). Incubate at room temperature for 1 hrs in dark. Read the
fluorescence intensity through Envision plate reader using Beta-Lactamase protocol optimized for this cell type.
Add 3 ul of Cell Titer Glo and Incubate at room temperature for 0.5 hrs in dark for antagonist mode. Read on
ViewLux protocol optimized for this cell type for antagonist mode.

Baseline median absolute deviation for the assay (bmad): 9.131
Response cutoff threshold used to determine hit calls: 54.787

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: PPAR-gamma antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1 Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

-------
3.2 Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were

-------
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 9315

Active hit count: hitc>0.9
497

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.209

Neutral control median absolute deviation, by plate: nmad	13.812

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-6294.12%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

-------
{(mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 482.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7. Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1185

T0X21_A R E_B LA_a go n i st_v lability

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HepG2 Nuclear Erythroid 2-Related factor 2/Antioxidant Response
Element (Nrf2/ARE) Agonism Beta-lactamase Assay

1.2	Assay Summary: TOX21_ARE_BLA_Agonist is a cell-based, single-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-are-bla-pl.
TOX21_ARE_BLA_Agonist_viability is an assay readout measuring cellular ATP content and detected with
CellTiter-Glo Luciferase-coupled ATP quantitation. TOX21_ARE_BLA_Agonist_viability used a type of viability
reporter where loss-of-signal activity can be used to understand changes in the cell viability. Furthermore, this
assay endpoint can be referred to as a secondary readout, because this assay has produced multiple assay
endpoints where this one serves a viability function. To generalize the intended target to other relatable targets,
this assay endpoint is annotated to the cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. ARE-bla HepG2 cells are dispensed at 2000 cells/5 uL well in 1536-well
black, clear-bottom plates and incubated for 6 hours at 37C. Compounds are plated using a Wako Pintool station
and incubated for 16 hours. After a 16 hour incubation, 1 uLof LiveBLAzer (Life Technologies) detection mix was
added to each well and the plates are subsequently incubated at RT for 2 h.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 antioxidant response element agonism beta-lactamase assay screened a library of diverse
environmental compounds to identify agonists that induce oxidative stress, monitored through bla reporter
gene signal activation using a mammalian one-hybrid GAL4 system. The assay is run in triplicate on 1536-well
microplates. HepG2 cells are plated the day of the assay and following 16 hour incubation of cells with test
compounds a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla

-------
expression is quantified by measuring the ratio of blue (460nm, product) to green (530nm, substrate)
fluorescence. Fluorescence signals are monitored using an Envision plate reader. Each compound was tested in
a concentration-response format, using 11 concentrations ranging from 1.6 nM to 92 uM. Concentration-
response relationships were determined by monitoring FRET signals relative to DMSO-only exposures which
provided a signal baseline, and to a known antioxidant response element agonist (Beta-Naphthoflavone) as a
positive control which provided a reference for 100 percent androgen receptor inhibition. Compound auto-
fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for
background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Oxidative stress has been implicated in the pathogenesis of a variety
of diseases ranging from cancer to neurodegeneration. The antioxidant response element (ARE) signaling
pathway plays an important role in the amelioration of oxidative stress. The CellSensor ARE-bla HepG2 cell line
(Invitrogen) can be used for analyzing the Nrf2/antioxidant response signaling pathway. Nrf2 (NF-E2-related
factor 2) and Nrfl are transcription factors that bind to AREs and activate these genes.

2.3	Experimental System: adherent HepG2 cell line used. Hep G2 is an immortal cell line which was derived in 1975
from the liver tissue of a 15-year-old Caucasian male from Argentina with a well-differentiated hepatocellular
carcinoma. These cells are epithelial in morphology, have a modal chromosome number of 55, and are not
tumorigenic in nude mice.

2.4	Metabolic Competence: The parental HepG2 cell line has been shown by others to retain the potential for Phase
I and Phase II metabolic responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A,
2E1, and 3A4/5 with CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human
hepatocytes although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have
even been observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2
cells include SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7).
In addition, HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53
protein (Boehme et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant
response element (ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a
number of ATP-binding cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound
proteins also regulated in part by Nrf2TF DNA-binding).

2.5	Exposure Regime: Quality Control Precautions: Cells should be grown to reach 60 to 75% confluence. Handle
the 1536-well, black-wall, clear-bottom assay plate by the sides; do not touch the clear bottom of the assay
plate. Cell Media Required: Growth (90% DMEM with GlutaMAX, 10% Dialyzed FBS, 0.1 mM NEAA, 25mM
HEPES, lOOU/mL/lOOug/mL Penicillin/Streptomycin, 5 ug/mL Blasticidin), Assay (99% DMEM with GlutaMAX,
10% Dialyzed FBS, 0.1 mM NEAA, 25mM HEPES, lOOU/mL/lOOug/mL Penicillin/Streptomycin), Thaw (90%
DMEM with GlutaMAX, 1% Dialyzed FBS, 0.1 mM NEAA, 25mM HEPES, lOOU/mL/lOOug/mL
Penicillin/Streptomycin), Freezing (100% Recovery Cell freezing medium) Cell Thawing Method: Place 14 mL of
pre-warmed thaw medium into a T75 flask. Remove the vial of cells to be thawed from liquid nitrogen and thaw
rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial in water.
Decontaminate the vial by wiping with 70% ethanol before opening in a biological safety cabinet. Transfer the
vial contents drop-wise into 10 mL of Thaw Medium in a sterile 15-mL conical tub. Centrifuge cells at 900 rpm
for 4 minutes and resuspend in thaw medium. Transfer contents to the T75 tissue culture flask containing Thaw

-------
Medium and place flask in a humidified 37C/5% C02 incubator. Switch to growth medium at first passage. Cell
Proliferation Method: Aspirate medium, rinse once in DPBS, add 0.25% Trypsin/EDTA and swirl to coat the cell
evenly. Add an equal volume of Growth Medium to inactivate Trypsin after 2-3 minutes incubation at 37C.
Centrifuge cells at 900 rpm for 4 minutes and resuspend in growth medium. Cell should be passage at least twice
a week. Assay Protocol: Harvest cells from culture in growth medium and resuspend in assay medium. Dispense
2000 cells/5uL/well into 1536-well tissue treated black/clear bottom plates using a BioRAPTR dispenser. After
the cells were incubated at 37C for 5 hours, 23 nL of positive controls or compounds were transferred to the
assay plate by a PinTool resulting in a 217-fold dilution. The final compound concentration in the 5 ul assay
volume ranged from 1.2 nM to 92 uM in 15 concentrations. Incubate the plates for 16 hours at 37C. Add 1 uL of
6X LiveBLAzer-FRET B/G (CCF4-AM) Substrate Mixture to each well using a BioRAPTR dispenser and incubate
the plate at room temperature for 2 hours. Measure fluorescence intensity at 460 and 530 nm emission and
405 nm excitation by an Envision detector. Data is expressed as the ratio of 460nm/530nm emissions.

Baseline median absolute deviation for the assay (bmad): 4.933
Response cutoff threshold used to determine hit calls: 29.598

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Antioxidant response element (ARE) signaling pathway agonism is monitored by FRET emission
resulting from GAL4/-beta-lactamase gene expression. The cytotoxicity of the compounds screened was
measured in parallel by measuring the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation
in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

-------
3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates

-------
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: Oihitc 0.9	NA hit count: hitc^O

802	6091	2422

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	316

gain-loss (gnls) model:	365

power(pow) model:	447

linear-polynomial (polyl) model:	4010

quadratic-polynomial(poly2) model:	637

exponential-2 (exp2) model:	315

exponential-3 (exp3) model:	30

exponential-4 (exp4) model:	2601

exponential-5 (exp5) model:	594

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3 Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In

-------
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.208

Neutral control median absolute deviation, by plate: nmad	6.674

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3112.86%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-0.296

Positive control well median absolute deviation, by plate: pmad	12.333

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.005

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

-------
Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 594.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Xia M, Huang R, Shi Q, Boyd WA, Zhao J, Sun N, Rice J R, Dunlap PE, Hackstadt AJ, Bridge MF, Smith
MV, Dai S, Zheng W, Chu PH, Gerhold D, Witt KL, DeVito M, Freedman JH, Austin CP, Houck KA, Thomas RS,
Paules RS, Tice RR, Simeonov A. Comprehensive Analyses and Prioritization of Tox21 10K Chemicals Affecting
Mitochondrial Function by in-Depth Mechanistic Studies. Environ Health Perspect. 2018 Jul 26;126(7):077010.
doi: 10.1289/EHP2589. PMID: 30059008; PMCID: PMC6112376.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1188

T0X21_FXR_B LA_agon ist_via bi I ity

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HEK293T Farnesoid-X-Receptor (FXR) Agonism Beta-lactamase
Assay

1.2	Assay Summary: TOX21_FXR_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-fxr-
bla-agonist-p2. TOX21_FXR_BLA_Agonist_viability is an assay readout measuring cellular ATP content and
detected with CellTiter-Glo Luciferase-coupled ATP quantitation. TOX21_FXR_BLA_Agonist_viability used a type
of viability reporter where loss-of-signal activity can be used to understand changes in the cell viability.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a viability function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily
is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic FXR target gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 farnesoid-X-receptor (FXR) beta-lactamase agonism assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to induce farnesoid-X-receptor
signaling, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4 system.
The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with test
compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla

-------
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. The farnesoid-X-receptor (FXR), a nuclear hormone receptor, is highly
expressed in liver, intestine, kidney and adrenal cortex. Natural ligands of FXR are the bile acids (Cholic acid,
Chenodeoxy cholic acid etc). FXR is an important regulator of diverse metabolic pathways, including the bile acid
homeostasis, lipid and glucose metabolisms. FXR regulates the expression of target genes by binding either as a
monomer or as a heterodimer with the retinoid X receptor (RXR). Numerous studies have reported that FXR
exerts protective function during cholestasis, diabetes, liver regeneration, and cancer. To identify compounds
that activate FXR signaling, GeneBLAzer FXR-UAS-bla HEK 293T cell line (Invitrogen, Carlsbad, CA) containing a
beta-lactamase reporter gene under the control of a UAS response element was used to screen the Tox2110K
compound library.

2.3	Experimental System: adherent HEK293T cell line used. The FXR-HEK293 line is a stably transfected cell line
containing an intact FXR signaling pathway. FXR multiple hormone response element (MHRE) reporters were
introduced into HEK293 cells that express endogenous FXR. The HEK-293 cell line is a human embryonic kidney
cell line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973 (Graham
et al. 1977). The transformation incorporated approximately 4.5 kilobases from the viral genome into human
chromosome 19 of the HEK cells, and subsequent cytogenetic characterization established that the 293 line is
pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of growth and transfection cells and
are frequently used to produce exogenous proteins or viruses for pharmaceutical and biomedical research
purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cells should be grown to reach 80 to 90% confluence. Cell
Thawing Method: Place 14 mL of pre-warmed thaw medium into a 15 mL of conical tub. Remove the vial of cells
to be thawed from liquid nitrogen and thaw rapidly by placing at 37C in a water bath with gentle agitation for
1-2 minutes. Decontaminate the vial by wiping with 70% ethanol before opening in a biological safety cabinet.
Transfer the vial contents drop-wise into 14 mL of thaw medium in a sterile 15-mL conical tub. Centrifuge cells
at 900 rpm for 4 minutes and resuspend in thaw medium. Transfer contents to the T75 tissue culture flask
containing Thaw Medium and place flask in a humidified 37C/5% C02 incubator. Switch to growth medium at
first passage. Cell Proliferation Method: Aspirate medium, rinse once in DPBS, add 0.05% Trypsin/EDTA and
swirl to coat the cell evenly. Add an equal volume of Growth Medium to inactivate Trypsin after 2-3 minutes
incubation at 37C. Centrifuge cells at 900 rpm for 4 minutes and resuspend in growth medium. Cell should be
passage at least twice a week. Assay Protocol: Harvest cells from culture and resuspend in assay medium.
Dispense 5000 cells/5uL/well into 1536-well tissue treated black, clear-bottom plates using a Multi-drop
dispenser. After the cells were incubated at 37C for 5 hours, 23 nL of control or compounds dissolved in DMSO
were transferred to the assay plate by a PinTool resulting in a 217-fold dilution. Incubate the plates for 16 hours
at 37C. Add 1 uL of 6X LiveBLAzer-FRET B/G (CCF4-AM). Substrate Mixture to each well using a BioRAPTR
dispenser. After two hours incubation at room temperature, measure fluorescence intensity at 460 and 530 nm

-------
emission and 405 nm excitation by an Envision detector. Data is expressed as the ratio of 460nm/530nm
emissions.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 8.915
Response cutoff threshold used to determine hit calls: 53.49

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Farnesoid-X-Receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where

-------
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.501

Neutral control median absolute deviation, by plate: nmad	14.6

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-2851.28%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-46.406

Positive control well median absolute deviation, by plate: pmad	15.401

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.999

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 552.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Hsu CW, Zhao J, Huang R, Hsieh JH, Hamm J,
Chang X, Houck K, Xia M. Quantitative high-throughput profiling of environmental chemicals and drugs that
modulate farnesoid X receptor. Sci Rep. 2014 Sep 26;4:6437. doi: 10.1038/srep06437. PMID: 25257666; PMCID:
PMC4894417.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1189

TOX21_ERa_BLA_Antagonist_chl

1.	General Information

1.1	Assay Title: Tox21 HEK293T Estrogen Receptor-alpha (ESR1) Antagonism Beta-lactamase Assay, Channel 1
Readout of Uncleaved Substrate

1.2	Assay Summary: TOX21 ERa BLA Antagonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-er-
bla-antagonist-pl. TOX21_ERa_BLA_Antagonist_chl is an assay readout measuring reporter gene via receptor
activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase
reporter gene. The signal is derived from the uncleaved reporter gene substrate and is used to calculate the
ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay
component TOX21_ERa_BLA_Antagonist_chl was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_ERa_BLA_Antagonist_chl, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, increased activity can be used to
understand changes in the reporter gene as they relate to the gene ESR1. Furthermore, this assay endpoint can
be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic ERalpha gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ERa_BLA_Antagonist_chl was designed to measure uncleaved reporter gene substrate to
target nuclear receptor activity at the protein (receptor) level, specifically mapping to ESR1 gene(s) using a
positive control of 4-hydroxytamoxifen

The Tox21 estrogen receptor-alpha antagonism beta-lactamase assay screened a library of diverse
environmental compounds to probe for xenoestrogenic ligand-binding and potential to suppress estrogen-
dependent transcription, monitored through changes to bioluminescence signals produced from an enzymatic

-------
reaction catalyzed by luciferase between the key substrate [CellTiter-Glo] and the target cofactor [ATP], Each
well contained 0.5nM -beta-estradiol as an ER stimulator and measured the loss-of-signal as compared to
positive control of tetraoctylammonium bromide (100 percent inhibition). The assay is run in triplicate on 1536-
well microplates. Following 18 hour incubation of cells with test compounds, a membrane-permeable FRET-
based substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap
the negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of
blue (product) to green (substrate) fluorescence. Fluorescence signals are monitored using an Envision plate
reader and CellTiter-Glo assay reagent (Promega) is also incubated with test system for 30 minutes before
readout to detect cell viability. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer ER alpha-UAS-bla GripTite cells contain the
ligand-binding domain of the human estrogen receptor alpha (ERalpha) fused to the DNA-binding domain of
GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Cell Thawing Method: 1ml frozen cells of ERalpha-bla were taken in pre-warmed 10ml of thaw
medium for centrifuging. 3ml of the thaw medium is taken to resuspend the pellet. The cells were seeded in T-
75 flask at 2 million cells. Cell Proliferation Method: The cells are detached using 0.05 percent Trypsin. Cells are
further passaged at a density of 4-5 million cells per T-225 flask. Assay Protocol: Rinse the cells twice with DPBS
and detach them using 0.05 percent Trypsin and centrifuge. Resuspend the pellet with assay buffer. Plate the
cells in black-clear bottom 1536 well plate at 5000/well/5uL through 8 tip plate dispenser (Multi drop). Incubate
at 37Cfor 5hrs. Transfer 23nLof the compounds from the library collection and positive control through Pintool.
Add luL of assay buffer with or without 0.5nM (final) Beta-estradiol. Incubate at 37Cfor 18hrs. Add luLof CCF4
dye using a single tip of a plate dispenser (BioRAPTR). Incubate at room temperature for 2hrs. Read the
fluorescence intensity through Envision plate reader. Add 4uL of CellTiter-Glo reagent using a single tip of a plate
dispenser (BioRAPTR). Incubate at room temperature for 30min. Read the luminescence through ViewLux plate
reader.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

4-hydroxytamoxifen

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 4.001
Response cutoff threshold used to determine hit calls: 24.004
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

-------
2.6	Response: Estrogen receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
4-hydroxytamoxifen was used as a positive ERa antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496

Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
647

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.304

Neutral control median absolute deviation, by plate: nmad	9.915

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3526.97%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	25.235

Positive control well median absolute deviation, by plate: pmad	9.989

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	1.865

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 650.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Huang R, Sakamuru S, Martin MT, Reif DM,
Judson RS, Houck KA, Casey W, Hsieh JH, Shockley KR, Ceger P, Fostel J, Witt KL, Tong W, Rotroff DM, Zhao T,
Shinn P, Simeonov A, Dix DJ, Austin CP, Kavlock RJ, Tice RR, Xia M. Profiling of the Tox2110K compound library
for agonists and antagonists of the estrogen receptor alpha signaling pathway. Sci Rep. 2014 Jul 11;4:5664. doi:
10.1038/srep05664. PMID: 25012808; PMCID: PMC4092345.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1190

TOX21_ERa_BLA_Antagonist_ch2

1.	General Information

1.1	Assay Title: Tox21 HEK293T Estrogen Receptor-alpha (ESR1) Antagonism Beta-lactamase Assay, Channel 2
Readout of Cleaved Substrate

1.2	Assay Summary: TOX21 ERa BLA Antagonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-er-
bla-antagonist-pl. TOX21_ERa_BLA_Antagonist_ch2 is an assay readout measuring reporter gene via receptor
activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase
reporter gene. The signal is derived from the cleaved reporter gene substrate and is used to calculate the ratio
of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay
component TOX21_ERa_BLA_Antagonist_ch2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_ERa_BLA_Antagonist_ch2, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene ESR1. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic ERalpha gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ERa_BLA_Antagonist_ch2 was designed to measure cleaved reporter gene substrate to
target nuclear receptor (steroidal) activity at the protein (receptor) level, specifically mapping to ESR1 gene(s)
using a positive control of 4-hydroxytamoxifen

The Tox21 estrogen receptor-alpha antagonism beta-lactamase assay screened a library of diverse
environmental compounds to probe for xenoestrogenic ligand-binding and potential to suppress estrogen-
dependent transcription, monitored through changes to bioluminescence signals produced from an enzymatic

-------
reaction catalyzed by luciferase between the key substrate [CellTiter-Glo] and the target cofactor [ATP], Each
well contained 0.5nM -beta-estradiol as an ER stimulator and measured the loss-of-signal as compared to
positive control of tetraoctylammonium bromide (100 percent inhibition). The assay is run in triplicate on 1536-
well microplates. Following 18 hour incubation of cells with test compounds, a membrane-permeable FRET-
based substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap
the negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of
blue (product) to green (substrate) fluorescence. Fluorescence signals are monitored using an Envision plate
reader and CellTiter-Glo assay reagent (Promega) is also incubated with test system for 30 minutes before
readout to detect cell viability. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer ER alpha-UAS-bla GripTite cells contain the
ligand-binding domain of the human estrogen receptor alpha (ERalpha) fused to the DNA-binding domain of
GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Call Thawing Method: 1ml frozen cells of ERalpha-bla were taken in pre-warmed 10ml of thaw
medium for centrifuging. 3ml of the thaw medium is taken to resuspend the pellet. The cells were seeded in T-
75 flask at 2 million cells. Cell Proliferation Method: The cells are detached using 0.05 percent Trypsin. Cells are
further passaged at a density of 4-5 million cells perT-225 flask. Assay Protocol: Rinse the cells twice with DPBS
and detach them using 0.05 percent Trypsin and centrifuge. Resuspend the pellet with assay buffer. Plate the
cells in black-clear bottom 1536 well plate at 5000/well/5uL through 8 tip plate dispenser (Multi drop). Incubate
at 37Cfor 5hrs. Transfer 23nLof the compounds from the library collection and positive control through Pintool.
Add luL of assay buffer with or without 0.5nM (final) Beta-estradiol. Incubate at 37Cfor 18hrs. Add luLof CCF4
dye using a single tip of a plate dispenser (BioRAPTR). Incubate at room temperature for 2hrs. Read the
fluorescence intensity through Envision plate reader. Add 4uL of CellTiter-Glo reagent using a single tip of a plate
dispenser (BioRAPTR). Incubate at room temperature for 30min. Read the luminescence through ViewLux plate
reader.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

4-hydroxytamoxifen

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 6.489
Response cutoff threshold used to determine hit calls: 38.936
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

-------
2.6	Response: Estrogen receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
4-hydroxytamoxifen was used as a positive ERa antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

-------
Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496

Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
581

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.613

Neutral control median absolute deviation, by plate: nmad	20.006

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3383.5%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-100.071

Positive control well median absolute deviation, by plate: pmad	2.723

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-4.912

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 691.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Huang R, Sakamuru S, Martin MT, Reif DM,
Judson RS, Houck KA, Casey W, Hsieh JH, Shockley KR, Ceger P, Fostel J, Witt KL, Tong W, Rotroff DM, Zhao T,
Shinn P, Simeonov A, Dix DJ, Austin CP, Kavlock RJ, Tice RR, Xia M. Profiling of the Tox2110K compound library
for agonists and antagonists of the estrogen receptor alpha signaling pathway. Sci Rep. 2014 Jul 11;4:5664. doi:
10.1038/srep05664. PMID: 25012808; PMCID: PMC4092345.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1191

T0X21_FXR_B LA_Antagon ist_ch 1

1.	General Information

1.1	Assay Title: Tox21 HEK293T Farnesoid-X-Receptor (FXR) Antagonism Beta-lactamase Assay, Channel 1 Readout
of Uncleaved Substrate

1.2	Assay Summary: TOX21 FXR BLA Antagonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-fxr-
bla-antagonist-pl. TOX21_FXR_BLA_Antagonist_chl is an assay readout measuring reporter gene via receptor
activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase
reporter gene. The signal is derived from the uncleaved reporter gene substrate and is used to calculate the
ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay
component TOX21_FXR_BLA_Antagonist_chl was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_FXR_BLA_Antagonist_chl, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter,increased activity can be used to
understand changes in the reporter gene as they relate to the gene NR1H4. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic FXR target gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_FXR_BLA_Antagonist_chl was designed to measure uncleaved reporter gene substrate to
target nuclear receptor activity at the protein (receptor) level, specifically mapping to NR1H4 gene(s) using a
positive control of Guggulsterone

The Tox21 farnesoid-X-receptor (FXR) beta-lactamase antagonism assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to suppress farnesoid-X-
receptor signaling, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4

-------
system. The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with test
compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. The FXR-HEK293 line is a stably transfected cell line
containing an intact FXR signaling pathway. FXR multiple hormone response element (MHRE) reporters were
introduced into HEK293 cells that express endogenous FXR. The HEK-293 cell line is a human embryonic kidney
cell line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973 (Graham
et al. 1977). The transformation incorporated approximately 4.5 kilobases from the viral genome into human
chromosome 19 of the HEK cells, and subsequent cytogenetic characterization established that the 293 line is
pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of growth and transfection cells and
are frequently used to produce exogenous proteins or viruses for pharmaceutical and biomedical research
purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cells should be grown to reach 80 to 90% confluence. Cell
Thawing Method: Place 14 mL of pre-warmed thaw medium into a 15 mL of conical tub. Remove the vial of cells
to be thawed from liquid nitrogen and thaw rapidly by placing at 37C in a water bath with gentle agitation for
1-2 minutes. Decontaminate the vial by wiping with 70% ethanol before opening in a biological safety cabinet.
Transfer the vial contents drop-wise into 14 mL of thaw medium in a sterile 15-mL conical tub. Centrifuge cells
at 900 rpm for 4 minutes and resuspend in thaw medium. Transfer contents to the T75 tissue culture flask
containing Thaw Medium and place flask in a humidified 37C/5% C02 incubator. Switch to growth medium at
first passage. Cell Proliferation Method: Aspirate medium, rinse once in DPBS, add 0.05% Trypsin/EDTA and
swirl to coat the cell evenly. Add an equal volume of Growth Medium to inactivate Trypsin after 2-3 minutes
incubation at 37C. Centrifuge cells at 900 rpm for 4 minutes and resuspend in growth medium. Cell should be
passage at least twice a week. Assay Protocol: Harvest cells from culture in growth medium and resuspend in
assay medium. Dispense 5000 cells/5uL/well into 1536-well tissue treated black, clear-bottom plates using a
Multi-drop dispenser. After the cells were incubated at 37Cfor 5 hours, 23 nLof control or compounds dissolved
in DMSO were transferred to the assay plate by a PinTool resulting in a 217-fold dilution. Add 1 uL of agonist
(CDCA) at 300 nM in assay medium to the column 1-2 and column. Add 1 uL of assay medium to the column 3-
4. Incubate the plates for 16 hours at 37C. Add 1 uL of 6X LiveBLAzer-FRET B/G (CCF4-AM) Substrate Mixture to
each well using a BioRAPTR dispenser. After two hours incubation at room temperature, measure fluorescence
intensity at 460 and 530 nm emission and 405 nm excitation by an Envision detector. Data is expressed as the
ratio of 460nm/530nm emissions.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

Guggulsterone

Baseline median absolute deviation for the assay (bmad): 3.494

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

-------
Response cutoff threshold used to determine hit calls: 20.963
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Farnesoid-X-Receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Guggulsterone was used as a positive FXR antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315

Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
508

Inactive hit count: 0
-------
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed

-0.23
6.871
-2761.08%

74.371
16.1

Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	3.94

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 479.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Hsu CW, Zhao J, Huang R, Hsieh JH, Hamm J,
Chang X, Houck K, Xia M. Quantitative high-throughput profiling of environmental chemicals and drugs that
modulate farnesoid X receptor. Sci Rep. 2014 Sep 26;4:6437. doi: 10.1038/srep06437. PMID: 25257666; PMCID:
PMC4894417.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1192

TOX21_FXR_BLA_Antagonist_ch2

1.	General Information

1.1	Assay Title: Tox21 HEK293T Farnesoid-X-Receptor (FXR) Antagonism Beta-lactamase Assay, Channel 2 Readout
of Cleaved Substrate

1.2	Assay Summary: TOX21 FXR BLA Antagonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-fxr-
bla-antagonist-pl. TOX21_FXR_BLA_Agonist_ch2 is an assay readout measuring reporter gene via receptor
activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase
reporter gene. The signal is derived from the cleaved reporter gene substrate and is used to calculate the ratio
of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay
component TOX21_FXR_BLA_Antagonist_ch2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_FXR_BLA_Antagonist_ch2, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene NR1H4. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic FXR target gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_FXR_BLA_Antagonist_ch2 was designed to measure cleaved reporter gene substrate to
target nuclear receptor activity at the protein (receptor) level, specifically mapping to NR1H4 gene(s) using a
positive control of Guggulsterone

The Tox21 farnesoid-X-receptor (FXR) beta-lactamase antagonism assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to suppress farnesoid-X-
receptor signaling, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4

-------
system. The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with test
compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. The FXR-HEK293 line is a stably transfected cell line
containing an intact FXR signaling pathway. FXR multiple hormone response element (MHRE) reporters were
introduced into HEK293 cells that express endogenous FXR. The HEK-293 cell line is a human embryonic kidney
cell line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973 (Graham
et al. 1977). The transformation incorporated approximately 4.5 kilobases from the viral genome into human
chromosome 19 of the HEK cells, and subsequent cytogenetic characterization established that the 293 line is
pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of growth and transfection cells and
are frequently used to produce exogenous proteins or viruses for pharmaceutical and biomedical research
purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cells should be grown to reach 80 to 90% confluence. Cell
Thawing Method: Place 14 mL of pre-warmed thaw medium into a 15 mL of conical tub. Remove the vial of cells
to be thawed from liquid nitrogen and thaw rapidly by placing at 37C in a water bath with gentle agitation for
1-2 minutes. Decontaminate the vial by wiping with 70% ethanol before opening in a biological safety cabinet.
Transfer the vial contents drop-wise into 14 mL of thaw medium in a sterile 15-mL conical tub. Centrifuge cells
at 900 rpm for 4 minutes and resuspend in thaw medium. Transfer contents to the T75 tissue culture flask
containing Thaw Medium and place flask in a humidified 37C/5% C02 incubator. Switch to growth medium at
first passage. Cell Proliferation Method: Aspirate medium, rinse once in DPBS, add 0.05% Trypsin/EDTA and
swirl to coat the cell evenly. Add an equal volume of Growth Medium to inactivate Trypsin after 2-3 minutes
incubation at 37C. Centrifuge cells at 900 rpm for 4 minutes and resuspend in growth medium. Cell should be
passage at least twice a week. Assay Protocol: Harvest cells from culture in growth medium and resuspend in
assay medium. Dispense 5000 cells/5uL/well into 1536-well tissue treated black, clear-bottom plates using a
Multi-drop dispenser. After the cells were incubated at 37Cfor 5 hours, 23 nLof control or compounds dissolved
in DMSO were transferred to the assay plate by a PinTool resulting in a 217-fold dilution. Add 1 uL of agonist
(CDCA) at 300 nM in assay medium to the column 1-2 and column. Add 1 uL of assay medium to the column 3-
4. Incubate the plates for 16 hours at 37C. Add 1 uL of 6X LiveBLAzer-FRET B/G (CCF4-AM) Substrate Mixture to
each well using a BioRAPTR dispenser. After two hours incubation at room temperature, measure fluorescence
intensity at 460 and 530 nm emission and 405 nm excitation by an Envision detector. Data is expressed as the
ratio of 460nm/530nm emissions.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

Guggulsterone

Baseline median absolute deviation for the assay (bmad): 6.684

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

-------
Response cutoff threshold used to determine hit calls: 40.102
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Farnesoid-X-Receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Guggulsterone was used as a positive FXR antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315

Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
713

Inactive hit count: 0
-------
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed

Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)

Positive control signal-to-background: (pmed/nmed)

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed

Negative control well median absolute deviation value, by plate: mmad

Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

-0.464
15.342
-3454.52%

-100.697
12.081

NA

-5.065

NA
NA

NA
NA
NA

NA

NA

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 555.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Hsu CW, Zhao J, Huang R, Hsieh JH, Hamm J,
Chang X, Houck K, Xia M. Quantitative high-throughput profiling of environmental chemicals and drugs that
modulate farnesoid X receptor. Sci Rep. 2014 Sep 26;4:6437. doi: 10.1038/srep06437. PMID: 25257666; PMCID:
PMC4894417.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1193

T0X21_G R_BLA_Antagon ist_ch 1

1.	General Information

1.1	Assay Title: Tox21 HeLa Glucocorticoid Receptor (GR) Antagonism Beta-lactamase Assay, Channel 1 Readout of
Uncleaved Substrate

1.2	Assay Summary: TOX21 GR BLA Antagonist is a cell-based, single-readout assay that uses HeLa, a human cervix
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-gr-hela-
bla-antagonist-pl. TOX21_GR_BLA_Antagonist_chl is an assay readout measuring reporter gene via receptor
activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase
reporter gene. The signal is derived from the uncleaved reporter gene substrate and is used to calculate the
ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay
component TOX21_GR_BLA_Antagonist_chl was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_GR_BLA_Antagonist_chl, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, increased activity can be used to
understand changes in the reporter gene as they relate to the gene NR3C1. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HeLa cells are aliquoted into 1536-well microtiter plates
at a concentration of 1500 cells/6 uL and incubated with test compounds for 18 hours prior to monitoring
fluorescence emission resulting from xenobiotic glucocorticoid receptor expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_GR_BLA_Antagonist_chl was designed to measure uncleaved reporter gene substrate to
target nuclear receptor activity at the protein (receptor) level, specifically mapping to NR3C1 gene(s) using a
positive control of Mifepristone

The Tox21 glucocorticoid receptor (GR) beta-lactamase antagonism assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to suppress glucocorticoid
receptor signaling, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4

-------
system. The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with test
compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HeLa cell line used. GR-bla HeLa cells are engineered to express beta-lactamase
under the control of glucocorticoid receptor. These cells stably express a -beta-lactamase reporter gene under
the transcriptional control of an upstream activator sequence (UAS).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cells should not be grown more than 80-85% confluence. The
cell performance is affected is they are more confluent. Handle 1536 well plate black clear bottom plates
carefully by sides. Cell Thawing Method: Place 14 mL of pre-warmed thaw medium into a 15 ml conical tube.
Remove the vial of cells to be thawed from liquid nitrogen and thaw rapidly by placing at 37C in a water bath
with gentle agitation for 1-2 minutes. Do not submerge vial in water. Mix the entire content of the vial to 14 ml
of pre-warmed medium and centrifuge to remove DMSO. Discard the supernatant and transfer the precipitated
cells to T175 flask using 30 ml thawing medium. Cell Proliferation Method: Detach the cells from the flask using
TrypLExpress. The cells are re-seeded in T-175 flask at 1-1.3 million. Assay Protocol: Spin down the cells after
rinsing the cells with DPBS and trypsinizing. Resuspend the pellet with assay medium. Plate the cells in black-
clear bottom 1536 well plate at 1500/well/6uL through 8 tip Multidrop plate dispenser. Incubate for 4 hrs at
37C / 99% Humidity / 5% C02. Transfer 23nL of compounds from the library collection and positive control to
the assay plates through pintool. -Add 1 uL of buffer and luL of Agonist concentration to respective columns as
per plate map. Incubate for 18 hrs at 37C / 99% Humidity / 5% C02. Add luL of CCF4 (FRET Substrate) dye using
a single tip plate dispenser (BioRAPTR). Incubate at room temperature for 2 hrs in dark. Add 3 uL of Cell Titer
Glo and Incubate at room temperature for 0.5 hrs in dark. Read on ViewLux protocol optimized for this cell type.

Baseline median absolute deviation for the assay (bmad): 4.323
Response cutoff threshold used to determine hit calls: 25.939
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6 Response: Glucocorticoid receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

Mifepristone

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Mifepristone was used as a positive GR antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.), 6: resp.multnegl (Multiply the
normalized response value (resp) by -1; -l*resp.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496

Number of chemicals tested: 8305

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
37

Inactive hit count: Oihitc 0.9
7273

WINING MODEL SELECTION

NA hit count: hitc^O
3186

Number of sample-assay endpoints with winning hill model:

240
484

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

410

4894

quadratic-polynomialfpoly2) model: 880

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

267

587

2722

12

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.112

Neutral control median absolute deviation, by plate: nmad	6.94

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-6362.34%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	10.171

Positive control well median absolute deviation, by plate: pmad	7.33

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	0.974

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 587.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1194

T0X21_G R_BLA_Antagon ist_ch2

1.	General Information

1.1	Assay Title: Tox21 HeLa Glucocorticoid Receptor (GR) Antagonism Beta-lactamase Assay, Channel 2 Readout of
Cleaved Substrate

1.2	Assay Summary: TOX21 GR BLA Antagonist is a cell-based, single-readout assay that uses HeLa, a human cervix
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-gr-hela-
bla-antagonist-pl. TOX21_GR_BLA_Agonist_ch2 is an assay readout measuring reporter gene via receptor
activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase
reporter gene. The signal is derived from the cleaved reporter gene substrate and is used to calculate the ratio
of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay
component TOX21_GR_BLA_Antagonist_ch2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_GR_BLA_Antagonist_ch2, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene NR3C1. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HeLa cells are aliquoted into 1536-well microtiter plates
at a concentration of 1500 cells/6 uL and incubated with test compounds for 18 hours prior to monitoring
fluorescence emission resulting from xenobiotic glucocorticoid receptor expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_GR_BLA_Antagonist_ch2 was designed to measure cleaved reporter gene substrate to
target nuclear receptor (non-steroidal) activity at the protein (receptor) level, specifically mapping to NR3C1
gene(s) using a positive control of Mifepristone

The Tox21 glucocorticoid receptor (GR) beta-lactamase antagonism assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to suppress glucocorticoid
receptor signaling, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4

-------
system. The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with test
compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2 hours.
Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HeLa cell line used. GR-bla HeLa cells are engineered to express beta-lactamase
under the control of glucocorticoid receptor. These cells stably express a -beta-lactamase reporter gene under
the transcriptional control of an upstream activator sequence (UAS).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cells should not be grown more than 80-85% confluence. The
cell performance is affected is they are more confluent. Handle 1536 well plate black clear bottom plates
carefully by sides. Cell Thawing Method: Place 14 mL of pre-warmed thaw medium into a 15 ml conical tube.
Remove the vial of cells to be thawed from liquid nitrogen and thaw rapidly by placing at 37C in a water bath
with gentle agitation for 1-2 minutes. Do not submerge vial in water. Mix the entire content of the vial to 14 ml
of pre-warmed medium and centrifuge to remove DMSO. Discard the supernatant and transfer the precipitated
cells to T175 flask using 30 ml thawing medium. Cell Proliferation Method: Detach the cells from the flask using
TrypLExpress. The cells are re-seeded in T-175 flask at 1-1.3 million. Assay Protocol: Spin down the cells after
rinsing the cells with DPBS and trypsinizing. Resuspend the pellet with assay medium. Plate the cells in black-
clear bottom 1536 well plate at 1500/well/6uL through 8 tip Multidrop plate dispenser. Incubate for 4 hrs at
37C / 99% Humidity / 5% C02. Transfer 23nL of compounds from the library collection and positive control to
the assay plates through pintool. -Add 1 uL of buffer and luL of Agonist concentration to respective columns as
per plate map. Incubate for 18 hrs at 37C / 99% Humidity / 5% C02. Add luL of CCF4 (FRET Substrate) dye using
a single tip plate dispenser (BioRAPTR). Incubate at room temperature for 2 hrs in dark. Add 3 uL of Cell Titer
Glo and Incubate at room temperature for 0.5 hrs in dark. Read on ViewLux protocol optimized for this cell type.

Baseline median absolute deviation for the assay (bmad): 8.309
Response cutoff threshold used to determine hit calls: 49.852
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6 Response: Glucocorticoid receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

Mifepristone

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Mifepristone was used as a positive GR antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
1061

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.353

Neutral control median absolute deviation, by plate: nmad	20.29

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-6020.57%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-100.164

Positive control well median absolute deviation, by plate: pmad	5.892

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-4.725

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 715.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1195

T0X2 l_PRARd_B LA_Agon ist_via bi lity

1.	General Information

1.1	Assay Title: Viability Assessment in theTox21 HEK293 Peroxisome Proliferator-activated Receptor-delta (PPARd)
Agonism Beta-lactamase Assay

1.2	Assay Summary: TOX21_PPARd_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-
ppard-bla-agonist-pl. TOX21_PPARd_BLA_Agonist_viability is an assay readout measuring cellular ATP content
and detected with CellTiter-Glo Luciferase-coupled ATP quantitation. TOX21_PPARd_BLA_Agonist_viability
used a type of viability reporter where loss-of-signal activity can be used to understand changes in the cell
viability. Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has
produced multiple assay endpoints where this one serves a viability function. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the
subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 17-hours prior to monitoring fluorescence emission resulting
from PPAR-delta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 peroxisome proliferator-activated receptor delta agonism beta-lactamase assay screened a library of
diverse environmental compounds to probe for xenobiotic ligand-binding and potential to induce PPAR-delta-
dependent transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid
GAL4 system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the
assay and following 17-hour incubation of cells with test compounds a membrane-permeable FRET-based
substrate CCF4-AM is introduced and incubated for an additional 2 hours in the dark. Once in the cell,

-------
cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla expression is quantified
by measuring the ratio of blue (460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals
are monitored using an Envision plate reader. Following CCF4 incubation and detection, 3uL of CellTiter-Glo
reagent is added to each well and incubated for 30 minutes before cytotoxicity readout is measured on a
ViewLux microtiter plate reader. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Peroxisome proliferator-activated receptor delta is a ligand-activated
nuclear receptor which is expressed ubiquitously and may have a role in regulating the differentiation of
adipocytes, in keratinocyte differentiation and in the regulation of cholesterol and lipid metabolism (Schmuth
et al. 2004, Seimandi et al. 2005). The PPAR-delta_BLA_Agonist assay used Fluorescence Resonance Energy
Transfer (FRET) substrate to generate a ratiometric reporter response to receptor ligand-binding to allow
monitoring of PPAR-delta activity relative to a known receptor agonist. This assay is designed to help identify
environmental compounds with a capacity for PPAR-delta ligand-binding activity. The Tox21 PPAR-delta bla
assays are qHTS format assays which measured the ability of a chemical to interact with PPAR-delta by
monitoring modulation of fluorescence reporter gene signals. This assay utilized a human embryonic kidney cell
line (HEK293T) which expresses PPAR-delta and a one-hybrid GAL4 system to quantify xenobiotic PPAR-delta
agonism.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer PPAR-delta 293T DA (Division-arrested)
cells and PPAR delta-UAS-bla 293T cells contain a peroxisome proliferator-activated receptor delta (PPARd)
ligand-binding domain/Gal4 DNA-binding domain chimera, stably integrated into the CellSensor UAS-bla 293T
cell line. CellSensor UAS-bla 293T contains a beta-lactamase reporter gene under control of a UAS response
element stably integrated into 293T cells. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4
(DBD)-PPAR delta (LBD) fusion protein, the protein binds to the UAS, resulting in expression of -beta-lactamase.
The HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 (Dohr et al. 1995). The transformation incorporated approximately
4.5 kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: The cells should not be grown more than 80-85% confluence.
The cell performance is affected is they are more confluent. Do not leave cells in Trypsin for more than 5 min at
RT. Handle 1536 well plate black clear bottom plates carefully by sides. Cell Thawing Method: Place 14 mL of
pre-warmed thaw medium into a 15 ml conical tube. Remove the vial of cells to be thawed from liquid nitrogen
and thaw rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial
in water. Mix the entire content of the vial to 14 ml of pre-warmed medium and centrifuge to remove DMSO.
Discard the supernatant and transfer the precipitated cells to T175 flask using 30 ml thawing medium. Cell
Proliferation Method: Detach the cells from the flask using TrypLExpress. The cells are re-seeded in T-175 flask
at 2.5-4 million. Assay Protocol: Spin down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend

-------
the pellet with assay medium followed by filtering through cell strainer and adjust the required cell density.
Plate the cells in black-clear bottom 1536 well plate at 3000/well/6uL through 8 tip Multidrop plate dispenser.
Incubate for 5hrs at 37C / 99 percent Humidity / 5 percent C02. Transfer 23nL of compounds from the library
collection and positive control to the assay plates through Pintool. Incubate for 17hrs at 37C / 99 percent
Humidity / 5 percent C02. Add 1 uL of CCF4 (FRET Substrate) dye using a single tip plate dispenser (BioRAPTR).
Incubate at room temperature for lhrs in the dark. Read the fluorescence intensity through Envision plate
reader using Beta-Lactamase protocol optimized for this cell type. Add 3 uL of Cell Titer Glo and Incubate at
room temperature for 0.5 hrs in dark for both agonist and antagonist mode. Read on ViewLux protocol
optimized for this cell type for both agonist and antagonist mode PPARd-bla cells were dispensed at 3000
cells/6uL/well in 1536-well black wall/clear bottom plates using a Multidrop Combi (Thermo Fisher Scientific,
Waltham, MA) dispenser. After the assay plates were incubated at 37C and 5 percent C02 for 5 h, 23 nL of
compounds dissolved in DMSO, positive controls or DMSO only was transferred to the assay plate by a Pintool
station (Kalypsys, San Diego, CA). The assay plates were incubated at 37C for 17 h. After 1 uL of LiveBLAzer B/G
FRET substrate was added using a BioRAPTR Flying Reagent Dispenser (FRD) workstation (Beckman Coulter,
Indianapolis, IN, USA), the plates were incubated at room temperature for 2 h, and fluorescence intensity was
measured by an Envision plate reader (PerkinElmer, Shelton, CT). For cell viability readout that measures
cytotoxicity, 3 uL/well of CellTiter-Glo reagent was added into the assay plates using a BioRAPTR FRD. After 30
min incubation at room temperature, the luminescence intensity in the plates was measured using a ViewLux
plate reader (PerkinElmer).

Baseline median absolute deviation for the assay (bmad): 5.557
Response cutoff threshold used to determine hit calls: 33.344

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: PPAR-delta agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

3.

Additionally, this assay was annotated to the intended target family of cell cycle.
Data Interpretation

-------
The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:

-------
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

-0.304

Neutral control median absolute deviation, by plate: nmad

8.122

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

-2604.18%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

-3.728

7.207

Z Prime Factor for median positive and neutral control across all plates:

NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

-0.271

((pmed - nmed) /sqrt(pmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)

NA

Positive control signal-to-background: (pmed/nmed)

NA

-------
NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 596.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

-------
6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7.	Supporting Information:

More information on the ToxCast program can be found at: httpsi//www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1196

T0X2 l_PPARd_BLA_Antagonist_ch 1

1.	General Information

1.1	Assay Title: Tox21 HEK293 Peroxisome Proliferator-activated Receptor-delta (PPARd) Antagonism Beta-
lactamase Assay, Channel 1 Readout of Uncleaved Substrate

1.2	Assay Summary: TOX21_PPARd_BLA_Antagonist is a cell-based, single-readout assay that uses HEK293T, a
human kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See
tox21-ppard-bla-antagonist-pl. TOX21_PPARd_BLA_Antagonist_chl is an assay readout measuring reporter
gene via receptor activity and designed using inducible reporter (beta lactamase induction) detected with GAL4
b-lactamase reporter gene. The signal is derived from the uncleaved reporter gene substrate and is used to
calculate the ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data
from the assay component TOX21_PPARd_BLA_Antagonist_chl was analyzed into 1 assay endpoint. This assay
endpoint, TOX21_PPARd_BLA_Antagonist_chl, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of inducible reporter,increased activity can
be used to understand changes in the reporter gene as they relate to the gene PPARD. Furthermore, this assay
endpoint can be referred to as a secondary readout, because this assay has produced multiple assay endpoints
where this one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 17-hours prior to monitoring fluorescence emission resulting
from PPAR-delta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_PPARD_BLA_Antagonist_chl was designed to measure uncleaved reporter gene substrate
to target nuclear receptor activity at the protein (receptor) level, specifically mapping to PPARD gene(s) using
a positive control of MK886

The Tox21 peroxisome proliferator-activated receptor delta antagonism beta-lactamase assay screened a library
of diverse environmental compounds to probe for xenobiotic ligand-binding and potential to suppress PPAR-
delta-dependent transcription, monitored through bla reporter gene signal activation using a mammalian one-

-------
hybrid GAL4 system. Each well contained 0.3 uM L-165,041 to stimulate receptor activity and MK886 (a
leukotriene inhibitor) served as a positive control. The assay is run in triplicate on 1536-well microplates.
HEK293T cells are plated the day of the assay and following 17-hour incubation of cells with test compounds a
membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for an additional hour. Once
in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla expression
is quantified by measuring the ratio of blue (460nm, product) to green (530nm, substrate) fluorescence.
Fluorescence signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is
also incubated with test system for 30 minutes before readout to detect cell viability. Compound auto-
fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for
background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer PPAR-delta 293T DA (Division-arrested)
cells and PPAR delta-UAS-bla 293T cells contain a peroxisome proliferator-activated receptor delta (PPARd)
ligand-binding domain/Gal4 DNA-binding domain chimera, stably integrated into the CellSensor UAS-bla 293T
cell line. CellSensor UAS-bla 293T contains a beta-lactamase reporter gene under control of a UAS response
element stably integrated into 293T cells. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4
(DBD)-PPAR delta (LBD) fusion protein, the protein binds to the UAS, resulting in expression of -beta-lactamase.
The HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 (Dohr et al. 1995). The transformation incorporated approximately
4.5 kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: The cells should not be grown more than 80-85% confluence.
The cell performance is affected is they are more confluent. Do not leave cells in Trypsin for more than 5 min at
RT. Handle 1536 well plate black clear bottom plates carefully by sides. Cell Thawing Method: Place 14 mL of
pre-warmed thaw medium into a 15 ml conical tube. Remove the vial of cells to be thawed from liquid nitrogen
and thaw rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial
in water. Mix the entire content of the vial to 14 ml of pre-warmed medium and centrifuge to remove DMSO.
Discard the supernatant and transfer the precipitated cells to T175 flask using 30 ml thawing medium. Cell
Proliferation Method: Detach the cells from the flask using TrypLExpress. The cells are re-seeded in T-175 flask
at 2.5-4 million. Assay Protocol: Spin down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend
the pellet with assay medium followed by filtering through cell strainer and adjust the required cell density.
Plate the cells in black-clear bottom 1536 well plate at 3000/well/6uL through 8 tip Multidrop plate dispenser.
Incubate for 5hrs at 37C / 99 percent Humidity / 5 percent C02. Transfer 23nL of compounds from the library
collection and positive control to the assay plates through Pintool. Incubate for 17hrs at 37C / 99 percent
Humidity / 5 percent C02. Add 1 uL of CCF4 (FRET Substrate) dye using a single tip plate dispenser (BioRAPTR).
Incubate at room temperature for lhrs in the dark. Read the fluorescence intensity through Envision plate
reader using Beta-Lactamase protocol optimized for this cell type. Add 3 uL of Cell Titer Glo and Incubate at
room temperature for 0.5 hrs in dark for both agonist and antagonist mode. Read on ViewLux protocol
optimized for this cell type for both agonist and antagonist mode PPARd-bla cells were dispensed at 3000
cells/6uL/well in 1536-well black wall/clear bottom plates using a Multidrop Combi (Thermo Fisher Scientific,
Waltham, MA) dispenser. After the assay plates were incubated at 37C and 5 percent C02 for 5 h, 23 nL of
compounds dissolved in DMSO, positive controls or DMSO only was transferred to the assay plate by a Pintool
station (Kalypsys, San Diego, CA). The assay plates were incubated at 37C for 17 h. After 1 uL of LiveBLAzer B/G

-------
FRET substrate was added using a BioRAPTR Flying Reagent Dispenser (FRD) workstation (Beckman Coulter,
Indianapolis, IN, USA), the plates were incubated at room temperature for 2 h, and fluorescence intensity was
measured by an Envision plate reader (PerkinElmer, Shelton, CT). For cell viability readout that measures
cytotoxicity, 3 uL/well of CellTiter-Glo reagent was added into the assay plates using a BioRAPTR FRD. After 30
min incubation at room temperature, the luminescence intensity in the plates was measured using a ViewLux
plate reader (PerkinElmer).

Baseline median absolute deviation for the assay (bmad): 5.655
Response cutoff threshold used to determine hit calls: 33.933
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: PPAR-delta antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

MK886

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

NA

-------
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso)) x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
MK886 was used as a positive PPARd antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root

-------
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.144

Neutral control median absolute deviation, by plate: nmad	8.914

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-5840.53%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	27.28

Positive control well median absolute deviation, by plate: pmad	12.082

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	1.813

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

-------
(1 - ((3 * (mmad + nmad)) / abs(mmed - rimed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 710.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1197

TOX21_PPARd_BLA_Antagonist_ch2

1.	General Information

1.1	Assay Title: Tox21 HEK293 Peroxisome Proliferator-activated Receptor-delta (PPARd) Antagonism Beta-
lactamase Assay, Channel 2 Readout of Cleaved Substrate

1.2	Assay Summary: TOX21_PPARd_BLA_Antagonist is a cell-based, single-readout assay that uses HEK293T, a
human kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See
tox21-ppard-bla-antagonist-pl. TOX21_PPARd_BLA_Antagonist_ch2 is an assay readout measuring reporter
gene via receptor activity and designed using inducible reporter (beta lactamase induction) detected with GAL4
b-lactamase reporter gene. The signal is derived from the cleaved reporter gene substrate and is used to
calculate the ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data
from the assay component TOX21_PPARd_BLA_Antagonist_ch2 was analyzed into 1 assay endpoint. This assay
endpoint, TOX21_PPARd_BLA_Antagonist_ch2, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity
can be used to understand changes in the reporter gene as they relate to the gene PPARD. Furthermore, this
assay endpoint can be referred to as a secondary readout, because this assay has produced multiple assay
endpoints where this one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 17-hours prior to monitoring fluorescence emission resulting
from PPAR-delta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_PPARD_BLA_Antagonist_ch2 was designed to measure cleaved reporter gene substrate to
target nuclear receptor activity at the protein (receptor) level, specifically mapping to PPARD gene(s) using a
positive control of MK886

The Tox21 peroxisome proliferator-activated receptor delta antagonism beta-lactamase assay screened a library
of diverse environmental compounds to probe for xenobiotic ligand-binding and potential to suppress PPAR-
delta-dependent transcription, monitored through bla reporter gene signal activation using a mammalian one-

-------
hybrid GAL4 system. Each well contained 0.3 uM L-165,041 to stimulate receptor activity and MK886 (a
leukotriene inhibitor) served as a positive control. The assay is run in triplicate on 1536-well microplates.
HEK293T cells are plated the day of the assay and following 17-hour incubation of cells with test compounds a
membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for an additional hour. Once
in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla expression
is quantified by measuring the ratio of blue (460nm, product) to green (530nm, substrate) fluorescence.
Fluorescence signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is
also incubated with test system for 30 minutes before readout to detect cell viability. Compound auto-
fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for
background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer PPAR-delta 293T DA (Division-arrested)
cells and PPAR delta-UAS-bla 293T cells contain a peroxisome proliferator-activated receptor delta (PPARd)
ligand-binding domain/Gal4 DNA-binding domain chimera, stably integrated into the CellSensor UAS-bla 293T
cell line. CellSensor UAS-bla 293T contains a beta-lactamase reporter gene under control of a UAS response
element stably integrated into 293T cells. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4
(DBD)-PPAR delta (LBD) fusion protein, the protein binds to the UAS, resulting in expression of -beta-lactamase.
The HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 (Dohr et al. 1995). The transformation incorporated approximately
4.5 kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: The cells should not be grown more than 80-85% confluence.
The cell performance is affected is they are more confluent. Do not leave cells in Trypsin for more than 5 min at
RT. Handle 1536 well plate black clear bottom plates carefully by sides. Cell Thawing Method: Place 14 mL of
pre-warmed thaw medium into a 15 ml conical tube. Remove the vial of cells to be thawed from liquid nitrogen
and thaw rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial
in water. Mix the entire content of the vial to 14 ml of pre-warmed medium and centrifuge to remove DMSO.
Discard the supernatant and transfer the precipitated cells to T175 flask using 30 ml thawing medium. Cell
Proliferation Method: Detach the cells from the flask using TrypLExpress. The cells are re-seeded in T-175 flask
at 2.5-4 million. Assay Protocol: Spin down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend
the pellet with assay medium followed by filtering through cell strainer and adjust the required cell density.
Plate the cells in black-clear bottom 1536 well plate at 3000/well/6uL through 8 tip Multidrop plate dispenser.
Incubate for 5hrs at 37C / 99 percent Humidity / 5 percent C02. Transfer 23nL of compounds from the library
collection and positive control to the assay plates through Pintool. Incubate for 17hrs at 37C / 99 percent
Humidity / 5 percent C02. Add 1 uL of CCF4 (FRET Substrate) dye using a single tip plate dispenser (BioRAPTR).
Incubate at room temperature for lhrs in the dark. Read the fluorescence intensity through Envision plate
reader using Beta-Lactamase protocol optimized for this cell type. Add 3 uL of Cell Titer Glo and Incubate at
room temperature for 0.5 hrs in dark for both agonist and antagonist mode. Read on ViewLux protocol
optimized for this cell type for both agonist and antagonist mode PPARd-bla cells were dispensed at 3000
cells/6uL/well in 1536-well black wall/clear bottom plates using a Multidrop Combi (Thermo Fisher Scientific,
Waltham, MA) dispenser. After the assay plates were incubated at 37C and 5 percent C02 for 5 h, 23 nL of
compounds dissolved in DMSO, positive controls or DMSO only was transferred to the assay plate by a Pintool
station (Kalypsys, San Diego, CA). The assay plates were incubated at 37C for 17 h. After 1 uL of LiveBLAzer B/G

-------
FRET substrate was added using a BioRAPTR Flying Reagent Dispenser (FRD) workstation (Beckman Coulter,
Indianapolis, IN, USA), the plates were incubated at room temperature for 2 h, and fluorescence intensity was
measured by an Envision plate reader (PerkinElmer, Shelton, CT). For cell viability readout that measures
cytotoxicity, 3 uL/well of CellTiter-Glo reagent was added into the assay plates using a BioRAPTR FRD. After 30
min incubation at room temperature, the luminescence intensity in the plates was measured using a ViewLux
plate reader (PerkinElmer).

Baseline median absolute deviation for the assay (bmad): 9.798

Response cutoff threshold used to determine hit calls: 58.79

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: PPAR-delta antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

MK886

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

NA

-------
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
MK886 was used as a positive PPARd antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root

-------
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.405

Neutral control median absolute deviation, by plate: nmad	21.812

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-5266.03%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-100.196

Positive control well median absolute deviation, by plate: pmad	4.692

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-4.463

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

-------
(1 - ((3 * (mmad + nmad)) / abs(mmed - rimed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 869.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1198

T0X2 l_PPARg_BLA_Antagonist_ch 1

1.	General Information

1.1	Assay Title: Tox21 HEK293 Peroxisome Proliferator-activated Receptor-gamma (PPARg) Antagonism Beta-
lactamase Assay, Channel 1 Readout of Uncleaved Substrate

1.2	Assay Summary: TOX21 PPARg BLA Antagonist is a cell-based, single-readout assay that uses HEK293, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-
pparg-bla-antagonist-pl. TOX21_PPARg_BLA_Antagonist_chl is an assay readout measuring reporter gene via
receptor activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-
lactamase reporter gene. The signal is derived from the uncleaved reporter gene substrate and is used to
calculate the ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data
from the assay component TOX21_PPARg_BLA_Antagonist_chl was analyzed into 1 assay endpoint. This assay
endpoint, TOX21_PPARg_BLA_Antagonist_chl, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of inducible reporter,increased activity can
be used to understand changes in the reporter gene as they relate to the gene PPARG. Furthermore, this assay
endpoint can be referred to as a secondary readout, because this assay has produced multiple assay endpoints
where this one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 17-hours prior to monitoring fluorescence emission resulting
from PPAR-delta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_PPARg_BLA_Antagonist_chl was designed to measure uncleaved reporter gene substrate
to target nuclear receptor activity at the protein (receptor) level, specifically mapping to PPARG gene(s) using
a positive control of GW9662

The Tox21 peroxisome proliferator-activated receptor gamma antagonism beta-lactamase assay screened a
library of diverse environmental compounds to probe for xenobiotic ligand-binding and potential to suppress
PPAR-gamma-dependent transcription, monitored through bla reporter gene signal activation using a

-------
mammalian one-hybrid GAL4 system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are
plated the day of the assay and following 17-hour incubation of cells with test compounds a membrane-
permeable FRET-based substrate CCF4-AM is introduced and incubated for an additional hour. Once in the cell,
cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla expression is quantified
by measuring the ratio of blue (460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals
are monitored using an Envision plate reader. Following CCF4 incubation and detection, 3uL of CellTiter-Glo
reagent is added to each well and incubated for 30 minutes before cytotoxicity readout is measured on a
ViewLux microtiter plate reader. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293 cell line used. GeneBLAzer PPAR-gamma 293H DA (Division-arrested)
cells and PPARgamma-UAS-bla 293H cells contain a peroxisome proliferator-activated receptor gamma (PPARg)
ligand-binding domain/Gal4 DNA-binding domain chimera, stably integrated into the CellSensor UAS-bla 293H
cell line. CellSensor UAS-bla 293H contains a beta-lactamase reporter gene under control of a UAS response
element stably integrated into 293H cells. These cells stably express a beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4
(DBD)-PPARg (LBD) fusion protein, the protein binds to the UAS, resulting in expression of beta-lactamase. The
HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 1. The transformation incorporated approximately 4.5 kilobases
from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid 2. HEK293 cells are popular for their ease of
growth and transfection cells and are frequently used to produce exogenous proteins or viruses for
pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: The cells should not be grown more than 80-85% confluence.
The cell performance is affected is they are more confluent. Do not leave cells in Trypsin for more than 5 min at
RT. Handle 1536 well plate black clear bottom plates carefully by sides. Cell Thawing Method: Place 14 mL of
pre-warmed thaw medium into a 15 ml conical tube. Remove the vial of cells to be thawed from liquid nitrogen
and thaw rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial
in water. Mix the entire content of the vial to 14 ml of pre-warmed medium and centrifuge to remove DMSO.
Discard the supernatant and transfer the precipitated cells to T175 flask using 30ml thawing medium. Cell
Proliferation Method: Detach the cells from the flask using TrypLExpress. The cells are re-seeded in T-175 flask
at 3-4 million. Assay Protocol: Spin down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend
the pellet with assay medium followed by filtering through cell strainer and adjust the required cell density.
Plate the cells in black-clear bottom 1536 well plate at 3000/well/6uL for agonist mode and 3000cells/well/5 ul
for antagonist mode through 8 tip Multidrop plate dispenser. Incubate for 5 hrs at 37C / 99% Humidity / 5%
C02. Transfer 23nL of compounds from the library collection and positive control to the assay plates through
pintool. Add 1 uL of buffer and luL of Agonist concentration to respective columns as per plate map for
antagonist mode. Incubate for 17 hrs at 37C / 99% Humidity / 5% C02. Add luL of CCF4 (FRET Substrate) dye
using a single tip plate dispenser (BioRAPTR). Incubate at room temperature for 1 hrs in dark. Read the
fluorescence intensity through Envision plate reader using Beta-Lactamase protocol optimized for this cell type.
Add 3 uL of Cell Titer Glo and Incubate at room temperature for 0.5 hrs in dark for antagonist mode. Read on
ViewLux protocol optimized for this cell type for antagonist mode.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Target (nominal) number of replicates:
3

-------
Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

GW9662

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 4.507
Response cutoff threshold used to determine hit calls: 27.044
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: PPAR-gamma antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
GW9662 was used as a positive PPARg antagonist control.

NA

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest

-------
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.099

Neutral control median absolute deviation, by plate: nmad	6.23

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-6153.67%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	45.212

Positive control well median absolute deviation, by plate: pmad	8.619

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	4.18

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 899.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1199

TOX21_PPARg_BLA_Antagonist_ch2

1.	General Information

1.1	Assay Title: Tox21 HEK293 Peroxisome Proliferator-activated Receptor-gamma (PPARg) Antagonism Beta-
lactamase Assay, Channel 2 Readout of Cleaved Substrate

1.2	Assay Summary: TOX21 PPARg BLA Antagonist is a cell-based, single-readout assay that uses HEK293, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-
pparg-bla-antagonist-pl. TOX21_PPARg_BLA_Antagonist_ch2 is an assay readout measuring reporter gene via
receptor activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-
lactamase reporter gene. The signal is derived from the cleaved reporter gene substrate and is used to calculate
the ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the
assay component TOX21_PPARg_BLA_Antagonist_ch2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_PPARg_BLA_Antagonist_ch2, was analyzed in the positive analysis fitting direction relative to DMSO as
the negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be
used to understand changes in the reporter gene as they relate to the gene PPARG. Furthermore, this assay
endpoint can be referred to as a secondary readout, because this assay has produced multiple assay endpoints
where this one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 17-hours prior to monitoring fluorescence emission resulting
from PPAR-delta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_PPARg_BLA_Antagonist_ch2 was designed to measure cleaved reporter gene substrate to
target nuclear receptor activity at the protein (receptor) level, specifically mapping to PPARG gene(s) using a
positive control of GW9662

The Tox21 peroxisome proliferator-activated receptor gamma antagonism beta-lactamase assay screened a
library of diverse environmental compounds to probe for xenobiotic ligand-binding and potential to suppress
PPAR-gamma-dependent transcription, monitored through bla reporter gene signal activation using a

-------
mammalian one-hybrid GAL4 system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are
plated the day of the assay and following 17-hour incubation of cells with test compounds a membrane-
permeable FRET-based substrate CCF4-AM is introduced and incubated for an additional hour. Once in the cell,
cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla expression is quantified
by measuring the ratio of blue (460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals
are monitored using an Envision plate reader. Following CCF4 incubation and detection, 3uL of CellTiter-Glo
reagent is added to each well and incubated for 30 minutes before cytotoxicity readout is measured on a
ViewLux microtiter plate reader. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293 cell line used. GeneBLAzer PPAR-gamma 293H DA (Division-arrested)
cells and PPARgamma-UAS-bla 293H cells contain a peroxisome proliferator-activated receptor gamma (PPARg)
ligand-binding domain/Gal4 DNA-binding domain chimera, stably integrated into the CellSensor UAS-bla 293H
cell line. CellSensor UAS-bla 293H contains a beta-lactamase reporter gene under control of a UAS response
element stably integrated into 293H cells. These cells stably express a beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4
(DBD)-PPARg (LBD) fusion protein, the protein binds to the UAS, resulting in expression of beta-lactamase. The
HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 1. The transformation incorporated approximately 4.5 kilobases
from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid 2. HEK293 cells are popular for their ease of
growth and transfection cells and are frequently used to produce exogenous proteins or viruses for
pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: The cells should not be grown more than 80-85% confluence.
The cell performance is affected is they are more confluent. Do not leave cells in Trypsin for more than 5 min at
RT. Handle 1536 well plate black clear bottom plates carefully by sides. Cell Thawing Method: Place 14 mL of
pre-warmed thaw medium into a 15 ml conical tube. Remove the vial of cells to be thawed from liquid nitrogen
and thaw rapidly by placing at 37C in a water bath with gentle agitation for 1-2 minutes. Do not submerge vial
in water. Mix the entire content of the vial to 14 ml of pre-warmed medium and centrifuge to remove DMSO.
Discard the supernatant and transfer the precipitated cells to T175 flask using 30ml thawing medium. Cell
Proliferation Method: Detach the cells from the flask using TrypLExpress. The cells are re-seeded in T-175 flask
at 3-4 million. Assay Protocol: Spin down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend
the pellet with assay medium followed by filtering through cell strainer and adjust the required cell density.
Plate the cells in black-clear bottom 1536 well plate at 3000/well/6uL for agonist mode and 3000cells/well/5 ul
for antagonist mode through 8 tip Multidrop plate dispenser. Incubate for 5 hrs at 37C / 99% Humidity / 5%
C02. Transfer 23nL of compounds from the library collection and positive control to the assay plates through
pintool. Add 1 uL of buffer and luL of Agonist concentration to respective columns as per plate map for
antagonist mode. Incubate for 17 hrs at 37C / 99% Humidity / 5% C02. Add luL of CCF4 (FRET Substrate) dye
using a single tip plate dispenser (BioRAPTR). Incubate at room temperature for 1 hrs in dark. Read the
fluorescence intensity through Envision plate reader using Beta-Lactamase protocol optimized for this cell type.
Add 3 uL of Cell Titer Glo and Incubate at room temperature for 0.5 hrs in dark for antagonist mode. Read on
ViewLux protocol optimized for this cell type for antagonist mode.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Target (nominal) number of replicates:
3

-------
Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

GW9662

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 15.297
Response cutoff threshold used to determine hit calls: 91.784
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: PPAR-gamma antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso)) x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
GW9662 was used as a positive PPARg antagonist control.

NA

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest

-------
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.42

Neutral control median absolute deviation, by plate: nmad	23.226

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-5814.33%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-100.508

Positive control well median absolute deviation, by plate: pmad	12.657

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-3.763

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 673.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1202

TOX21_AR_B LA_Antagon ist_ch 1

1.	General Information

1.1	Assay Title: Tox21 HEK293T Androgen Receptor (AR) Antagonism Beta-lactamase Assay, Channel 1 Readout of
Uncleaved Substrate

1.2	Assay Summary: TOX21 AR BLA Antagonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-ar-
bla-antagonist-pl. TOX21_AR_BLA_Antagonist_chl is an assay readout measuring reporter gene via receptor
activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase
reporter gene. The signal is derived from the uncleaved reporter gene substrate and is used to calculate the
ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay
component TOX21_AR_BLA_Antagonist_chl was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_AR_BLA_Antagonist_chl, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter,increased activity can be used to
understand changes in the reporter gene as they relate to the gene AR. Furthermore, this assay endpoint can
be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic AR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_AR_BLA_Antagonist_chl was designed to measure uncleaved reporter gene substrate to
target nuclear receptor activity at the protein (receptor) level, specifically mapping to AR gene(s) using a positive
control of Cyproterone acetate

The Tox21 androgen receptor antagonism beta-lactamase assay screened a library of diverse environmental
compounds to probe for xenobiotic androgen receptor ligand-binding and potential to suppress androgen-
dependent transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid

-------
GAL4 system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the
assay and following 16 hour incubation of cells with test compounds a membrane-permeable FRET-based
substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Each compound was tested in a concentration-response format, using 15 concentrations
ranging from 1.1 nM to 92 uM. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection. Concentration-response
relationships were determined by monitoring FRET signals relative to DMSO-only exposures which provided a
signal baseline, and to a known androgen receptor antagonist (Cyproterone acetate) as a positive control which
provided a reference for 100 percent androgen receptor inhibition, as assessed in the presence of 0.01 uM
R1881, a known AR agonist.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer AR-UAS-bla GripTite 293 cells contain the
ligand-binding domain (LBD) of the rat androgen receptor (AR) fused to the DNA-binding domain of GAL4 stably
integrated in the GeneBLAzer UAS-bla GripTite 293 cell line. This portion of the rat AR is identical to the human
AR in the conserved LBD and differs from the human sequence at five residues in the hinge region. These cells
stably express a -beta-lactamase reporter gene under the transcriptional control of an upstream activator
sequence (UAS). When an agonist binds to the LBD of the GAL4 (DBD)-AR (LBD) fusion protein, the protein binds
to the UAS, resulting in expression of-beta-lactamase. The HEK-293 cell line is a human embryonic kidney cell
line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973. The
transformation incorporated approximately 4.5 kilobases from the viral genome into human chromosome 19 of
the HEK cells, and subsequent cytogenetic characterization established that the 293 line is pseudotriploid.
HEK293 cells are popular for their ease of growth and transfection cells and are frequently used to produce
exogenous proteins or viruses for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cell culture is maintained by passaging twice a week and should
not reach more than 90% confluence. The assay should be performed in black-clear bottom 1536 well plates, so
the bottom of the plates should not be touched. Cell Media Required: Growth (90% DMEM -- high glucose, 10%
Dialyzed FBS, 25mM HEPES, O.lmN NEAA, 1 mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and
Streptomycin, 80ug/ml Hygromycin, 80ug/ml Zeocin), Assay (90% Opti-MEM, 10% Dialyzed FB, O.lmN NEAA, 1
mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and Streptomycin), Thaw (90% DMEM - high glucose,
10% Dialyzed FB, 25mM HEPES, O.lmN NEAA, 1 mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and
Streptomycin), Freezing (100% Recovery Cell Culture Freezing Medium) Thawing method: 1ml frozen cells of
AR-bla were taken in pre-warmed 10ml of thaw medium for centrifuging. Thaw medium is used to re-suspend
the pellet. Seed the cells at 2 million per T-75 flask with thaw medium. Cell Proliferation Method: Detach the
cells from the flask using 0.05% Trypsin. The cells are re-seeded in T-225 flask at 3-4 million Assay Protocol: Spin
down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend the pellet with assay medium. Plate
the cells in black-clear bottom 1536 well plate at 2000/well/6uL through 8 tip Multidrop plate dispenser.
Incubate at 37C for 5hrs. Transfer 23nL of compounds from the library collection and positive control to the
assay plates through Pintool. Incubate at 37C for 16hrs. Add luL of CCF4 (FRET Substrate) dye using a single tip
plate dispenser (BioRAPTR). Incubate at room temperature for 2hrs. Read the fluorescence intensity through
Envision plate reader.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Target (nominal) number of replicates:
3

-------
Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

Cyproterone acetate

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 3.676
Response cutoff threshold used to determine hit calls: 22.059
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Androgen receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Cyproterone acetate was used as a positive AR antagonist control.

NA

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest

-------
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.186

Neutral control median absolute deviation, by plate: nmad	7.393

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-4007.66%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	24.075

Positive control well median absolute deviation, by plate: pmad	4.797

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	2.604

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 747.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Kleinstreuer NC, Ceger P, Watt ED, Martin
M, Houck K, Browne P, Thomas RS, Casey WM, Dix DJ, Allen D, Sakamuru S, Xia M, Huang R, Judson R.
Development and Validation of a Computational Model for Androgen Receptor Activity. Chem Res Toxicol. 2017
Apr 17;30(4):946-964. doi: 10.1021/acs.chemrestox.6b00347. Epub 2016 Dec 9. PMID: 27933809; PMCID:
PMC5396026.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
-------
Assay Endpoint ID: 1203

TOX21_AR_BLA_Antagonist_ch2

1.	General Information

1.1	Assay Title: Tox21 HEK293T Androgen Receptor (AR) Antagonism Beta-lactamase Assay, Channel 2 Readout of
Cleaved Substrate

1.2	Assay Summary: TOX21 AR BLA Antagonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-ar-
bla-antagonist-pl. TOX21_AR_BLA_Antagonist_ch2 is an assay readout measuring reporter gene via receptor
activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase
reporter gene. The signal is derived from the cleaved reporter gene substrate and is used to calculate the ratio
of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay
component TOX21_AR_BLA_Antagonist_ch2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_AR_BLA_Antagonist_ch2, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene AR. Furthermore, this assay endpoint can
be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic AR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_AR_BLA_Antagonist_ch2 was designed to measure cleaved reporter gene substrate to
target nuclear receptor activity at the protein (receptor) level, specifically mapping to AR gene(s) using a positive
control of Cyproterone acetate

The Tox21 androgen receptor antagonism beta-lactamase assay screened a library of diverse environmental
compounds to probe for xenobiotic androgen receptor ligand-binding and potential to suppress androgen-
dependent transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid

-------
GAL4 system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the
assay and following 16 hour incubation of cells with test compounds a membrane-permeable FRET-based
substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Each compound was tested in a concentration-response format, using 15 concentrations
ranging from 1.1 nM to 92 uM. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection. Concentration-response
relationships were determined by monitoring FRET signals relative to DMSO-only exposures which provided a
signal baseline, and to a known androgen receptor antagonist (Cyproterone acetate) as a positive control which
provided a reference for 100 percent androgen receptor inhibition, as assessed in the presence of 0.01 uM
R1881, a known AR agonist.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer AR-UAS-bla GripTite 293 cells contain the
ligand-binding domain (LBD) of the rat androgen receptor (AR) fused to the DNA-binding domain of GAL4 stably
integrated in the GeneBLAzer UAS-bla GripTite 293 cell line. This portion of the rat AR is identical to the human
AR in the conserved LBD and differs from the human sequence at five residues in the hinge region. These cells
stably express a -beta-lactamase reporter gene under the transcriptional control of an upstream activator
sequence (UAS). When an agonist binds to the LBD of the GAL4 (DBD)-AR (LBD) fusion protein, the protein binds
to the UAS, resulting in expression of-beta-lactamase. The HEK-293 cell line is a human embryonic kidney cell
line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973. The
transformation incorporated approximately 4.5 kilobases from the viral genome into human chromosome 19 of
the HEK cells, and subsequent cytogenetic characterization established that the 293 line is pseudotriploid.
HEK293 cells are popular for their ease of growth and transfection cells and are frequently used to produce
exogenous proteins or viruses for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cell culture is maintained by passaging twice a week and should
not reach more than 90% confluence. The assay should be performed in black-clear bottom 1536 well plates, so
the bottom of the plates should not be touched. Cell Media Required: Growth (90% DMEM - high glucose, 10%
Dialyzed FBS, 25mM HEPES, O.lmN NEAA, 1 mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and
Streptomycin, 80ug/ml Hygromycin, 80ug/ml Zeocin), Assay (90% Opti-MEM, 10% Dialyzed FB, O.lmN NEAA, 1
mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and Streptomycin), Thaw (90% DMEM - high glucose,
10% Dialyzed FB, 25mM HEPES, O.lmN NEAA, 1 mM Sodium pyruvate, lOOU/ml and lOOug/ml Penicillin and
Streptomycin), Freezing (100% Recovery Cell Culture Freezing Medium) Thawing method: 1ml frozen cells of
AR-bla were taken in pre-warmed 10ml of thaw medium for centrifuging. Thaw medium is used to re-suspend
the pellet. Seed the cells at 2 million per T-75 flask with thaw medium. Cell Proliferation Method: Detach the
cells from the flask using 0.05% Trypsin. The cells are re-seeded in T-225 flask at 3-4 million Assay Protocol: Spin
down the cells after rinsing the cells with DPBS and trypsinizing. Resuspend the pellet with assay medium. Plate
the cells in black-clear bottom 1536 well plate at 2000/well/6uL through 8 tip Multidrop plate dispenser.
Incubate at 37C for 5hrs. Transfer 23nL of compounds from the library collection and positive control to the
assay plates through Pintool. Incubate at 37C for 16hrs. Add luL of CCF4 (FRET Substrate) dye using a single tip
plate dispenser (BioRAPTR). Incubate at room temperature for 2hrs. Read the fluorescence intensity through
Envision plate reader.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Target (nominal) number of replicates:
3

-------
Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

Cyproterone acetate

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 8.8

Response cutoff threshold used to determine hit calls: 52.801

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Androgen receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Cyproterone acetate was used as a positive AR antagonist control.

NA

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest

-------
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 10496	Number of chemicals tested: 8305

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.454

Neutral control median absolute deviation, by plate: nmad	16.945

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-3779.72%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-100.45

Positive control well median absolute deviation, by plate: pmad	3.961

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-5.885

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 932.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Huang R, Xia M, Cho MH, Sakamuru S, Shinn P, Houck KA, Dix DJ, Judson RS, Witt KL, Kavlock RJ,
Tice RR, Austin CP. Chemical genomics profiling of environmental chemical modulation of human nuclear
receptors. Environ Health Perspect. 2011 Aug;119(8):1142-8. doi: 10.1289/ehp. 1002952. Epub 2011 May 4.
PubMed PMID: 21543282; PubMed Central PMCID: PMC3237348., Kleinstreuer NC, Ceger P, Watt ED, Martin
M, Houck K, Browne P, Thomas RS, Casey WM, Dix DJ, Allen D, Sakamuru S, Xia M, Huang R, Judson R.
Development and Validation of a Computational Model for Androgen Receptor Activity. Chem Res Toxicol. 2017
Apr 17;30(4):946-964. doi: 10.1021/acs.chemrestox.6b00347. Epub 2016 Dec 9. PMID: 27933809; PMCID:
PMC5396026.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
-------
Assay Endpoint ID: 1340

TOX21_ESRE_BLA_Agonist_chl

1.	General Information

1.1	Assay Title: Tox21 HeLa Endoplasmic Reticulum Stress Response Element (ERSE) Agonism Beta-lactamase Assay,
Channel 1 Readout of Uncleaved Substrate

1.2	Assay Summary: TOX21_ESRE_BLA_Agonist is a cell-based, single-readout assay that uses HeLa, a human cervix
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-esre-bla-
pl. TOX21_ESRE_BLA_Agonist_chl is an assay readout measuring transcription factor activity and designed
using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter gene. The signal
is derived from the uncleaved reporter gene substrate and is used to calculate the ratio of cleaved (ch2) to
uncleaved (chl) reporter gene substrate used as the measure of target activity. Data from the assay component
TOX21_ESRE_BLA_Agonist_chl was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_ESRE_BLA_Agonist_chl, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene ATF6. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HeLa cells are aliquoted into 1536-well microtiter plates
at a concentration of 1500 cells/6 uL and incubated with test compounds for 18 hours prior to monitoring
fluorescence emission resulting from xenobiotic endoplasmic reticulum stress response expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ESRE_BLA_Agonist_chl was designed to measure uncleaved reporter gene substrate to
target transcription factor activity, specifically mapping to ATF6 gene(s) using a positive control of 17-AAG

The Tox21 endoplasmic reticulum stress response element (ESRE) agonism beta-lactamase assay screened a
library of diverse environmental compounds to probe for xenobiotic potential to induce an endoplasmic
reticulum stress response, monitored through bla reporter gene signal activation using a mammalian one-hybrid
GAL4 system. The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with

-------
test compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2
hours. Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HeLa cell line used. ESRE-bla HeLa cells are engineered to express beta-
lactamase under the control of ER stress response element. These cells stably express a -beta-lactamase
reporter gene under the transcriptional control of an upstream activator sequence (UAS).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cell culture is maintained by passaging twice a week and should
not reach more than 90% confluence. The assay should be performed in black-clear bottom 1536 well plates, so
the bottom of the plates should not be touched. Cell Thawing Method: 1ml frozen cells of ESRE bla HeLa were
taken in pre-warmed 10ml of thaw medium for centrifuging. 2-3ml of the thaw medium is taken to resuspend
the pellet. The cells were seeded in T-75 flask at 2 million cells. Cell Proliferation Method: Rinse the cells with
DPBS and detach them by using 0.05% Trypsin and centrifuge. The cells are further passaged at a density of 3
million cells per T-225 flask. Assay Protocol: Trypsinize cells from the culturing flask and centrifuge and then
resuspend cells in assay medium at a density of 0.25 X 10A6 cells/mL. Plate the cells in black-clear bottom 1536
well plate at 1500/well/6uL of assay medium through 8 tip of a plate dispenser (Multi drop). Incubate at 37C for
an overnight (18hrs). Transfer 23nL of compounds from the library collection and positive control through
Pintool. Incubate at 37C for 5hrs. Add luL of CCF4 dye using a single tip of a plate dispenser (BioRAPTR). Incubate
at room temperature for 2hrs. Read the fluorescence intensity through Envision plate reader. Add 4uL of
CellTiter-Glo reagent using a single tip of a plate dispenser (BioRAPTR). Incubate at room temperature for 30
min. Read the luminescence through ViewLux plate reader.

Baseline median absolute deviation for the assay (bmad): 3.432
Response cutoff threshold used to determine hit calls: 20.589
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6 Response: Endoplasmic reticulum stress response element (ESRE) agonism is monitored by FRET emission
resulting from GAL4/-beta-lactamase gene expression. The cytotoxicity of the compounds screened was
measured in parallel by measuring the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation
in the same wells.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

17-AAG

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
17-AAG was used as a positive ER agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
369

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-1.948

Neutral control median absolute deviation, by plate: nmad	6.001

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-126.97%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-19.337

Positive control well median absolute deviation, by plate: pmad	4.752

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-2.229

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 427.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1341

TOX21_ESRE_BLA_Agonist_ch2

1.	General Information

1.1	Assay Title: Tox21 HeLa Endoplasmic Reticulum Stress Response Element (ERSE) Agonism Beta-lactamase Assay,
Channel 2 Readout of Cleaved Substrate

1.2	Assay Summary: TOX21_ESRE_BLA_Agonist is a cell-based, single-readout assay that uses HeLa, a human cervix
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-esre-bla-
pl. TOX21_ESRE_BLA_Agonist_ch2 is an assay readout measuring reporter gene via receptor activity and
designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter gene.
The signal is derived from the cleaved reporter gene substrate and is used to calculate the ratio of cleaved (ch2)
to uncleaved (chl) substrate used as the measure of target activity. Data from the assay component
TOX21_ESRE_BLA_Agonist_ch2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_ESRE_BLA_Agonist_ch2, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene ATF6. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HeLa cells are aliquoted into 1536-well microtiter plates
at a concentration of 1500 cells/6 uL and incubated with test compounds for 18 hours prior to monitoring
fluorescence emission resulting from xenobiotic endoplasmic reticulum stress response expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ESRE_BLA_Agonist_ch2 was designed to measure cleaved reporter gene substrate to
target transcription factor activity, specifically mapping to ATF6 gene(s) using a positive control of 17-AAG

The Tox21 endoplasmic reticulum stress response element (ESRE) agonism beta-lactamase assay screened a
library of diverse environmental compounds to probe for xenobiotic potential to induce an endoplasmic
reticulum stress response, monitored through bla reporter gene signal activation using a mammalian one-hybrid
GAL4 system. The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with

-------
test compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2
hours. Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HeLa cell line used. ESRE-bla HeLa cells are engineered to express beta-
lactamase under the control of ER stress response element. These cells stably express a -beta-lactamase
reporter gene under the transcriptional control of an upstream activator sequence (UAS).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cell culture is maintained by passaging twice a week and should
not reach more than 90% confluence. The assay should be performed in black-clear bottom 1536 well plates, so
the bottom of the plates should not be touched. Cell Thawing Method: 1ml frozen cells of ESRE bla HeLa were
taken in pre-warmed 10ml of thaw medium for centrifuging. 2-3ml of the thaw medium is taken to resuspend
the pellet. The cells were seeded in T-75 flask at 2 million cells. Cell Proliferation Method: Rinse the cells with
DPBS and detach them by using 0.05% Trypsin and centrifuge. The cells are further passaged at a density of 3
million cells per T-225 flask. Assay Protocol: Trypsinize cells from the culturing flask and centrifuge and then
resuspend cells in assay medium at a density of 0.25 X 10A6 cells/mL. Plate the cells in black-clear bottom 1536
well plate at 1500/well/6uL of assay medium through 8 tip of a plate dispenser (Multi drop). Incubate at 37C for
an overnight (18hrs). Transfer 23nL of compounds from the library collection and positive control through
Pintool. Incubate at 37C for 5hrs. Add luL of CCF4 dye using a single tip of a plate dispenser (BioRAPTR). Incubate
at room temperature for 2hrs. Read the fluorescence intensity through Envision plate reader. Add 4uL of
CellTiter-Glo reagent using a single tip of a plate dispenser (BioRAPTR). Incubate at room temperature for 30
min. Read the luminescence through ViewLux plate reader.

Baseline median absolute deviation for the assay (bmad): 1.787

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6 Response: Endoplasmic reticulum stress response element (ESRE) agonism is monitored by FRET emission
resulting from GAL4/-beta-lactamase gene expression. The cytotoxicity of the compounds screened was
measured in parallel by measuring the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation
in the same wells.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

17-AAG

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
17-AAG was used as a positive ER agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

-------
1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
104

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.161

Neutral control median absolute deviation, by plate: nmad	3.088

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	86.99%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	102.852

Positive control well median absolute deviation, by plate: pmad	10.856

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	8.99

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 492.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1342

TOX21_ESRE_BLA_Agonist_ratio

1.	General Information

1.1	Assay Title: Tox21 HeLa Endoplasmic Reticulum Stress Response Element (ERSE) Agonism Beta-lactamase Assay,
Ratio

1.2	Assay Summary: TOX21_ESRE_BLA_Agonist is a cell-based, single-readout assay that uses HeLa, a human cervix
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-esre-bla-
pl. TOX21_ESRE_BLA_Agonist_ratio is an assay readout measuring reporter gene via receptor activity and
designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter gene.
The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) reporter gene substrate used as the
measure of target activity. Data from the assay component TOX21_ESRE_BLA_Agonist_ratio was analyzed into
1 assay endpoint. This assay endpoint, TOX21_ESRE_BLA_Agonist_ratio, was analyzed in the positive analysis
fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, measures of receptor for gain-of-signal activity can be used to understand the reporter gene at the
pathway-level as they relate to the gene ATF6. Furthermore, this assay endpoint can be referred to as a primary
readout, because this assay has produced multiple assay endpoints where this ratio serves a reporter gene
function to understand target activity. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the DNA binding intended target family, where the subfamily is basic leucine zipper.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HeLa cells are aliquoted into 1536-well microtiter plates
at a concentration of 1500 cells/6 uL and incubated with test compounds for 18 hours prior to monitoring
fluorescence emission resulting from xenobiotic endoplasmic reticulum stress response expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ESRE_BLA_Agonist_ratio was designed to target transcription factor activity, specifically
mapping to ATF6 gene(s) using a positive control of 17-AAG

The Tox21 endoplasmic reticulum stress response element (ESRE) agonism beta-lactamase assay screened a
library of diverse environmental compounds to probe for xenobiotic potential to induce an endoplasmic
reticulum stress response, monitored through bla reporter gene signal activation using a mammalian one-hybrid

-------
GAL4 system. The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with
test compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2
hours. Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Endoplasmic Reticulum (ER) plays an important role in cellular
signaling, protein folding/maturation, stress response, lipid biosynthesis, calcium storage and release etc. Under
stress conditions, proteins in ER misfold or unfold, and leading to the expression of ER stress responsive genes.
ER stress has been associated with a variety of pathophysiological conditions such as cancer, inflammation,
neurodegenerative diseases, diabetes, ischemic heart disease etc. To identify the compounds that induce ER
stress response, an ER stress reporter cell line, CellSensor ESRE-bla HeLa (Life Technologies, Carlsbad, CA)
containing a beta-lactamase reporter gene under control of the Endoplasmic reticulum Stress Response Element
(ESRE) was used to screen Tox2110K compound library.

2.3	Experimental System: adherent HeLa cell line used. ESRE-bla HeLa cells are engineered to express beta-
lactamase under the control of ER stress response element. These cells stably express a -beta-lactamase
reporter gene under the transcriptional control of an upstream activator sequence (UAS).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cell culture is maintained by passaging twice a week and should
not reach more than 90% confluence. The assay should be performed in black-clear bottom 1536 well plates, so
the bottom of the plates should not be touched. Cell Thawing Method: 1ml frozen cells of ESRE bla HeLa were
taken in pre-warmed 10ml of thaw medium for centrifuging. 2-3ml of the thaw medium is taken to resuspend
the pellet. The cells were seeded in T-75 flask at 2 million cells. Cell Proliferation Method: Rinse the cells with
DPBS and detach them by using 0.05% Trypsin and centrifuge. The cells are further passaged at a density of 3
million cells per T-225 flask. Assay Protocol: Trypsinize cells from the culturing flask and centrifuge and then
resuspend cells in assay medium at a density of 0.25 X 10A6 cells/mL. Plate the cells in black-clear bottom 1536
well plate at 1500/well/6uL of assay medium through 8 tip of a plate dispenser (Multi drop). Incubate at 37C for
an overnight (18hrs). Transfer 23nL of compounds from the library collection and positive control through
Pintool. Incubate at 37C for 5hrs. Add luL of CCF4 dye using a single tip of a plate dispenser (BioRAPTR). Incubate
at room temperature for 2hrs. Read the fluorescence intensity through Envision plate reader. Add 4uL of
CellTiter-Glo reagent using a single tip of a plate dispenser (BioRAPTR). Incubate at room temperature for 30
min. Read the luminescence through ViewLux plate reader.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:
0.00979371647509579 nM

Target (nominal) number of replicates:

3

Standard maximum concentration tested:
765.134099616858 nM

-------
Key positive control:	Neutral vehicle control:

17-AAG	DMSO

Baseline median absolute deviation for the assay (bmad): 1.438
Response cutoff threshold used to determine hit calls: 20
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6 Response: Endoplasmic reticulum stress response element (ESRE) agonism is monitored by FRET emission
resulting from GAL4/-beta-lactamase gene expression. The cytotoxicity of the compounds screened was
measured in parallel by measuring the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation
in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
17-AAG was used as a positive ER agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series

-------
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	1.058

Neutral control median absolute deviation, by plate: nmad	2.279

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	123.17%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	98.065

Positive control well median absolute deviation, by plate: pmad	6.039

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	14.89

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 496.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1343

T0X21_ES R E_B LA_Ago n ist_vi a bi I ity

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HeLa Endoplasmic Reticulum Stress Response Element (ERSE)
Agonism Beta-lactamase Assay

1.2	Assay Summary: TOX21_ESRE_BLA_Agonist is a cell-based, single-readout assay that uses HeLa, a human cervix
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-esre-bla-
pl. TOX21_ESRE_BLA_Agonist_viability is an assay readout measuring cellular ATP content and detected with
CellTiter-Glo Luciferase-coupled ATP quantitation. TOX21_ESRE_BLA_Agonist_viability used a type of viability
reporter where loss-of-signal activity can be used to understand changes in the cell viability. Furthermore, this
assay endpoint can be referred to as a secondary readout, because this assay has produced multiple assay
endpoints where this one serves a viability function. To generalize the intended target to other relatable targets,
this assay endpoint is annotated to the cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HeLa cells are aliquoted into 1536-well microtiter plates
at a concentration of 1500 cells/6 uL and incubated with test compounds for 18 hours prior to monitoring
fluorescence emission resulting from xenobiotic endoplasmic reticulum stress response expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 endoplasmic reticulum stress response element (ESRE) agonism beta-lactamase assay screened a
library of diverse environmental compounds to probe for xenobiotic potential to induce an endoplasmic
reticulum stress response, monitored through bla reporter gene signal activation using a mammalian one-hybrid
GAL4 system. The assay is run in triplicate on 1536-well microplates. Following 18 hour incubation of cells with
test compounds, a membrane-permeable FRET-based substrate CCF4-AM is introduced and incubated for 2
hours. Once in the cell, cytoplasmic esterases trap the negatively charged CCF4 substrate in the cytosol and bla
expression is quantified by measuring the ratio of blue (product) to green (substrate) fluorescence. Fluorescence

-------
signals are monitored using an Envision plate reader and CellTiter-Glo assay reagent (Promega) is also incubated
with test system for 30 minutes before readout to detect cell viability. Compound auto-fluorescence was
monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact
detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Endoplasmic Reticulum (ER) plays an important role in cellular
signaling, protein folding/maturation, stress response, lipid biosynthesis, calcium storage and release etc. Under
stress conditions, proteins in ER misfold or unfold, and leading to the expression of ER stress responsive genes.
ER stress has been associated with a variety of pathophysiological conditions such as cancer, inflammation,
neurodegenerative diseases, diabetes, ischemic heart disease etc. To identify the compounds that induce ER
stress response, an ER stress reporter cell line, CellSensor ESRE-bla HeLa (Life Technologies, Carlsbad, CA)
containing a beta-lactamase reporter gene under control of the Endoplasmic reticulum Stress Response Element
(ESRE) was used to screen Tox2110K compound library.

2.3	Experimental System: adherent HeLa cell line used. ESRE-bla HeLa cells are engineered to express beta-
lactamase under the control of ER stress response element. These cells stably express a -beta-lactamase
reporter gene under the transcriptional control of an upstream activator sequence (UAS).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Cell culture is maintained by passaging twice a week and should
not reach more than 90% confluence. The assay should be performed in black-clear bottom 1536 well plates, so
the bottom of the plates should not be touched. Cell Thawing Method: 1ml frozen cells of ESRE bla HeLa were
taken in pre-warmed 10ml of thaw medium for centrifuging. 2-3ml of the thaw medium is taken to resuspend
the pellet. The cells were seeded in T-75 flask at 2 million cells. Cell Proliferation Method: Rinse the cells with
DPBS and detach them by using 0.05% Trypsin and centrifuge. The cells are further passaged at a density of 3
million cells per T-225 flask. Assay Protocol: Trypsinize cells from the culturing flask and centrifuge and then
resuspend cells in assay medium at a density of 0.25 X 10A6 cells/mL. Plate the cells in black-clear bottom 1536
well plate at 1500/well/6uL of assay medium through 8 tip of a plate dispenser (Multi drop). Incubate at 37C for
an overnight (18hrs). Transfer 23nL of compounds from the library collection and positive control through
Pintool. Incubate at 37C for 5hrs. Add luL of CCF4 dye using a single tip of a plate dispenser (BioRAPTR). Incubate
at room temperature for 2hrs. Read the fluorescence intensity through Envision plate reader. Add 4uL of
CellTiter-Glo reagent using a single tip of a plate dispenser (BioRAPTR). Incubate at room temperature for 30
min. Read the luminescence through ViewLux plate reader.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 7.423
Response cutoff threshold used to determine hit calls: 44.536

-------
Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Endoplasmic reticulum stress response element (ESRE) agonism is monitored by FRET emission
resulting from GAL4/-beta-lactamase gene expression. The cytotoxicity of the compounds screened was
measured in parallel by measuring the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation
in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

-------
Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

-------
Active hit count: hitc>0.9
489

Inactive hit count: 0
-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-2.074

Neutral control median absolute deviation, by plate: nmad	10.243

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-104.81%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-10.122

Positive control well median absolute deviation, by plate: pmad	11.035

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.543

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 440.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1659

T0X21_RAR_LU C_Ago n ist

1.	General Information

1.1	Assay Title: Tox21 C3RL4 Retinoic Acid Response Element (RARE) Agonism Luciferase Assay

1.2	Assay Summary: TOX21 RAR LUC Agonist is a cell-based, single-readout assay that uses C3H10T1/2, a mouse
sarcoma cell line, with measurements taken at 6 hours after chemical dosing in a 1536-well plate. See tox21-
rar-agonist-pl. TOX21_RAR_LUC_Agonist is one of one assay component(s) measured or calculated from the
TOX21_RAR_LUC_Agonist assay. It is designed to make measurements of luciferase induction, a form of
inducible reporter, as detected with bioluminescence signals by CellTiter-Glo Luciferase-coupled ATP
quantitation technology Data from the assay component TOX21_RAR_LUC_Agonist was analyzed into 1 assay
endpoint. This assay endpoint, TOX21_RAR_LUC_Agonist, was analyzed in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene RARA.
Furthermore, this assay endpoint can be referred to as a primary readout, because the performed assay has
only produced 1 assay endpoint. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the nuclear receptor intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected C3RL4 cells were seeded at 1,000/well in 5 uL of assay
medium into white solid 1,536-well tissue culture plates (Greiner Bio-One North America Inc. Monroe, NC) using
a Multidrop Combi (Thermo Scientific, Waltham, MA) dispenser. After incubation at 37 C and 5% C02 for an
overnight, 23 nL of compounds dissolved in DMSO, positive control or DMSO only were transferred by a Pintool
station (Kalypsys, San Diego, CA). After 6 hr incubation at 37 C and 5% C02, 5 uL of Amplite Luciferase reagent
(AAT Bioquest Inc. Sunnyvale, CA) was added to each assay plate using a Flying Reagent Dispenser (FRD) (Aurora
Discovery, San Diego, CA). Luminescence was quantified on a ViewLux plate reader (PerkinElmer, Waltham, MA)
after 30 min incubation at room temperature.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_RAR_LUC_Agonist was designed to measure changes to bioluminescence signals produced
from an enzymatic reaction involving the key substrate [One-Glo], Changes are indicative of changes in

-------
transcriptional gene expression due to agonist activity regulated by the human retinoic acid receptor alpha
[GeneSymbokRARA]

The Tox21 retinoic acid response element (RARE) agonism luciferase assay screened a library of diverse
environmental compounds to probe for xenobiotic all-trans retinoic acid (atRA) ligand-binding and potential to
activate RAR mediated gene expression, monitored through luciferase reporter gene signal activation. C3RL4
cell line (OARSA/CFSAN/FDA, Laurel, MD) was used to screen Tox21 10K compound library for identifying
activators of the RSP. The C3RL4 clone contains a functional retinol (vitamin A) signaling pathway (RSP) and the
firefly luciferase gene (Luc) under the control of the RARE. To differentiate true RARE agonist from cytotoxic
substances, the assay is multiplexed with cell viability assay. The assay is run in triplicate on 1536-well microplate
and bioluminescence was measured following 6-hour incubation of cells with test compounds and 30 min
incubation of test system with ONE-GloTM luciferase assay reagent. The bioluminescent signal was monitored
using a Promega ViewLux plate reader. Following the incubation period, the cell culture was screened for
bioluminescent signals in agonist mode using luciferase detection technology. Compound auto-fluorescence
was monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background
artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. All-trans Retinoic acid (atRA) is the potent natural metabolite
of retinol and plays an important role in normal embryonic development and maintenance of cellular phenotype
in adult animals. atRA is the activating ligand for the retinoic acid receptors (RARs) which form heterodimers
with retinoid X receptors (RXRs) on the retinoic acid response element (RARE). Through activation of the
RAR/RXRs nuclear receptors, atRA regulates the transcription of a large number of protein-coding genes and
regulatory RNAs. Intracellular levels of atRA are controlled by the retinol signaling pathway (RSP) that regulates
the biosynthesis and catabolism of atRA to maintain physiological levels. Chemicals that interfere with the RSP
can cause abnormal intracellular levels of atRA and therefore are potential developmental toxicants. To assess
compounds for the potential adverse effect on embryonic development through interfering with retinol
signaling, a cell-based RARE luciferase reporter gene assay was used. C3RL4 cell line (OARSA/CFSAN/FDA, Laurel,
MD) was used to screen Tox2110K compound library for identifying activators of the RSP. Also cytotoxicity was
assessed by determining the viability of cells based on the quantitation of ATP.

2.3	Experimental System: adherent C3H10T1/2 cell line used. The C3H10T1/2 [clone8] (C3RL4) cell line was
obtained from ATCC (Catalog #CCL-226, Lot #58613480). The stably transfected C3RL4 line exhibits fibroblast
morphology that was isolated from a line of C3H mouse embryo cells. The C3RL4 clone contains a functional RSP
and the firefly luciferase gene (Luc) under the control of the RARE.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Maintain cells below 85% confluence. Fetal Bovine serum used
for cell culture and assay purpose is heat inactivated at 56C for 30min. Extra precautions to be taken for making
Retinol as it is photosensitive and moisture absorbent. Thawing method: Thaw a vial of cells in 9ml of pre-
warmed thaw medium and then centrifuge. Resuspend the pellet with the thaw medium and seed at 2 million
cells per T-75 flask. Cell Proliferation Method: Trypsinize cells from the culturing flask and centrifuge and then
resuspend cells in culture medium. Passage cells at 1-1.5 million perT-225 flask. Assay Protocol: Trypsinize cells
from the culturing flask and centrifuge and then resuspend cells in assay medium at a density of 0.2 X 10A6
cells/mL. Dispense 1000 cells/5uL/well into 1536-well tissue treated white/solid bottom plates using a 8 tip
dispenser (Multidrop). Incubate the plates for an overnight (20hr) at 37C and 5% C02. Transfer 23nL of

-------
compounds from the library collection (5.6nM to 92uM) and positive control (Retinol made fresh from the
powder) through pintool. Incubate the plates for 6hr at 37C and 5% C02. Then add 5ul of Amplite(TM) Luciferase
reagent using a single tip dispense (BioRAPTR). Incubate the plates at room temperature for 30min. Measure
luminescence (exposure time = 60sec) by ViewLux plate reader.

Baseline median absolute deviation for the assay (bmad): 0.839
Response cutoff threshold used to determine hit calls: 20

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Agonism of the retinol (vitamin A) signaling pathway (RSP) is measured by bioluminescence activity
via a retinoic acid response element (RARE) firefly luciferase reporter gene. Increased luciferase activity can be
used to identify the compounds that promote xenobiotic all-trans retinoic acid (atRA) ligand-binding and RSP
agonism. The cytotoxicity of the compounds screened was tested in parallel by measuring the cell viability using
CellTiter-Fluor assay in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Increased luminescence (gain-of-signal) was measured relative to retinol (positive control) signal,
using DMSO (neutral control) as a baseline for luciferase induction. Response was reported as a percent of
positive control activity.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

Retinol

I arget (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

NA

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest

-------
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.836

Neutral control median absolute deviation, by plate: nmad	1.961

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	207.28%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	99.191

Positive control well median absolute deviation, by plate: pmad	5.484

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	17.079

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 555.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1660

TOX21_RAR_LU C_Agon ist_via bi lity

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 C3RL4 Retinoic Acid Response Element (RARE) Agonism Luciferase
Assay

1.2	Assay Summary: TOX21 RAR LUC Agonist is a cell-based, single-readout assay that uses C3H10T1/2, a mouse
sarcoma cell line, with measurements taken at 6 hours after chemical dosing in a 1536-well plate. See tox21-
rar-agonist-pl. TOX21_RAR_LUC_Agonist_viability is an assay readout measuring cellular ATP content and
detected with CellTiter-Glo Luciferase-coupled ATP quantitation. TOX21_RAR_LUC_Agonist_viability used a type
of viability reporter where loss-of-signal activity can be used to understand changes in the cell viability.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a viability function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily
is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected C3RL4 cells were seeded at 1,000/well in 5 uL of assay
medium into white solid 1,536-well tissue culture plates (Greiner Bio-One North America Inc. Monroe, NC) using
a Multidrop Combi (Thermo Scientific, Waltham, MA) dispenser. After incubation at 37 C and 5% C02 for an
overnight, 23 nL of compounds dissolved in DMSO, positive control or DMSO only were transferred by a Pintool
station (Kalypsys, San Diego, CA). After 6 hr incubation at 37 C and 5% C02, 5 uL of Amplite Luciferase reagent
(AAT Bioquest Inc. Sunnyvale, CA) was added to each assay plate using a Flying Reagent Dispenser (FRD) (Aurora
Discovery, San Diego, CA). Luminescence was quantified on a ViewLux plate reader (PerkinElmer, Waltham, MA)
after 30 min incubation at room temperature.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

-------
The Tox21 retinoic acid response element (RARE) agonism luciferase assay screened a library of diverse
environmental compounds to probe for xenobiotic all-trans retinoic acid (atRA) ligand-binding and potential to
activate RAR mediated gene expression, monitored through luciferase reporter gene signal activation. C3RL4
cell line (OARSA/CFSAN/FDA, Laurel, MD) was used to screen Tox21 10K compound library for identifying
activators of the RSP. The C3RL4 clone contains a functional retinol (vitamin A) signaling pathway (RSP) and the
firefly luciferase gene (Luc) under the control of the RARE. To differentiate true RARE agonist from cytotoxic
substances, the assay is multiplexed with cell viability assay. The assay is run in triplicate on 1536-well microplate
and bioluminescence was measured following 6-hour incubation of cells with test compounds and 30 min
incubation of test system with ONE-GloTM luciferase assay reagent. The bioluminescent signal was monitored
using a Promega ViewLux plate reader. Following the incubation period, the cell culture was screened for
bioluminescent signals in agonist mode using luciferase detection technology. Compound auto-fluorescence
was monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background
artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. All-trans Retinoic acid (atRA) is the potent natural metabolite
of retinol and plays an important role in normal embryonic development and maintenance of cellular phenotype
in adult animals. atRA is the activating ligand for the retinoic acid receptors (RARs) which form heterodimers
with retinoid X receptors (RXRs) on the retinoic acid response element (RARE). Through activation of the
RAR/RXRs nuclear receptors, atRA regulates the transcription of a large number of protein-coding genes and
regulatory RNAs. Intracellular levels of atRA are controlled by the retinol signaling pathway (RSP) that regulates
the biosynthesis and catabolism of atRA to maintain physiological levels. Chemicals that interfere with the RSP
can cause abnormal intracellular levels of atRA and therefore are potential developmental toxicants. To assess
compounds for the potential adverse effect on embryonic development through interfering with retinol
signaling, a cell-based RARE luciferase reporter gene assay was used. C3RL4 cell line (OARSA/CFSAN/FDA, Laurel,
MD) was used to screen Tox2110K compound library for identifying activators of the RSP. Also cytotoxicity was
assessed by determining the viability of cells based on the quantitation of ATP.

2.3	Experimental System: adherent C3H10T1/2 cell line used. The C3H10T1/2 [clone8] (C3RL4) cell line was
obtained from ATCC (Catalog #CCL-226, Lot #58613480). The stably transfected C3RL4 line exhibits fibroblast
morphology that was isolated from a line of C3H mouse embryo cells. The C3RL4 clone contains a functional RSP
and the firefly luciferase gene (Luc) under the control of the RARE.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Maintain cells below 85% confluence. Fetal Bovine serum used
for cell culture and assay purpose is heat inactivated at 56C for 30min. Extra precautions to be taken for making
Retinol as it is photosensitive and moisture absorbent. Thawing method: Thaw a vial of cells in 9ml of pre-
warmed thaw medium and then centrifuge. Resuspend the pellet with the thaw medium and seed at 2 million
cells per T-75 flask. Cell Proliferation Method: Trypsinize cells from the culturing flask and centrifuge and then
resuspend cells in culture medium. Passage cells at 1-1.5 million perT-225 flask. Assay Protocol: Trypsinize cells
from the culturing flask and centrifuge and then resuspend cells in assay medium at a density of 0.2 X 10A6
cells/mL. Dispense 1000 cells/5uL/well into 1536-well tissue treated white/solid bottom plates using a 8 tip
dispenser (Multidrop). Incubate the plates for an overnight (20hr) at 37C and 5% C02. Transfer 23nL of
compounds from the library collection (5.6nM to 92uM) and positive control (Retinol made fresh from the
powder) through pintool. Incubate the plates for 6hr at 37C and 5% C02. Then add 5ul of Amplite(TM) Luciferase

-------
reagent using a single tip dispense (BioRAPTR). Incubate the plates at room temperature for 30min. Measure
luminescence (exposure time = 60sec) by ViewLux plate reader.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 1.536
Response cutoff threshold used to determine hit calls: 20

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Agonism of the retinol (vitamin A) signaling pathway (RSP) is measured by bioluminescence activity
via a retinoic acid response element (RARE) firefly luciferase reporter gene. Increased luciferase activity can be
used to identify the compounds that promote xenobiotic all-trans retinoic acid (atRA) ligand-binding and RSP
agonism. The cytotoxicity of the compounds screened was tested in parallel by measuring the cell viability using
CellTiter-Fluor assay in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with

-------
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9315	Number of chemicals tested: 7521

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-0.738

Neutral control median absolute deviation, by plate: nmad	2.872

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-147.29%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 612.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1816

T0X21_AR_LU C_M DAKB2_Antagon ist_0.5n M_R1881

1.	General Information

1.1	Assay Title: Tox21 MDa-kb2 Androgen Receptor (AR) Antagonism (0.5nM R1881) Luciferase Assay

1.2	Assay Summary: TOX21 AR LUC MDAKB2 Antagonist 0.5nM R1881 is a cell-based, single-readout assay that
uses MDA-kb2, a human breast cell line, with measurements taken at 24 hours after chemical dosing in a 1536-
well plate. See tox21-ar-mda-kb2-luc-antagonist-p2. TOX21_AR_LUC_MDAKB2_Antagonist_0.5nM_R1881 is
one of one assay component(s) measured or calculated from the
TOX21_AR_LUC_MDAKB2_Antagonist_0.5nM_R1881. It is designed to make measurements of luciferase
induction, a form of inducible reporter, as detected with bioluminescence signals following addition of luciferin
substrate and ATP. Data from the assay component TOX21_AR_LUC_MDAKB2_Antagonist_0.5nM_R1881 was
analyzed into 1 assay endpoint. This assay endpoint, TOX21_AR_LUC_MDAKB2_Antagonist_0.5nM_R1881, was
analyzed in the positive analysis fitting direction relative to DMSO as the negative control and baseline of
activity. Using a type of inducible reporter, loss-of-signal activity can be used to understand changes in the
reporter gene as they relate to the gene AR. Furthermore, this assay endpoint can be referred to as a primary
readout, because the performed assay has only produced 1 assay endpoint. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where
the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected MDA-kb2 cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring luminescence resulting from AR
gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Androgen receptor (AR) is an important member of the nuclear receptor family. Its signaling plays
a critical role in AR-dependent prostate cancer and other androgen related diseases. Considerable attention has
been given in the past decades to develop methods to study and screen for the environmental chemicals that
have the potential to interfere with metabolic homeostasis, reproduction, developmental and behavioral
functions. Changes to bioluminescence signals produced from an enzymatic reaction involving the key substrate

-------
[One-Glo] are indicative of changes in transcriptional gene expression due to antagonist activity regulated by
the human androgen receptor [GeneSymbol:AR | GenelD:367 | Uniprot_SwissProt_Accession:P10275].

The Tox21 androgen receptor antagonism luciferase assay screened a large library of diverse environmental
compounds to probe for xenobiotic androgenic activity and potential to inhibit androgen-dependent
transcription, monitored through luciferase reporter gene signal activity using an AR-luciferase reporter gene
construct. The assay is run in triplicate on a 1536-well microplate and bioluminescence was measured following
15.5 hour incubation of cells with test compounds and 30 min incubation of test system with ONE-GloTM
luciferase assay reagent. The bioluminescent signal was monitored using a Promega ViewLux plate reader.
Following the incubation period, the cell culture was screened for bioluminescent signals in antagonist mode
using luciferase detection technology. Each compound was tested in a concentration-response format, using 15
concentrations ranging from 1.1 nM to 92 uM. To help distinguish true antagonistic activity from cytotoxic
effects, this assay was multiplexed with a fluorescence-based cell viability assay which measured conserved and
constitutive protease activity within live cells (Promega). Compound auto-fluorescence was monitored in
various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.
Concentration-response relationships were determined by monitoring luminescent signals relative to DMSO-
only exposures which provided a signal baseline, and to a known androgen receptor antagonist (Nilutamide) as
a positive control which provided a reference for 100 percent androgen receptor inhibition, as assessed in the
presence of 0.5 uM R1881, a known AR agonist.

2.2 Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Androgen receptors have pleiotropic regulatory roles in a
diverse range of tissues; particularly in mediating the activity of endogenous androgens in the hypothalamus,
pituitary, liver, prostate, and testicular tissues. Endogenous androgens are important for target gene expression
in physiological processes like developmental differentiation of the male embryo, initiation and maintenance of
spermatogenesis and in neuroendocrine system functioning. Endogenous androgens also influence male
pubertal maturation and maintenance of secondary sexual characteristics in adults. Disruption of androgen
signaling following exposure to endocrine-mimicking environmental chemicals can result in hormonal cancers,
impaired reproductive development and infertility. This assay is designed to screen a large, structurally diverse
chemical library to identify compounds capable of interference with endogenous androgenic signaling by
monitoring the increase in luminescent signals relative to a known androgen receptor agonist
(Methyltrienolone) as a positive control, and indicator of androgenic activity. The Tox21 MDA-kb2 AR luciferase
assays are qHTS format assays which measured the ability of a chemical to interact with AR by monitoring
modulation of luminescent reporter gene signals. This assay utilized an epithelial breast cancer cell line which
expresses firefly luciferase under control of a MMTV promoter that contains androgen response elements to
quantify xenobiotic androgen receptor agonism. This assay is intended for use as a part of an integrated testing
strategy, to screen a large structurally diverse chemical library for compounds with the potential to interact with
androgen receptor mediated pathways and potentially affect endocrine systems in exposed populations. There
is strong evidence that androgen receptor agonism is a molecular initiating event (MIE) in an Adverse Outcome
Pathway (AOP) leading to reproductive dysfunction in fish populations (EAGMST Approved AOP), and there is
some evidence that androgen receptor activation is the MIE for a putative pathway leading to hepatocellular
adenomas and carcinomas (in mouse and rat models) (AOP currently under development). Chemical-activity
profiles derived from this assay can inform prioritization decisions for compound selection in more resource
intensive in vivo studies to further investigate the involvement of AR agonism in pathways leading to hazardous
outcomes in biological systems.

-------
2.3	Experimental System: adherent MDA-kb2 cell line used. The MDA-kb2 AR-luc cell line was derived from
epithelial breast cancer cell line, MDA-MB-453 (originally obtained in 1976 from pleural effusion of metastatic
carcinoma from 48-yo Caucasian female) by stable transfection with a mouse mammary tumor virus (MMTV)
neomycin-resistant luciferase reporter gene construct. MDA-MB-453 cells have fibroblastic morphology and
were selected for transformation due to high levels of functional, endogenous androgen and glucocorticoid
receptors while estrogen receptor (ER) alpha and progesterone receptor (PR) mRNA are not detectable, and ER-
beta is apparently expressed at very low levels. This cell line expresses firefly luciferase under control of a MMTV
promoter that contains response elements for both GR and AR. MDA-kb2 may be used in an in vitro assay to
screen androgen agonist and antagonists and to characterize its specificity and sensitivity to endocrine
disrupting chemicals.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Metabolic activity
has been examined for the parental MDA-MB-453 cells. CYP1A1 and CYP1B1 have been shown to be inducible
following TCDD exposures, with exposure to AhR agonists showing highly preferential induction of CYP1B1 as
opposed to CYP1A1.

2.5	Exposure Regime: Quality Control Precautions: Maintain cell culture below 85-90% confluence. Cell culturing
and assay culture doesn't require C02. Cell Thawing Method: Thaw a vial of cells in 9ml of pre-warmed
thaw/culture medium and then centrifuge. Resuspend the pellet with the thaw/culture medium and seed at 2
million cells per T-75 flask. Cell Proliferation Method: Trypsinize cells from the culturing flask and centrifuge
and then resuspend cells in culture medium. Passage cells at 6-7 million per T-225 flask. Assay Protocol:
Trypsinize cells from the culturing flask and centrifuge and then resuspend cells in culture/assay medium.
Dispense 3000 cells/5uL/well (for agonist mode) into 1536-well tissue treated white/solid bottom plates using
a 8 tip dispenser (Multidrop). Incubate the assay plates for 5-6hrs at 37C and 0% C02. Transfer 23nL of
compounds from the library collection and positive control/DMSO into the assay plates through Pintool.
Compound transfer was followed by the addition of luL of 0.5nM (final concentration) R1881 or assay buffer
using 2 tips of a dispenser (BioRAPTR FRD). Incubate the assay plates for 16hrs at 37C and 0% C02. After 15hrs
of incubation at 37C and 0%CO2, lul of CellTiter-Fluor reagent was added using a single tip of a dispenser
(BioRAPTR FRD). Incubate the assay plates at 37C and 0%CO2 for lhr. Read fluorescence using ViewLux plate
reader. Then followed by the addition of 4ul of ONE-Glo(TM) Luciferase reagent using a single tip of a dispenser
(BioRAPTR FRD). Incubate the assay plates at room temperature for 30min. Read Luminescence using ViewLux
plate reader.

Baseline median absolute deviation for the assay (bmad): 5.354
Response cutoff threshold used to determine hit calls: 32.127

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6 Response: Androgen receptor antagonism and inhibited gene expression is measured by monitoring
luminescent production by the luciferase reporter gene under control of androgen response element
promoters. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

Nilutamide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

2.7 Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations

-------
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ToxCast AR Pathway Model: Androgen receptor assays used in ToxCast AR Pathway model. See
10.1016/j.yrtph.2020.104764 and 10.1021/acs.chemrestox.6b00347

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: In cultures stimulated with a known agonist (0.5nM R1881), decreased luminescence (loss-of-
signal) relative to nilutamide signal (positive control, 100 percent antagonist activity), using DMSO (neutral
control) as a signal baseline as a baseline for luciferase induction. Response was reported as a percent of positive
control activity. This is an secondary assay to TOX21_AR_LUC_MDAKB2_Agonist to confirm antagonist
specificity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

-------
Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1763

Inactive hit count: Oihitc 0.9
4681

WINING MODEL SELECTION

NA hit count: hitc^O
3223

Number of sample-assay endpoints with winning hill model:

789
536

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

715

3114

-------
quadratic-polynomialfpoly2) model:	1100

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

42

295

2231

845

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

8.677

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	lnf%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-100

Positive control well median absolute deviation, by plate: pmad	1.896

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-11.153

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 845.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Judson R, Houck K, Paul Friedman K, Brown J, Browne P, Johnston PA, Close DA, Mansouri K,
Kleinstreuer N. Selecting a minimal set of androgen receptor assays for screening chemicals. Regul Toxicol
Pharmacol. 2020 Nov; 117:104764. doi: 10.1016/j.yrtph.2020.104764. Epub 2020 Aug 14. PMID: 32798611;
PMCID: PMC8356084., Kleinstreuer NC, Ceger P, Watt ED, Martin M, Houck K, Browne P, Thomas RS, Casey WM,
Dix DJ, Allen D, Sakamuru S, Xia M, Huang R, Judson R. Development and Validation of a Computational Model
for Androgen Receptor Activity. Chem Res Toxicol. 2017 Apr 17;30(4):946-964. doi:
10.1021/acs.chemrestox.6b00347. Epub 2016 Dec 9. PMID: 27933809; PMCID: PMC5396026.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1817

TOX21_AR_LUC_MDAKB2_Antagonist_0.5nM_R1881_viability

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 MDa-kb2 Androgen Receptor (AR) Antagonism (0.5nM R1881)
Luciferase Assay

1.2	Assay Summary: TOX21 AR LUC MDAKB2 Antagonist 0.5nM R1881 is a cell-based, single-readout assay that
uses MDA-kb2, a human breast cell line, with measurements taken at 24 hours after chemical dosing in a 1536-
well	plate.	See	tox21-ar-mda-kb2-luc-antagonist-p2.
TOX21_AR_LUC_MDAKB2_Antagonist_0.5nM_R1881_viability is an assay readout measuring cellular ATP
content and detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
TOX21_AR_LUC_MDAKB2_Antagonist_0.5nM_R1881_viability used a type of viability reporter where loss-of-
signal activity can be used to understand changes in the cell viability. Furthermore, this assay endpoint can be
referred to as a secondary readout, because this assay has produced multiple assay endpoints where this one
serves a viability function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected MDA-kb2 cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring luminescence resulting from AR
gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 androgen receptor antagonism luciferase assay screened a large library of diverse environmental
compounds to probe for xenobiotic androgenic activity and potential to inhibit androgen-dependent
transcription, monitored through luciferase reporter gene signal activity using an AR-luciferase reporter gene
construct. The assay is run in triplicate on a 1536-well microplate and bioluminescence was measured following
15.5 hour incubation of cells with test compounds and 30 min incubation of test system with ONE-GloTM

-------
luciferase assay reagent. The bioluminescent signal was monitored using a Promega ViewLux plate reader.
Following the incubation period, the cell culture was screened for bioluminescent signals in antagonist mode
using luciferase detection technology. Each compound was tested in a concentration-response format, using 15
concentrations ranging from 1.1 nM to 92 uM. To help distinguish true antagonistic activity from cytotoxic
effects, this assay was multiplexed with a fluorescence-based cell viability assay which measured conserved and
constitutive protease activity within live cells (Promega). Compound auto-fluorescence was monitored in
various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.
Concentration-response relationships were determined by monitoring luminescent signals relative to DMSO-
only exposures which provided a signal baseline, and to a known androgen receptor antagonist (Nilutamide) as
a positive control which provided a reference for 100 percent androgen receptor inhibition, as assessed in the
presence of 0.5 uM R1881, a known AR agonist.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Androgen receptors have pleiotropic regulatory roles in a
diverse range of tissues; particularly in mediating the activity of endogenous androgens in the hypothalamus,
pituitary, liver, prostate, and testicular tissues. Endogenous androgens are important for target gene expression
in physiological processes like developmental differentiation of the male embryo, initiation and maintenance of
spermatogenesis and in neuroendocrine system functioning. Endogenous androgens also influence male
pubertal maturation and maintenance of secondary sexual characteristics in adults. Disruption of androgen
signaling following exposure to endocrine-mimicking environmental chemicals can result in hormonal cancers,
impaired reproductive development and infertility. This assay is designed to screen a large, structurally diverse
chemical library to identify compounds capable of interference with endogenous androgenic signaling by
monitoring the increase in luminescent signals relative to a known androgen receptor agonist
(Methyltrienolone) as a positive control, and indicator of androgenic activity. The Tox21 MDA-kb2 AR luciferase
assays are qHTS format assays which measured the ability of a chemical to interact with AR by monitoring
modulation of luminescent reporter gene signals. This assay utilized an epithelial breast cancer cell line which
expresses firefly luciferase under control of a MMTV promoter that contains androgen response elements to
quantify xenobiotic androgen receptor agonism. This assay is intended for use as a part of an integrated testing
strategy, to screen a large structurally diverse chemical library for compounds with the potential to interact with
androgen receptor mediated pathways and potentially affect endocrine systems in exposed populations. There
is strong evidence that androgen receptor agonism is a molecular initiating event (MIE) in an Adverse Outcome
Pathway (AOP) leading to reproductive dysfunction in fish populations (EAGMST Approved AOP), and there is
some evidence that androgen receptor activation is the MIE for a putative pathway leading to hepatocellular
adenomas and carcinomas (in mouse and rat models) (AOP currently under development). Chemical-activity
profiles derived from this assay can inform prioritization decisions for compound selection in more resource
intensive in vivo studies to further investigate the involvement of AR agonism in pathways leading to hazardous
outcomes in biological systems.

2.3	Experimental System: adherent MDA-kb2 cell line used. The MDA-kb2 AR-luc cell line was derived from
epithelial breast cancer cell line, MDA-MB-453 (originally obtained in 1976 from pleural effusion of metastatic
carcinoma from 48-yo Caucasian female) by stable transfection with a mouse mammary tumor virus (MMTV)
neomycin-resistant luciferase reporter gene construct. MDA-MB-453 cells have fibroblastic morphology and
were selected for transformation due to high levels of functional, endogenous androgen and glucocorticoid
receptors while estrogen receptor (ER) alpha and progesterone receptor (PR) mRNA are not detectable, and ER-
beta is apparently expressed at very low levels. This cell line expresses firefly luciferase under control of a MMTV
promoter that contains response elements for both GR and AR. MDA-kb2 may be used in an in vitro assay to

-------
screen androgen agonist and antagonists and to characterize its specificity and sensitivity to endocrine
disrupting chemicals.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Metabolic activity
has been examined for the parental MDA-MB-453 cells. CYP1A1 and CYP1B1 have been shown to be inducible
following TCDD exposures, with exposure to AhR agonists showing highly preferential induction of CYP1B1 as
opposed to CYP1A1.

2.5	Exposure Regime: Quality Control Precautions: Maintain cell culture below 85-90% confluence. Cell culturing
and assay culture doesn't require C02. Cell Thawing Method: Thaw a vial of cells in 9ml of pre-warmed
thaw/culture medium and then centrifuge. Resuspend the pellet with the thaw/culture medium and seed at 2
million cells per T-75 flask. Cell Proliferation Method: Trypsinize cells from the culturing flask and centrifuge
and then resuspend cells in culture medium. Passage cells at 6-7 million per T-225 flask. Assay Protocol:
Trypsinize cells from the culturing flask and centrifuge and then resuspend cells in culture/assay medium.
Dispense 3000 cells/5uL/well (for agonist mode) into 1536-well tissue treated white/solid bottom plates using
a 8 tip dispenser (Multidrop). Incubate the assay plates for 5-6hrs at 37C and 0% C02. Transfer 23nL of
compounds from the library collection and positive control/DMSO into the assay plates through Pintool.
Compound transfer was followed by the addition of luL of 0.5nM (final concentration) R1881 or assay buffer
using 2 tips of a dispenser (BioRAPTR FRD). Incubate the assay plates for 16hrs at 37C and 0% C02. After 15hrs
of incubation at 37C and 0%CO2, lul of CellTiter-Fluor reagent was added using a single tip of a dispenser
(BioRAPTR FRD). Incubate the assay plates at 37C and 0%CO2 for lhr. Read fluorescence using ViewLux plate
reader. Then followed by the addition of 4ul of ONE-Glo(TM) Luciferase reagent using a single tip of a dispenser
(BioRAPTR FRD). Incubate the assay plates at room temperature for 30min. Read Luminescence using ViewLux
plate reader.

Baseline median absolute deviation for the assay (bmad): 1.737
Response cutoff threshold used to determine hit calls: 20

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Androgen receptor antagonism and inhibited gene expression is measured by monitoring
luminescent production by the luciferase reporter gene under control of androgen response element
promoters. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

-------
2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region, ToxCast AR Pathway Model: Androgen
receptor assays used in ToxCast AR Pathway model. See 10.1016/j.yrtph.2020.104764 and
10.1021/acs.chemrestox.6b00347

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability, and response was reported as a
percent activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

-------
5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
884

Inactive hit count: 0
-------
exponential-4 (exp4) model:
exponential-5 (exp5) model:

2965

680

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

2.92

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

lnf%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed

NA

-------
Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 680.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

-------
5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Judson R, Houck K, Paul Friedman K, Brown J, Browne P, Johnston PA, Close DA, Mansouri K,
Kleinstreuer N. Selecting a minimal set of androgen receptor assays for screening chemicals. Regul Toxicol
Pharmacol. 2020 Nov; 117:104764. doi: 10.1016/j.yrtph.2020.104764. Epub 2020 Aug 14. PMID: 32798611;
PMCID: PMC8356084., Kleinstreuer NC, Ceger P, Watt ED, Martin M, Houck K, Browne P, Thomas RS, Casey WM,
Dix DJ, Allen D, Sakamuru S, Xia M, Huang R, Judson R. Development and Validation of a Computational Model
for Androgen Receptor Activity. Chem Res Toxicol. 2017 Apr 17;30(4):946-964. doi:
10.1021/acs.chemrestox.6b00347. Epub 2016 Dec 9. PMID: 27933809; PMCID: PMC5396026.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1822

TOX21_AR_LUC_MDAKB2_Agonist_3uM_Nilutamide

1.	General Information

1.1	Assay Title: Tox21 MDa-kb2 Androgen Receptor (AR) Agonism (3uM Nilutamide) Luciferase Assay

1.2	Assay Summary: TOX21_AR_LUC_MDAKB2_Agonist_3uM_Nilutamide is a cell-based, single-readout assay that
uses MDA-kb2, a human breast cell line, with measurements taken at 24 hours after chemical dosing in a 1536-
well plate. This is an secondary assay to TOX21_AR_LUC_MDAKB2_Agonist for agonist specificity. This particular
screening for measuring luciferase reporter gene activity using MDA-kb2 cells against Tox21 10K library
compounds is a confirmation study on agonist mode screening in the presence of AR-antagonist compound
(Nilutamide) at 3.0uM final concentration. See tox21-ar-mda-kb2-luc-agonist-p3.
TOX21_AR_LUC_MDAKB2_Agonist_3uM_Nilutamide is one of one assay component(s) measured or calculated
from the TOX21_AR_LUC_MDAKB2_Agonist_3uM_Nilutamide. It is designed to make measurements of
luciferase induction, a form of inducible reporter, as detected with bioluminescence signal following addition of
luciferin substrate and ATP. Data from the assay component
TOX21_AR_LUC_MDAKB2_Agonist_3uM_Nilutamide was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_AR_LUC_MDAKB2_Agonist_3uM_Nilutamide, was analyzed in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene AR.
Furthermore, this assay endpoint can be referred to as a primary readout, because the performed assay has
only produced 1 assay endpoint. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the nuclear receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected MDA-kb2 cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring luminescence resulting from AR
gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_AR_LUC_MDAKB2_Agonist_3uM_Nilutamide was designed to measure changes to
bioluminescence signals produced from an enzymatic reaction involving the key substrate [One-Glo] in the
presence of an AR agonist. Changes are indicative of transcriptional gene expression that may not be due to

-------
direct regulation by the human androgen receptor [GeneSymbol: NR3C4 | GenelD: NR3C4 |
Uniprot_SwissProt_Accession:].

The Tox21 androgen receptor agonism luciferase assay screened a large library of diverse environmental
compounds to probe for xenobiotic androgenic activity and potential to induce androgen-dependent
transcription, monitored through luciferase reporter gene signal activation using an AR-luciferase reporter gene
construct. The assay is run in triplicate on a 1536-well microplate and bioluminescence was measured following
16 hour incubation of cells with test compounds and 30 min incubation of test system with ONE-GloTM
luciferase assay reagent. The bioluminescent signal was monitored using a Promega ViewLux plate reader.
Following the incubation period, the cell culture was screened for bioluminescent signals in agonist mode using
luciferase ATP detection technology. Each compound was tested in a concentration-response format, using 15
concentrations ranging from 1.1 nM to 92 uM. Compound auto-fluorescence was monitored in various
"TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Androgen receptors have pleiotropic regulatory roles in a
diverse range of tissues; particularly in mediating the activity of endogenous androgens in the hypothalamus,
pituitary, liver, prostate, and testicular tissues. Endogenous androgens are important for target gene expression
in physiological processes like developmental differentiation of the male embryo, initiation and maintenance of
spermatogenesis and in neuroendocrine system functioning. Endogenous androgens also influence male
pubertal maturation and maintenance of secondary sexual characteristics in adults. Disruption of androgen
signaling following exposure to endocrine-mimicking environmental chemicals can result in hormonal cancers,
impaired reproductive development and infertility. This assay is designed to screen a large, structurally diverse
chemical library to identify compounds capable of interference with endogenous androgenic signaling by
monitoring the increase in luminescent signals relative to a known androgen receptor agonist
(Methyltrienolone) as a positive control, and indicator of androgenic activity. The Tox21 MDA-kb2 AR luciferase
assays are qHTS format assays which measured the ability of a chemical to interact with AR by monitoring
modulation of luminescent reporter gene signals. This assay utilized an epithelial breast cancer cell line which
expresses firefly luciferase under control of a MMTV promoter that contains androgen response elements to
quantify xenobiotic androgen receptor agonism. This assay is intended for use as a part of an integrated testing
strategy, to screen a large structurally diverse chemical library for compounds with the potential to interact with
androgen receptor mediated pathways and potentially affect endocrine systems in exposed populations. There
is strong evidence that androgen receptor agonism is a molecular initiating event (MIE) in an Adverse Outcome
Pathway (AOP) leading to reproductive dysfunction in fish populations (EAGMST Approved AOP), and there is
some evidence that androgen receptor activation is the MIE for a putative pathway leading to hepatocellular
adenomas and carcinomas (in mouse and rat models) (AOP currently under development). Chemical-activity
profiles derived from this assay can inform prioritization decisions for compound selection in more resource
intensive in vivo studies to further investigate the involvement of AR agonism in pathways leading to hazardous
outcomes in biological systems.

2.3	Experimental System: adherent MDA-kb2 cell line used. The MDA-kb2 AR-luc cell line was derived from
epithelial breast cancer cell line, MDA-MB-453 (originally obtained in 1976 from pleural effusion of metastatic
carcinoma from 48-yo Caucasian female) by stable transfection with a mouse mammary tumor virus (MMTV)
neomycin-resistant luciferase reporter gene construct. MDA-MB-453 cells have fibroblastic morphology and
were selected for transformation due to high levels of functional, endogenous androgen and glucocorticoid
receptors while estrogen receptor (ER) alpha and progesterone receptor (PR) mRNA are not detectable, and ER-
beta is apparently expressed at very low levels. This cell line expresses firefly luciferase under control of a MMTV

-------
promoter that contains response elements for both GR and AR. MDA-kb2 may be used in an in vitro assay to
screen androgen agonist and antagonists and to characterize its specificity and sensitivity to endocrine
disrupting chemicals.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Metabolic activity
has been examined for the parental MDA-MB-453 cells. CYP1A1 and CYP1B1 have been shown to be inducible
following TCDD exposures, with exposure to AhR agonists showing highly preferential induction of CYP1B1 as
opposed to CYP1A1.

2.5	Exposure Regime: Quality Control Precautions: Maintain cell culture below 85-90% confluence. Cell culturing
and assay culture doesn't require C02. Cell Thawing Method: Thaw a vial of cells in 9ml of pre-warmed
thaw/culture medium and then centrifuge. Resuspend the pellet with the thaw/culture medium and seed at 2
million cells per T-75 flask. Cell Proliferation Method: Trypsinize cells from the culturing flask and centrifuge
and then resuspend cells in culture medium. Passage cells at 6-7 million per T-225 flask. Assay Protocol:
Trypsinize cells from the culturing flask and centrifuge and then re-suspend cells in culture/assay medium.
Dispense 3000 cells/4uL/well into 1536-well tissue treated white/solid bottom plates using an 8 tip dispenser
(Multidrop). Incubate the assay plates for 5hrs at 37C and 0% C02. First luL of 3.0uM (final concentration)
Nilutamide or assay buffer was added using two separate tips of a dispenser (BioRAPTR). Then 23nL of
compounds were transferred from the library collection into 5-48 columns and positive control into 1-4 columns
using a Pintool station. Incubate the assay plates for 16hrs at 37C and 0% C02. After 15hrs of incubation, lul of
CellTiter-Fluor(TM) Cell Viability Assay reagent was added using a single tip of a dispenser (BioRAPTR). Incubate
the assay plates at 37C and 0% C02 for lhr. Measure fluorescence signal by ViewLux plate reader (Exposure
time = lsec). Add 4ul of ONE-Glo(TM) Luciferase reagent using a single tip of a dispenser (BioRAPTR). Incubate
the assay plates at room temperature for 30min. Measure luminescence signal by ViewLux plate reader
(Exposure time = 90sec).

Baseline median absolute deviation for the assay (bmad): 0.768
Response cutoff threshold used to determine hit calls: 20

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Androgen receptor agonism and enhanced gene expression is measured by monitoring luminescent
production by the luciferase reporter gene under control of androgen response element promoters. The
cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability using by
CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

R1881

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

-------
2.9 Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: In cultures stimulated with a known antagonist (3uM Nilutamide), increased luminescence (gain-
of-signal) was measured relative to R1881 signal (positive control, 100 percent agonist activity), using DMSO
(neutral control) as a baseline for luciferase induction. Response was reported as a percent of positive control
activity. This is an secondary assay to TOX21_AR_LUC_MDAKB2_Agonist to confirm agonist specificity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning

-------
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
312

Inactive hit count: Oihitc 0.9
8829

WINING MODEL SELECTION

NA hit count: hitc^O
526

Number of sample-assay endpoints with winning hill model:

573
474

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

578

3886

quadratic-polynomial(poly2) model: 871

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

299

31

2394

561

-------
For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

1.191

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

lnf%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

7.512

100

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

NA

-------
Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	13.154

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 561.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

• Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

-------
•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Judson R, Houck K, Paul Friedman K, Brown J, Browne P, Johnston PA, Close DA, Mansouri K,
Kleinstreuer N. Selecting a minimal set of androgen receptor assays for screening chemicals. Regul Toxicol
Pharmacol. 2020 Nov; 117:104764. doi: 10.1016/j.yrtph.2020.104764. Epub 2020 Aug 14. PMID: 32798611;
PMCID: PMC8356084., Kleinstreuer NC, Ceger P, Watt ED, Martin M, Houck K, Browne P, Thomas RS, Casey WM,
Dix DJ, Allen D, Sakamuru S, Xia M, Huang R, Judson R. Development and Validation of a Computational Model
for Androgen Receptor Activity. Chem Res Toxicol. 2017 Apr 17;30(4):946-964. doi:
10.1021/acs.chemrestox.6b00347. Epub 2016 Dec 9. PMID: 27933809; PMCID: PMC5396026.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1823

T0X21_AR_LU C_M DAKB2_Agon ist_3u M_N i I uta mide_via bi lity

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 MDa-kb2 Androgen Receptor (AR) Agonism (3uM Nilutamide)
Luciferase Assay

1.2	Assay Summary: TOX21_AR_LUC_MDAKB2_Agonist_3uM_Nilutamide is a cell-based, single-readout assay that
uses MDA-kb2, a human breast cell line, with measurements taken at 24 hours after chemical dosing in a 1536-
well plate. This is an secondary assay to TOX21_AR_LUC_MDAKB2_Agonist for agonist specificity. This particular
screening for measuring luciferase reporter gene activity using MDA-kb2 cells against Tox21 10K library
compounds is a confirmation study on agonist mode screening in the presence of AR-antagonist compound
(Nilutamide) at 3.0uM final concentration. See tox21-ar-mda-kb2-luc-agonist-p3.
TOX21_AR_LUC_MDAKB2_Agonist_3uM_Nilutamide_viability is an assay readout measuring cellular ATP
content and detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
TOX21_AR_LUC_MDAKB2_Agonist_3uM_Nilutamide_viability used a type of viability reporter where loss-of-
signal activity can be used to understand changes in the cell viability. Furthermore, this assay endpoint can be
referred to as a secondary readout, because this assay has produced multiple assay endpoints where this one
serves a viability function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected MDA-kb2 cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring luminescence resulting from AR
gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 androgen receptor agonism luciferase assay screened a large library of diverse environmental
compounds to probe for xenobiotic androgenic activity and potential to induce androgen-dependent

-------
transcription, monitored through luciferase reporter gene signal activation using an AR-luciferase reporter gene
construct. The assay is run in triplicate on a 1536-well microplate and bioluminescence was measured following
16 hour incubation of cells with test compounds and 30 min incubation of test system with ONE-GloTM
luciferase assay reagent. The bioluminescent signal was monitored using a Promega ViewLux plate reader.
Following the incubation period, the cell culture was screened for bioluminescent signals in agonist mode using
luciferase ATP detection technology. Each compound was tested in a concentration-response format, using 15
concentrations ranging from 1.1 nM to 92 uM. Compound auto-fluorescence was monitored in various
"TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Androgen receptors have pleiotropic regulatory roles in a
diverse range of tissues; particularly in mediating the activity of endogenous androgens in the hypothalamus,
pituitary, liver, prostate, and testicular tissues. Endogenous androgens are important for target gene expression
in physiological processes like developmental differentiation of the male embryo, initiation and maintenance of
spermatogenesis and in neuroendocrine system functioning. Endogenous androgens also influence male
pubertal maturation and maintenance of secondary sexual characteristics in adults. Disruption of androgen
signaling following exposure to endocrine-mimicking environmental chemicals can result in hormonal cancers,
impaired reproductive development and infertility. This assay is designed to screen a large, structurally diverse
chemical library to identify compounds capable of interference with endogenous androgenic signaling by
monitoring the increase in luminescent signals relative to a known androgen receptor agonist
(Methyltrienolone) as a positive control, and indicator of androgenic activity. The Tox21 MDA-kb2 AR luciferase
assays are qHTS format assays which measured the ability of a chemical to interact with AR by monitoring
modulation of luminescent reporter gene signals. This assay utilized an epithelial breast cancer cell line which
expresses firefly luciferase under control of a MMTV promoter that contains androgen response elements to
quantify xenobiotic androgen receptor agonism. This assay is intended for use as a part of an integrated testing
strategy, to screen a large structurally diverse chemical library for compounds with the potential to interact with
androgen receptor mediated pathways and potentially affect endocrine systems in exposed populations. There
is strong evidence that androgen receptor agonism is a molecular initiating event (MIE) in an Adverse Outcome
Pathway (AOP) leading to reproductive dysfunction in fish populations (EAGMST Approved AOP), and there is
some evidence that androgen receptor activation is the MIE for a putative pathway leading to hepatocellular
adenomas and carcinomas (in mouse and rat models) (AOP currently under development). Chemical-activity
profiles derived from this assay can inform prioritization decisions for compound selection in more resource
intensive in vivo studies to further investigate the involvement of AR agonism in pathways leading to hazardous
outcomes in biological systems.

2.3	Experimental System: adherent MDA-kb2 cell line used. The MDA-kb2 AR-luc cell line was derived from
epithelial breast cancer cell line, MDA-MB-453 (originally obtained in 1976 from pleural effusion of metastatic
carcinoma from 48-yo Caucasian female) by stable transfection with a mouse mammary tumor virus (MMTV)
neomycin-resistant luciferase reporter gene construct. MDA-MB-453 cells have fibroblastic morphology and
were selected for transformation due to high levels of functional, endogenous androgen and glucocorticoid
receptors while estrogen receptor (ER) alpha and progesterone receptor (PR) mRNA are not detectable, and ER-
beta is apparently expressed at very low levels. This cell line expresses firefly luciferase under control of a MMTV
promoter that contains response elements for both GR and AR. MDA-kb2 may be used in an in vitro assay to
screen androgen agonist and antagonists and to characterize its specificity and sensitivity to endocrine
disrupting chemicals.

-------
2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized. Metabolic activity
has been examined for the parental MDA-MB-453 cells. CYP1A1 and CYP1B1 have been shown to be inducible
following TCDD exposures, with exposure to AhR agonists showing highly preferential induction of CYP1B1 as
opposed to CYP1A1.

2.5	Exposure Regime: Quality Control Precautions: Maintain cell culture below 85-90% confluence. Cell culturing
and assay culture doesn't require C02. Cell Thawing Method: Thaw a vial of cells in 9ml of pre-warmed
thaw/culture medium and then centrifuge. Resuspend the pellet with the thaw/culture medium and seed at 2
million cells per T-75 flask. Cell Proliferation Method: Trypsinize cells from the culturing flask and centrifuge
and then resuspend cells in culture medium. Passage cells at 6-7 million per T-225 flask. Assay Protocol:
Trypsinize cells from the culturing flask and centrifuge and then re-suspend cells in culture/assay medium.
Dispense 3000 cells/4uL/well into 1536-well tissue treated white/solid bottom plates using an 8 tip dispenser
(Multidrop). Incubate the assay plates for 5hrs at 37C and 0% C02. First luL of 3.0uM (final concentration)
Nilutamide or assay buffer was added using two separate tips of a dispenser (BioRAPTR). Then 23nL of
compounds were transferred from the library collection into 5-48 columns and positive control into 1-4 columns
using a Pintool station. Incubate the assay plates for 16hrs at 37C and 0% C02. After 15hrs of incubation, lul of
CellTiter-Fluor(TM) Cell Viability Assay reagent was added using a single tip of a dispenser (BioRAPTR). Incubate
the assay plates at 37C and 0% C02 for lhr. Measure fluorescence signal by ViewLux plate reader (Exposure
time = lsec). Add 4ul of ONE-Glo(TM) Luciferase reagent using a single tip of a dispenser (BioRAPTR). Incubate
the assay plates at room temperature for 30min. Measure luminescence signal by ViewLux plate reader
(Exposure time = 90sec).

Baseline median absolute deviation for the assay (bmad): 1.894
Response cutoff threshold used to determine hit calls: 20

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Androgen receptor agonism and enhanced gene expression is measured by monitoring luminescent
production by the luciferase reporter gene under control of androgen response element promoters. The
cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability using by
CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

tetraoctylammonium bromide

I arget (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

-------
Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability, and response was reported as a
percent activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active

-------
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
924

Inactive hit count: 0
-------
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	3.947

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	lnf%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-2.432

Positive control well median absolute deviation, by plate: pmad	3.207

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.475

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

-------
Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 549.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

-------
• Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Judson R, Houck K, Paul Friedman K, Brown J, Browne P, Johnston PA, Close DA, Mansouri K,
Kleinstreuer N. Selecting a minimal set of androgen receptor assays for screening chemicals. Regul Toxicol
Pharmacol. 2020 Nov; 117:104764. doi: 10.1016/j.yrtph.2020.104764. Epub 2020 Aug 14. PMID: 32798611;
PMCID: PMC8356084., Kleinstreuer NC, Ceger P, Watt ED, Martin M, Houck K, Browne P, Thomas RS, Casey WM,
Dix DJ, Allen D, Sakamuru S, Xia M, Huang R, Judson R. Development and Validation of a Computational Model
for Androgen Receptor Activity. Chem Res Toxicol. 2017 Apr 17;30(4):946-964. doi:
10.1021/acs.chemrestox.6b00347. Epub 2016 Dec 9. PMID: 27933809; PMCID: PMC5396026.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1839

T0X21_RAR_LU C_Antago n ist

1. General Information

1.1	Assay Title: Tox21 C3RL4 Retinoic Acid Response Element (RARE) Antagonism Luciferase Assay

1.2	Assay Summary: TOX21_RAR_LUC_Antagonist is a cell-based, single-readout assay that uses C3H10T1/2, a
mouse sarcoma cell line, with measurements taken at 6 hours after chemical dosing in a 1536-well plate. See
tox21-rar-antagonist-p2. TOX21_RAR_LUC_Antagonist is one of one assay component(s) measured or
calculated from the TOX21_RAR_LUC_Antagonist assay. It is designed to make measurements of luciferase
induction, a form of inducible reporter, as detected with bioluminescence signals by CellTiter-Glo Luciferase-
coupled ATP quantitation technology Data from the assay component TOX21_RAR_LUC_Antagonist was
analyzed into 1 assay endpoint. This assay endpoint, TOX21_RAR_LUC_Antagonist, was analyzed in the positive
analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of
inducible reporter, loss-of-signal activity can be used to understand changes in the reporter gene as they relate
to the gene RARA. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay
has produced multiple assay endpoints where this one serves a reporter gene function. To generalize the
intended target to other relatable targets, this assay endpoint is annotated to the nuclear receptor intended
target family, where subfamily is non-steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected C3RL4 cells were seeded at 1,000/well in 4 uL of assay
medium into white solid 1,536-well tissue culture plates (Greiner Bio-One North America Inc. Monroe, NC) using
a Multidrop Combi (Thermo Scientific, Waltham, MA) dispenser. After incubation at 37 C and 5% C02 for an
overnight, 23 nL of compounds dissolved in DMSO, positive control or DMSO only were transferred by a Pintool
station (Kalypsys, San Diego, CA). One ul of 1 uM (final concentration) of Retinol was added right after the
compound addition by using a Flying Reagent Dispenser (FRD) (Aurora Discovery, San Diego, CA). After 6 hr
incubation at 37 C and 5% C02, 5 uLof Amplite Luciferase reagent (AAT Bioquest Inc. Sunnyvale, CA) was added
to each assay plate using an FRD. Luminescence was quantified on a ViewLux plate reader (PerkinElmer,
Waltham, MA) after 30 min incubation at room temperature.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: TOX21_RAR_LUC_Antagonist was designed to measure changes to bioluminescence signals
produced from an enzymatic reaction involving the key substrate [One-Glo], Changes are indicative of changes
in transcriptional gene expression due to agonist activity regulated by the human retinoic acid receptor alpha
[GeneSymbokRARA]

The Tox21 retinoic acid response element (RARE) antagonism luciferase assay screened a library of diverse
environmental compounds to probe for xenobiotic all-trans retinoic acid (atRA) ligand-binding and potential to
suppress RAR mediated gene expression, monitored through luciferase reporter gene signal activation. C3RL4
cell line (OARSA/CFSAN/FDA, Laurel, MD) was used to screen Tox21 10K compound library for identifying
activators of the RSP. The C3RL4 clone contains a functional retinol (vitamin A) signaling pathway (RSP) and the
firefly luciferase gene (Luc) under the control of the RARE. To differentiate true RARE antagonist from cytotoxic
substances, the assay is multiplexed with cell viability assay. The assay is run in triplicate on 1536-well microplate
and bioluminescence was measured following 6-hour incubation of cells with test compounds and 30 min
incubation of test system with ONE-GloTM luciferase assay reagent. The bioluminescent signal was monitored
using a Promega ViewLux plate reader. Following the incubation period, the cell culture was screened for
bioluminescent signals in antagonist mode using luciferase detection technology. Compound auto-fluorescence
was monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background
artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. All-trans Retinoic acid (atRA) is the potent natural metabolite
of retinol and plays an important role in normal embryonic development and maintenance of cellular phenotype
in adult animals. atRA is the activating ligand for the retinoic acid receptors (RARs) which form heterodimers
with retinoid X receptors (RXRs) on the retinoic acid response element (RARE). Through activation of the
RAR/RXRs nuclear receptors, atRA regulates the transcription of a large number of protein-coding genes and
regulatory RNAs. Intracellular levels of atRA are controlled by the retinol signaling pathway (RSP) that regulates
the biosynthesis and catabolism of atRA to maintain physiological levels. Chemicals that interfere with the RS P
can cause abnormal intracellular levels of atRA and therefore are potential developmental toxicants. To assess
compounds for the potential adverse effect on embryonic development through interfering with retinol
signaling, a cell-based RARE luciferase reporter gene assay was used. C3RL4 cell line (OARSA/CFSAN/FDA, Laurel,
MD) was used to screen Tox2110K compound library for identifying inhibitors of the RSP. Also cytotoxicity was
assessed by determining the viability of cells based on the quantitation of ATP.

2.3	Experimental System: adherent C3H10T1/2 cell line used. The C3H10T1/2 [clone8] (C3RL4) cell line was
obtained from ATCC (Catalog #CCL-226, Lot #58613480). The stably transfected C3RL4 line exhibits fibroblast
morphology that was isolated from a line of C3H mouse embryo cells. The C3RL4 clone contains a functional RSP
and the firefly luciferase gene (Luc) under the control of the RARE.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Maintain cells below 85% confluence. Fetal Bovine serum used
for cell culture and assay purpose is heat inactivated at 56C for 30min. Extra precautions to be taken for making
Retinol as it is photosensitive and moisture absorbent. Thawing method: Thaw a vial of cells in 9ml of pre-
warmed thaw medium and then centrifuge. Resuspend the pellet with the thaw medium and seed at 2 million
cells per T-75 flask. Cell Proliferation Method: Trypsinize cells from the culturing flask and centrifuge and then
resuspend cells in culture medium. Passage cells at 1-1.5 million perT-225 flask. Assay Protocol: Trypsinize cells
from the culturing flask and centrifuge and then resuspend cells in assay medium at a density of 0.25 X 10A6

-------
cells/mL Dispense 1000 cells/4uL/well into 1536-well tissue treated white/solid bottom plates using a 8 tip
dispenser (Multidrop). Incubate the plates for an overnight (20hr) at 37C and 5% C02. Transfer 23nL of
compounds from the library collection (5.6nM to 92uM) and positive control through pintool. Compound
transfer was followed by the addition of lul of luM (final concentration) Retinol (Retinol made fresh from the
powder) or assay buffer using two different tips of a BioRAPTR. Incubate the plates for 6hr at 37C and 5% C02.
Add 5ul of Amplite(TM) Luciferase reagent using a single tip dispense (BioRAPTR). Incubate the plates at room
temperature for 30min. Measure luminescence (exposure time = 60sec) by ViewLux plate reader.

Baseline median absolute deviation for the assay (bmad): 4.959
Response cutoff threshold used to determine hit calls: 29.754

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Antagonism of the retinol (vitamin A) signaling pathway (RSP) is measured by bioluminescence
activity via a retinoicacid response element (RARE) firefly luciferase reporter gene. Decreased luciferase activity
can be used to identify the compounds that promote xenobiotic all-trans retinoic acid (atRA) ligand-binding and
RSP antagonism. The cytotoxicity of the compounds screened was tested in parallel by measuring the cell
viability using CellTiter-Fluor assay in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Decreased luminescence (loss-of-signal) was measured relative to ER50891 (positive control)
signal, using DMSO (neutral control) as a baseline for luciferase induction. Response was reported as a percent
of positive control activity.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

ER50891

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

NA

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical

-------
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: Oihitc 0.9	NA hit count: hitc^O

944	4951	3772

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	528

gain-loss (gnls) model:	483

power(pow) model:	958

linear-polynomial (polyl) model:	3532

quadratic-polynomial(poly2) model:	1179

exponential-2 (exp2) model:	428

exponential-3 (exp3) model:	75

exponential-4 (exp4) model:	1777

exponential-5 (exp5) model:	707

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3 Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.

-------
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-11.173

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

0

8.571
lnf%

-100
2.316

NA

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 707.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1840

TOX21_RAR_LU C_Antagon ist_via bi lity

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 C3RL4 Retinoic Acid Response Element (RARE) Antagonism
Luciferase Assay

1.2	Assay Summary: TOX21_RAR_LUC_Antagonist is a cell-based, single-readout assay that uses C3H10T1/2, a
mouse sarcoma cell line, with measurements taken at 6 hours after chemical dosing in a 1536-well plate. See
tox21-rar-antagonist-p2. TOX21_RAR_LUC_Antagonist_viability is an assay readout measuring cellular ATP
content and detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
TOX21_RAR_LUC_Antagonist_viability used a type of viability reporter where loss-of-signal activity can be used
to understand changes in the cell viability. Furthermore, this assay endpoint can be referred to as a secondary
readout, because this assay has produced multiple assay endpoints where this one serves a viability function.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the cell cycle
intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected C3RL4 cells were seeded at 1,000/well in 4 uL of assay
medium into white solid 1,536-well tissue culture plates (Greiner Bio-One North America Inc. Monroe, NC) using
a Multidrop Combi (Thermo Scientific, Waltham, MA) dispenser. After incubation at 37 C and 5% C02 for an
overnight, 23 nL of compounds dissolved in DMSO, positive control or DMSO only were transferred by a Pintool
station (Kalypsys, San Diego, CA). One ul of 1 uM (final concentration) of Retinol was added right after the
compound addition by using a Flying Reagent Dispenser (FRD) (Aurora Discovery, San Diego, CA). After 6 hr
incubation at 37 C and 5% C02, 5 uLof CellTiter-Glo reagent (Promega Corporation, Madison, Wl) was added to
each assay plate using an FRD. Luminescence was quantified on a ViewLux plate reader (PerkinElmer, Waltham,
MA) after 30 min incubation at room temperature.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

-------
The Tox21 retinoic acid response element (RARE) antagonism luciferase assay screened a library of diverse
environmental compounds to probe for xenobiotic all-trans retinoic acid (atRA) ligand-binding and potential to
suppress RAR mediated gene expression, monitored through luciferase reporter gene signal activation. C3RL4
cell line (OARSA/CFSAN/FDA, Laurel, MD) was used to screen Tox21 10K compound library for identifying
activators of the RSP. The C3RL4 clone contains a functional retinol (vitamin A) signaling pathway (RSP) and the
firefly luciferase gene (Luc) under the control of the RARE. To differentiate true RARE antagonist from cytotoxic
substances, the assay is multiplexed with cell viability assay. The assay is run in triplicate on 1536-well microplate
and bioluminescence was measured following 6-hour incubation of cells with test compounds and 30 min
incubation of test system with ONE-GloTM luciferase assay reagent. The bioluminescent signal was monitored
using a Promega ViewLux plate reader. Following the incubation period, the cell culture was screened for
bioluminescent signals in antagonist mode using luciferase detection technology. Compound auto-fluorescence
was monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background
artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. All-trans Retinoic acid (atRA) is the potent natural metabolite
of retinol and plays an important role in normal embryonic development and maintenance of cellular phenotype
in adult animals. atRA is the activating ligand for the retinoic acid receptors (RARs) which form heterodimers
with retinoid X receptors (RXRs) on the retinoic acid response element (RARE). Through activation of the
RAR/RXRs nuclear receptors, atRA regulates the transcription of a large number of protein-coding genes and
regulatory RNAs. Intracellular levels of atRA are controlled by the retinol signaling pathway (RSP) that regulates
the biosynthesis and catabolism of atRA to maintain physiological levels. Chemicals that interfere with the RSP
can cause abnormal intracellular levels of atRA and therefore are potential developmental toxicants. To assess
compounds for the potential adverse effect on embryonic development through interfering with retinol
signaling, a cell-based RARE luciferase reporter gene assay was used. C3RL4 cell line (OARSA/CFSAN/FDA, Laurel,
MD) was used to screen Tox2110K compound library for identifying inhibitors of the RSP. Also cytotoxicity was
assessed by determining the viability of cells based on the quantitation of ATP.

2.3	Experimental System: adherent C3H10T1/2 cell line used. The C3H10T1/2 [clone8] (C3RL4) cell line was
obtained from ATCC (Catalog #CCL-226, Lot #58613480). The stably transfected C3RL4 line exhibits fibroblast
morphology that was isolated from a line of C3H mouse embryo cells. The C3RL4 clone contains a functional RSP
and the firefly luciferase gene (Luc) under the control of the RARE.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Maintain cells below 85% confluence. Fetal Bovine serum used
for cell culture and assay purpose is heat inactivated at 56C for 30min. Extra precautions to be taken for making
Retinol as it is photosensitive and moisture absorbent. Thawing method: Thaw a vial of cells in 9ml of pre-
warmed thaw medium and then centrifuge. Resuspend the pellet with the thaw medium and seed at 2 million
cells per T-75 flask. Cell Proliferation Method: Trypsinize cells from the culturing flask and centrifuge and then
resuspend cells in culture medium. Passage cells at 1-1.5 million perT-225 flask. Assay Protocol: Trypsinize cells
from the culturing flask and centrifuge and then resuspend cells in assay medium at a density of 0.25 X 10A6
cells/mL. Dispense 1000 cells/4uL/well into 1536-well tissue treated white/solid bottom plates using a 8 tip
dispenser (Multidrop). Incubate the plates for an overnight (20hr) at 37C and 5% C02. Transfer 23nL of
compounds from the library collection (5.6nM to 92uM) and positive control through pintool. Compound
transfer was followed by the addition of lul of luM (final concentration) Retinol (Retinol made fresh from the
powder) or assay buffer using two different tips of a BioRAPTR. Incubate the plates for 6hr at 37C and 5% C02.

-------
Add 5ul of Amplite(TM) Luciferase reagent using a single tip dispense (BioRAPTR). Incubate the plates at room
temperature for 30min. Measure luminescence (exposure time = 60sec) by ViewLux plate reader.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 1.812
Response cutoff threshold used to determine hit calls: 20

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Antagonism of the retinol (vitamin A) signaling pathway (RSP) is measured by bioluminescence
activity via a retinoicacid response element (RARE) firefly luciferase reporter gene. Decreased luciferase activity
can be used to identify the compounds that promote xenobiotic all-trans retinoic acid (atRA) ligand-binding and
RSP antagonism. The cytotoxicity of the compounds screened was tested in parallel by measuring the cell
viability using CellTiter-Fluor assay in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.

-------
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with

-------
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.396

Neutral control median absolute deviation, by plate: nmad	3.498

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	134.04%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 603.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1844

TOX21_APl_BLA_Agonist_chl

1. General Information

1.1	Assay Title: Tox21 ME-180 Activator protein-1 (AP-1) Agonism Beta-lactamase Assay, Channel 1 Readout of
Uncleaved Substrate

1.2	Assay Summary: TOX21_APl_BLA_Agonist is a cell-based, single-readout assay that uses ME-180, a human
cervix cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-apl-
agonist-pl. TOX21_APl_BLA_Agonist_chl is an assay readout measuring reporter gene via transcription factor
activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase
reporter gene. The signal is derived from the uncleaved reporter gene substrate and is used to calculate the
ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay
component TOX21_APl_BLA_Agonist_chl was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_APl_BLA_Agonist_chl, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene JUN | FOS |JUN. Furthermore, this assay
endpoint can be referred to as a secondary readout, because this assay has produced multiple assay endpoints
where this one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. 2500 cells/5uL in Opti-MEM medium containing 0.5% dialyzed FBS, 0.1 mM
NEAA, 1 mM sodium pyruvate and 10 mM HEPES was dispensed into 1536-well black wall/clear bottom plates
using a Multidrop dispenser and cells were cultured at 37Celsius overnight. Next day 23 nL of compounds
dissolved in DMSO, positive controls or DMSO were delivered to each well using a pin tool. The plates were
incubated at 37Celsius for 5 hours. 1 uL of LiveBLAzerTM B/G FRET substrate (Solution A+B+C+D) was added to
each well using a Flying Reagent Dispenser. After incubated at room temperature for 2.5 hours the plates were
measured on an EnVision plate reader at Excitation 405nm, Emissionl=460nm and Emission 2=530nm. The
%Activity was determined from the ratio of 460nm/530nm. Afterthe plates were read for Beta-Lactamase assay
on Envision; 3 ul Cell Titer Glo (Promega) was added to measure the cytotoxicity and the plates were then
incubated at RT in dark for 30 min. The luminescence was read on ViewLux (Perkin Elmer, Shelton, CT) using 6
sec exposure. ViewLux data was expressed as relative luminescence units.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: TOX21_APl_BLA_Agonist_chl was designed to measure uncleaved reporter gene substrate to
target transcription factor activity, specifically mapping to JUN gene(s) using a positive control of EGF

The Tox21 activator protein (AP-1) agonism beta-lactamase assay screened a library of diverse environmental
chemicals to probe for xenobiotic c-Fos and c-Jun binding ligand-binding and potential to induce activator
protein-1 dependent transcription, monitored through bla reporter gene signal activation using a mammalian
one-hybrid GAL4 system. The assay is run in triplicate on 1536-well microplates. ME-180 cells are plated the day
of the assay and following 37 hour incubation of cells with test compounds a membrane-permeable FRET-based
substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap the trap
the negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of
product to substrate fluorescence. Fluorescence signals are monitored using an Envision plate reader. Each
compound was tested in a concentration-response format. Compound auto-fluorescence was monitored in
various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.
Concentration-response relationships were determined by monitoring luminescent signals relative to DMSO-
only exposures which provided a signal baseline, and to a known AP-1 agonist (EGF) as a positive control, which
provided an indication of 100 percent AP-1 receptor activation.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent ME-180 cell line used. ME-180 is a cell line exhibiting epithelial morphology
that was isolated from the uterus of a 66-year-old, White female patient with epidermoid carcinoma.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Cell Media Required: Growth Medium (90% Alpha MEM, 10% Premium FBS, 400mg/mL
G418), Assay (90% Alpha MEM, 10% Premium FBS), Thaw (90% Alpha MEM, 10% Premium FBS), Freezing (100%
Recovery cell culture freezing medium). Thawing method: Place 14mLof pre-warmed thaw media into T75 flask.
Remove vial of cells to be thawed from -140 and thaw rapidly by placing in water bath with gentle agitation for
l-2min. Wipe vial with 70% ethanol before opening in biological safety cabinet. Transfer vial contents dropwise
into lOmLof thaw medium in 15mL conical tube. Centrifuge cells at 1000 rpm for4min. Resuspend and transfer
contents into T75 flask containing thaw medium and transfer flask into incubator. Switch to growth media at
first passage. Cell Proliferation Method: Aspirate media, rinse once with dPBS, add 0.25% trypsin/EDTA and
swirl to coat flask evenly. Add equal volume of growth medium to inactivate trypsin after 2-3 minutes
incubation. Centrifuge cells at 1000 RPM for 4min and resuspend in growth medium before adding to new flask.
Cells should be passaged or fed at least twice per week. Assay Protocol: Harvest cells from culture in growth
medium and resuspend in assay medium. Dispense 4000 cells per well into 1536-well tissue culture treated
white solid bottom plate using multidrop dispenser. Incubate cells 5hr, then dispense 23nL of compound,
positive control, or DMSO control using pintool. Positive and control compounds are located in the first four
columns according to the plate map, and library compounds located in columns 5-48. Incubate plates for 19hr
at 37C. Add luL of CellTiter fluor to each well using BioRAPTR dispenser. Incubate 30min at 37C. Measure
fluorescence using ViewLux. Add 5uLof Onego using BioRAPTR dispenser. Incubate 30min at room temperature.
Read luminescence on ViewLux.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

EGF

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

-------
Baseline median absolute deviation for the assay (bmad): 2.576

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Activator protein-1 signaling pathway agonism as monitored by FRET emissions resulting from
GAL4/-beta-lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel
by measuring the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
EGF was used as a positive AP-1 agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667

Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1035

Inactive hit count: 0
-------
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed

Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)

Positive control signal-to-background: (pmed/nmed)

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed

Negative control well median absolute deviation value, by plate: mmad

Z Prime Factor for median negative and neutral control across all plates:

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

-2.516
7.017
-226.89%

-28.487
4.109

NA

-3.066

NA
NA

NA
NA
NA

NA

NA

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 715.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1845

TOX21_APl_BLA_Agonist_ch2

1. General Information

1.1	Assay Title: Tox21 ME-180 Activator protein-1 (AP-1) Agonism Beta-lactamase Assay, Channel 2 Readout of
Cleaved Substrate

1.2	Assay Summary: TOX21_APl_BLA_Agonist is a cell-based, single-readout assay that uses ME-180, a human
cervix cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-apl-
agonist-pl. TOX21_APl_BLA_Agonist_ch2 is an assay readout measuring reporter gene via receptor activity and
designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter gene.
The signal is derived from the cleaved reporter gene substrate and is used to calculate the ratio of cleaved (ch2)
to uncleaved (chl) substrate used as the measure of target activity. Data from the assay component
TOX21_APl_BLA_Agonist_ch2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_APl_BLA_Agonist_ch2, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, increased activity can be used to
understand changes in the reporter gene as they relate to the gene JUN|FOS|JUN. Furthermore, this assay
endpoint can be referred to as a secondary readout, because this assay has produced multiple assay endpoints
where this one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. 2500 cells/5uL in Opti-MEM medium containing 0.5% dialyzed FBS, 0.1 mM
NEAA, 1 mM sodium pyruvate and 10 mM HEPES was dispensed into 1536-well black wall/clear bottom plates
using a Multidrop dispenser and cells were cultured at 37Celsius overnight. Next day 23 nL of compounds
dissolved in DMSO, positive controls or DMSO were delivered to each well using a pin tool. The plates were
incubated at 37Celsius for 5 hours. 1 uL of LiveBLAzerTM B/G FRET substrate (Solution A+B+C+D) was added to
each well using a Flying Reagent Dispenser. After incubated at room temperature for 2.5 hours the plates were
measured on an EnVision plate reader at Excitation 405nm, Emissionl=460nm and Emission 2=530nm. The
%Activity was determined from the ratio of 460nm/530nm. After the plates were read for Beta-Lactamase assay
on Envision; 3 ul Cell Titer Glo (Promega) was added to measure the cytotoxicity and the plates were then
incubated at RT in dark for 30 min. The luminescence was read on ViewLux (Perkin Elmer, Shelton, CT) using 6
sec exposure. ViewLux data was expressed as relative luminescence units.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: TOX21_APl_BLA_Agonist_ch2 was designed to measure cleaved reporter gene substrate to target
transcription factor activity, specifically mapping to JUN gene(s) using a positive control of EGF

The Tox21 activator protein (AP-1) agonism beta-lactamase assay screened a library of diverse environmental
chemicals to probe for xenobiotic c-Fos and c-Jun binding ligand-binding and potential to induce activator
protein-1 dependent transcription, monitored through bla reporter gene signal activation using a mammalian
one-hybrid GAL4 system. The assay is run in triplicate on 1536-well microplates. ME-180 cells are plated the day
of the assay and following 37 hour incubation of cells with test compounds a membrane-permeable FRET-based
substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap the trap
the negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of
product to substrate fluorescence. Fluorescence signals are monitored using an Envision plate reader. Each
compound was tested in a concentration-response format. Compound auto-fluorescence was monitored in
various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.
Concentration-response relationships were determined by monitoring luminescent signals relative to DMSO-
only exposures which provided a signal baseline, and to a known AP-1 agonist (EGF) as a positive control, which
provided an indication of 100 percent AP-1 receptor activation.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent ME-180 cell line used. ME-180 is a cell line exhibiting epithelial morphology
that was isolated from the uterus of a 66-year-old, White female patient with epidermoid carcinoma.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Cell Media Required: Growth Medium (90% Alpha MEM, 10% Premium FBS, 400mg/mL
G418), Assay (90% Alpha MEM, 10% Premium FBS), Thaw (90% Alpha MEM, 10% Premium FBS), Freezing (100%
Recovery cell culture freezing medium). Thawing method: Place 14mLof pre-warmed thaw media into T75 flask.
Remove vial of cells to be thawed from -140 and thaw rapidly by placing in water bath with gentle agitation for
l-2min. Wipe vial with 70% ethanol before opening in biological safety cabinet. Transfer vial contents dropwise
into lOmLof thaw medium in 15mL conical tube. Centrifuge cells at 1000 rpm for4min. Resuspend and transfer
contents into T75 flask containing thaw medium and transfer flask into incubator. Switch to growth media at
first passage. Cell Proliferation Method: Aspirate media, rinse once with dPBS, add 0.25% trypsin/EDTA and
swirl to coat flask evenly. Add equal volume of growth medium to inactivate trypsin after 2-3 minutes
incubation. Centrifuge cells at 1000 RPM for 4min and resuspend in growth medium before adding to new flask.
Cells should be passaged or fed at least twice per week. Assay Protocol: Harvest cells from culture in growth
medium and resuspend in assay medium. Dispense 4000 cells per well into 1536-well tissue culture treated
white solid bottom plate using multidrop dispenser. Incubate cells 5hr, then dispense 23nL of compound,
positive control, or DMSO control using pintool. Positive and control compounds are located in the first four
columns according to the plate map, and library compounds located in columns 5-48. Incubate plates for 19hr
at 37C. Add luL of CellTiter fluor to each well using BioRAPTR dispenser. Incubate 30min at 37C. Measure
fluorescence using ViewLux. Add 5uLof Onego using BioRAPTR dispenser. Incubate 30min at room temperature.
Read luminescence on ViewLux.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

EGF

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

-------
Baseline median absolute deviation for the assay (bmad): 3.551
Response cutoff threshold used to determine hit calls: 21.308
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Activator protein-1 signaling pathway agonism as monitored by FRET emissions resulting from
GAL4/-beta-lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel
by measuring the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
EGF was used as a positive AP-1 agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667

Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
691

Inactive hit count: 0
-------
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	8.252

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	116.22%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	98.424

Positive control well median absolute deviation, by plate: pmad	6.212

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	9.326

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 730.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1846

TOX21_APl_BLA_Agonist_ratio

1. General Information

1.1	Assay Title: Tox21 ME-180 Activator protein-1 (AP-1) Agonism Beta-lactamase Assay, Ratio

1.2	Assay Summary: TOX21_APl_BLA_Agonist is a cell-based, single-readout assay that uses ME-180, a human
cervix cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-apl-
agonist-pl. TOX21_APl_BLA_Agonist_ratio is an assay readout measuring reporter gene via transcription factor
activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase
reporter gene. The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) reporter gene substrate
used as the measure of target activity. Data from the assay component TOX21_APl_BLA_Agonist_ratio was
analyzed into 1 assay endpoint. This assay endpoint, TOX21_APl_BLA_Agonist_ratio, was analyzed in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of inducible reporter, gain-of-signal activity can be used to understand changes in the reporter gene as they
relate to the gene JUN|FOS|JUN. Furthermore, this assay endpoint can be referred to as a primary readout,
because this assay has produced multiple assay endpoints and this ratio serves a reporter gene function to
understand target activity. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the dna binding intended target family, where the subfamily is basic leucine zipper.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. 2500 cells/5uL in Opti-MEM medium containing 0.5% dialyzed FBS, 0.1 mM
NEAA, 1 mM sodium pyruvate and 10 mM HEPES was dispensed into 1536-well black wall/clear bottom plates
using a Multidrop dispenser and cells were cultured at 37Celsius overnight. Next day 23 nL of compounds
dissolved in DMSO, positive controls or DMSO were delivered to each well using a pin tool. The plates were
incubated at 37Celsius for 5 hours. 1 uL of LiveBLAzerTM B/G FRET substrate (Solution A+B+C+D) was added to
each well using a Flying Reagent Dispenser. After incubated at room temperature for 2.5 hours the plates were
measured on an EnVision plate reader at Excitation 405nm, Emissionl=460nm and Emission 2=530nm. The
%Activity was determined from the ratio of 460nm/530nm. After the plates were read for Beta-Lactamase assay
on Envision; 3 ul Cell Titer Glo (Promega) was added to measure the cytotoxicity and the plates were then
incubated at RT in dark for 30 min. The luminescence was read on ViewLux (Perkin Elmer, Shelton, CT) using 6
sec exposure. ViewLux data was expressed as relative luminescence units.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

-------
2. Test Method Description

2.1	Purpose: TOX21_APl_BLA_Agonist_ratio was designed to target transcription factor activity, specifically
mapping to JUN gene(s) using a positive control of EGF

The Tox21 activator protein (AP-1) agonism beta-lactamase assay screened a library of diverse environmental
chemicals to probe for xenobiotic c-Fos and c-Jun binding ligand-binding and potential to induce activator
protein-1 dependent transcription, monitored through bla reporter gene signal activation using a mammalian
one-hybrid GAL4 system. The assay is run in triplicate on 1536-well microplates. ME-180 cells are plated the day
of the assay and following 37 hour incubation of cells with test compounds a membrane-permeable FRET-based
substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap the trap
the negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of
product to substrate fluorescence. Fluorescence signals are monitored using an Envision plate reader. Each
compound was tested in a concentration-response format. Compound auto-fluorescence was monitored in
various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.
Concentration-response relationships were determined by monitoring luminescent signals relative to DMSO-
only exposures which provided a signal baseline, and to a known AP-1 agonist (EGF) as a positive control, which
provided an indication of 100 percent AP-1 receptor activation.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Activator protein-1 (AP-1), a transcription factor, plays an important
role in tumor genesis by regulating genes involved in cell proliferation, differentiation, apoptosis, and
angiogenesis. AP-1 activity is induced by a complex network of signaling pathways that involves extracellular
signals, such as growth factors. The AP-1 protein complex formed from c-Fos and c-Jun binding to the AP-1
response element results in transcriptional activation of genes containing such elements within their promoter.
Low molecular weight molecules have been discovered and developed to modulate various components of
these signaling pathways, for example, MAP kinase inhibitors. However, many points of pathway modulation
remain untested due to the lack of chemical probes making this signaling locus of general interest.

2.3	Experimental System: adherent ME-180 cell line used. ME-180 is a cell line exhibiting epithelial morphology
that was isolated from the uterus of a 66-year-old, White female patient with epidermoid carcinoma.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Cell Media Required: Growth Medium (90% Alpha MEM, 10% Premium FBS, 400mg/mL
G418), Assay (90% Alpha MEM, 10% Premium FBS), Thaw (90% Alpha MEM, 10% Premium FBS), Freezing (100%
Recovery cell culture freezing medium). Thawing method: Place 14mLof pre-warmed thaw media into T75 flask.
Remove vial of cells to be thawed from -140 and thaw rapidly by placing in water bath with gentle agitation for
l-2min. Wipe vial with 70% ethanol before opening in biological safety cabinet. Transfer vial contents dropwise
into lOmLof thaw medium in 15mL conical tube. Centrifuge cells at 1000 rpm for 4min. Resuspend and transfer
contents into T75 flask containing thaw medium and transfer flask into incubator. Switch to growth media at
first passage. Cell Proliferation Method: Aspirate media, rinse once with dPBS, add 0.25% trypsin/EDTA and
swirl to coat flask evenly. Add equal volume of growth medium to inactivate trypsin after 2-3 minutes
incubation. Centrifuge cells at 1000 RPM for 4min and resuspend in growth medium before adding to new flask.
Cells should be passaged or fed at least twice per week. Assay Protocol: Harvest cells from culture in growth
medium and resuspend in assay medium. Dispense 4000 cells per well into 1536-well tissue culture treated

-------
white solid bottom plate using multidrop dispenser. Incubate cells 5hr, then dispense 23nL of compound,
positive control, or DMSO control using pintool. Positive and control compounds are located in the first four
columns according to the plate map, and library compounds located in columns 5-48. Incubate plates for 19hr
at 37C. Add luL of CellTiter fluor to each well using BioRAPTR dispenser. Incubate 30min at 37C. Measure
fluorescence using ViewLux. Add 5uLof Onego using BioRAPTR dispenser. Incubate 30min at room temperature.
Read luminescence on ViewLux.

Baseline median absolute deviation for the assay (bmad): 2.737

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Activator protein-1 signaling pathway agonism as monitored by FRET emissions resulting from
GAL4/-beta-lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel
by measuring the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of dna binding.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

EGF

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

NA

-------
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
EGF was used as a positive AP-1 agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root

-------
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.765

Neutral control median absolute deviation, by plate: nmad	6.397

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	187.8%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	104.332

Positive control well median absolute deviation, by plate: pmad	7.52

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	10.179

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

-------
(1 - ((3 * (mmad + nmad)) / abs(mmed - rimed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 695.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

-------
researcli/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 1847

TOX21_APl_BLA_Agonist_viability

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 ME-180 Activator protein-1 (AP-1) Agonism Beta-lactamase Assay

1.2	Assay Summary: TOX21_APl_BLA_Agonist is a cell-based, single-readout assay that uses ME-180, a human
cervix cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-apl-
agonist-pl. TOX21_APl_BLA_Agonist_viability is an assay readout measuring cellular ATP content and detected
with CellTiter-Glo Luciferase-coupled ATP quantitation. TOX21_APl_BLA_Agonist_viability used a type of
viability reporter where loss-of-signal activity can be used to understand changes in the cell viability.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves a viability function. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle target family, where the subfamily is
cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. 2500 cells/5uL in Opti-MEM medium containing 0.5% dialyzed FBS, 0.1 mM
NEAA, 1 mM sodium pyruvate and 10 mM HEPES was dispensed into 1536-well black wall/clear bottom plates
using a Multidrop dispenser and cells were cultured at 37Celsius overnight. Next day 23 nL of compounds
dissolved in DMSO, positive controls or DMSO were delivered to each well using a pin tool. The plates were
incubated at 37Celsius for 5 hours. 1 uL of LiveBLAzerTM B/G FRET substrate (Solution A+B+C+D) was added to
each well using a Flying Reagent Dispenser. After incubated at room temperature for 2.5 hours the plates were
measured on an EnVision plate reader at Excitation 405nm, Emissionl=460nm and Emission 2=530nm. The
%Activity was determined from the ratio of 460nm/530nm. After the plates were read for Beta-Lactamase assay
on Envision; 3 ul Cell Titer Glo (Promega) was added to measure the cytotoxicity and the plates were then
incubated at RT in dark for 30 min. The luminescence was read on ViewLux (Perkin Elmer, Shelton, CT) using 6
sec exposure. ViewLux data was expressed as relative luminescence units.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

-------
The Tox21 activator protein (AP-1) agonism beta-lactamase assay screened a library of diverse environmental
chemicals to probe for xenobiotic c-Fos and c-Jun binding ligand-binding and potential to induce activator
protein-1 dependent transcription, monitored through bla reporter gene signal activation using a mammalian
one-hybrid GAL4 system. The assay is run in triplicate on 1536-well microplates. ME-180 cells are plated the day
of the assay and following 37 hour incubation of cells with test compounds a membrane-permeable FRET-based
substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap the trap
the negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of
product to substrate fluorescence. Fluorescence signals are monitored using an Envision plate reader. Each
compound was tested in a concentration-response format. Compound auto-fluorescence was monitored in
various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.
Concentration-response relationships were determined by monitoring luminescent signals relative to DMSO-
only exposures which provided a signal baseline, and to a known AP-1 agonist (EGF) as a positive control, which
provided an indication of 100 percent AP-1 receptor activation.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Activator protein-1 (AP-1), a transcription factor, plays an important
role in tumor genesis by regulating genes involved in cell proliferation, differentiation, apoptosis, and
angiogenesis. AP-1 activity is induced by a complex network of signaling pathways that involves extracellular
signals, such as growth factors. The AP-1 protein complex formed from c-Fos and c-Jun binding to the AP-1
response element results in transcriptional activation of genes containing such elements within their promoter.
Low molecular weight molecules have been discovered and developed to modulate various components of
these signaling pathways, for example, MAP kinase inhibitors. However, many points of pathway modulation
remain untested due to the lack of chemical probes making this signaling locus of general interest.

2.3	Experimental System: adherent ME-180 cell line used. ME-180 is a cell line exhibiting epithelial morphology
that was isolated from the uterus of a 66-year-old, White female patient with epidermoid carcinoma.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Cell Media Required: Growth Medium (90% Alpha MEM, 10% Premium FBS, 400mg/mL
G418), Assay (90% Alpha MEM, 10% Premium FBS), Thaw (90% Alpha MEM, 10% Premium FBS), Freezing (100%
Recovery cell culture freezing medium). Thawing method: Place 14mLof pre-warmed thaw media intoT75 flask.
Remove vial of cells to be thawed from -140 and thaw rapidly by placing in water bath with gentle agitation for
l-2min. Wipe vial with 70% ethanol before opening in biological safety cabinet. Transfer vial contents dropwise
into lOmLof thaw medium in 15mL conical tube. Centrifuge cells at 1000 rpm for4min. Resuspend and transfer
contents into T75 flask containing thaw medium and transfer flask into incubator. Switch to growth media at
first passage. Cell Proliferation Method: Aspirate media, rinse once with dPBS, add 0.25% trypsin/EDTA and
swirl to coat flask evenly. Add equal volume of growth medium to inactivate trypsin after 2-3 minutes
incubation. Centrifuge cells at 1000 RPM for 4min and resuspend in growth medium before adding to new flask.
Cells should be passaged or fed at least twice per week. Assay Protocol: Harvest cells from culture in growth
medium and resuspend in assay medium. Dispense 4000 cells per well into 1536-well tissue culture treated
white solid bottom plate using multidrop dispenser. Incubate cells 5hr, then dispense 23nL of compound,
positive control, or DMSO control using pintool. Positive and control compounds are located in the first four
columns according to the plate map, and library compounds located in columns 5-48. Incubate plates for 19hr
at 37C. Add luL of CellTiter fluor to each well using BioRAPTR dispenser. Incubate 30min at 37C. Measure

-------
fluorescence using ViewLux. Add 5uLof Onego using BioRAPTR dispenser. Incubate 30min at room temperature.
Read luminescence on ViewLux.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.00979371647509579 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

765.134099616858 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 3.76
Response cutoff threshold used to determine hit calls: 22.561

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Activator protein-1 signaling pathway agonism as monitored by FRET emissions resulting from
GAL4/-beta-lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel
by measuring the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where

-------
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	-3.895

Neutral control median absolute deviation, by plate: nmad	6.115

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	-111.31%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	4.741

Positive control well median absolute deviation, by plate: pmad	7.849

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	0.882

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 517.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2053

TOX21_ERa_LUC_VM7_Antagonist_0.1nM_E2

1.	General Information

1.1	Assay Title: Tox21 VM7 Estrogen Receptor-alpha (ESR1) Antagonism (O.lnM E2) Luciferase Assay

1.2	Assay Summary: TOX21 ERa LUC VM7 Antagonist O.lnM E2 is a cell-based, single-readout assay that uses
VM7, a human breast tissue cell line, with measurements taken at 48 hours after chemical dosing in a 1536-well
plate. See tox21-er-luc-bgl-4e2-antagonist-p2. TOX21_ERa_LUC_VM7_Antagonist_0.1nM_E2 is one of one
assay component(s) measured or calculated from the TOX21_ERa_LUC_VM7_Antagonist_0.1nM_E2 assay. It is
designed to make measurements of luciferase induction, a form of inducible reporter, as detected with
bioluminescence signals by CellTiter-Glo Luciferase-coupled ATP quantitation technology. Data from the assay
component TOX21_ERa_LUC_VM7_Antagonist_0.1nM_E2 was analyzed into 1 assay endpoint. This assay
endpoint, TOX21_ERa_LUC_VM7_Antagonist_0.1nM_E2, was analyzed in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, loss-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene ESR1.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected VM7 cells are aliquoted into 1536-well microtiter plates
and incubated with test compounds for 24 hours prior to monitoring luminescence resulting from ER gene
expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction involving the key
substrate [One-Glo] are indicative of changes in transcriptional gene expression due to antagonist activity
regulated by the human estrogen receptor 1 [GeneSymbokESRl | GenelD:2099 |
Uniprot_SwissProt_Accession:P03372],

The Tox21 VM7 estrogen receptor alpha antagonism luciferase assay screened a library of diverse
environmental compounds to probe for xenoestrogenic ligand-binding and potential to inhibit estrogen-

-------
dependent transcription, as monitored through luciferase reporter gene signal activity using an endogenous
full-length ERalpha cell line of human breast tissue (VM7) that was stably transfected with an estrogen-
responsive luciferase reporter gene plasmid (pGudLuc7ere). The assay is run in triplicate on 1536-well
microplate and bioluminescence was measured following 24 hour incubation of cells with test compounds and
30 min incubation of test system with ONE-GloTM luciferase assay reagent. The bioluminescent signal was
monitored using a Promega ViewLux plate reader to measure antagonistic activity, this assay is performed with
small amounts of an ER alpha agonist (beta-estradiol, E2) added to each well and each compound is evaluated
against a known ER alpha antagonist (4-Hydroxytamoxifen) as a positive control (100 percent inhibition). Test
compounds were assayed for cytotoxicity by measuring protease activity with Promega CellTiter-Fluor with
tetraoctylammonium bromide as a positive control for cell death. Compound auto-fluorescence was monitored
in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Estrogen receptor (ER), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. Endocrine disrupting chemicals
(EDCs) and their interactions with steroid hormone receptors like ER causes disruption of normal endocrine
function. Therefore, it is important to understand the effect of environmental chemicals on the ER signaling
pathway. To identify ER agonists, VM7-Luc-4E2 cell line (provided by Dr. Michael Denison from University of
California) has been used to screen the Tox21 library of diverse environmental compounds. VM7-Luc-4E2 cell
line endogenously expresses full-length ER-alpha and is stably transfected with a plasmid containing four
estrogen responsive elements (ERE) upstream of a luciferase reporter gene. The ERalpha_LUC_VM7 assays are
qHTS format assays which measure the ability of a chemical to interact with estrogen receptor alpha (ERalpha)
by monitoring modulation of fluorescence reporter gene signals. This assay utilized a human breast (VM7-Luc-
4E2) cell line which expresses endogenous full-length ERalpha and a luciferase reporter gene (ER-luc) to quantify
xenoestrogenic activity. This assay is intended for use as a part of an integrated testing strategy, to screen a
large structurally diverse chemical library for compounds with the potential to interact with estrogen receptor
alpha mediated pathways and potentially affect endocrine systems in exposed populations. There is strong
evidence that estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse Outcome Pathway
(AOP) leading to reproductive dysfunction in oviparous vertebrates, and there is some evidence that estrogen
receptor activation is the MIE for putative AOPs leading to reduced survival due to renal failure and leading to
skewed sex ratios due to altered sexual differentiation in males (all AOPs currently under development).
Chemical-activity profiles derived from this assay can inform prioritization decisions for compound selection in
more resource intensive in vivo studies to further investigate the involvement of ER agonism in pathways leading
to hazardous outcomes in biological systems.

2.3	Experimental System: adherent VM7 cell line used. Michigan Cancer Foundation-7 (MCF-7) cells are a breast
cancer cell line originating from a 69-year old woman in 1970 (Lee et al. 2015). This is an immortalized cell line
which endogenously expresses receptors for estrogen (ESR1 and ESR2) and progesterone (Geisinger et al. 1989)
as well as growth factors EGF and IGF-1 (Baldwin et al. 1998), and so provides an alternative to breast cell lines
for estrogen-sensitive proliferation assays. vM7-Luc-4E2 cells are vMCF7 cells which are stably transfected with
plasmid containing four estrogen responsive elements upstream of a luciferase reporter gene (Rogers and
Denison 2000) to measure transactivation activity occurring along estrogen signaling pathways. This was
previously the BGl-Luc estrogen receptor transactivation test method, but BGlLuc4E2 cells are being renamed
VM7Luc4E2 cells because recent DNA analysis (STR) revealed that the original cell line used to generate the
BGlLuc4E2 cells were not human ovarian carcinoma (BG-1) cells but a variant of human breast cancer (MCF7)
cells.

-------
2.4 Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5 Exposure Regime: Quality Control Precautions: Maintain cells below 85-90% confluence in culture medium. For
assay purpose, the cells should be cultured in assay medium with 10% charcoal stripped FBS for 5 days with
alternate day medium changed to fresh medium. Especially while in assay culture, the cells should not reach
more than 85% confluence as they would become harder to detach if they reach over confluence. Cell Thawing
Method: Thaw a frozen vial of cells in 9ml of pre-warmed medium and seed them in T175 flask at 2 million cells.
Cell Proliferation Method: Trypsinize cells from the flask and centrifuge. Add culture medium to the pellet and
passage at 3-4 million per T-225 flask. Assay Protocol: Harvest cells from the 5-day culture in assay medium and
resuspend cells in assay medium. Dispense 4000 cells/4uL/well into 1536-well tissue treated white/solid bottom
plates using an 8 tip dispenser (Multidrop). Incubate the assay plates for 22hrs at 37C and 5% C02. First luL of
lO.OnM (final concentration) ICI-182,780 (ER-Antagonist) or assay buffer was added using two separate tips of
a dispenser (BioRAPTR). Then transfer 23nL of compounds from the library collection and positive control to the
assay plates by using a Pintool station. Incubate the assay plates for 22hrs at 37C and 5% C02. After 21hrs of
incubation, lul of CellTiter-Fluor(TM) Cell Viability Assay reagent was added using a single tip of a dispenser
(BioRAPTR). Incubate the assay plates at 37C and 5% C02 for lhr. Measure fluorescence signal by ViewLux plate
reader (Exposure time = lsec). Then add 4ul of ONE-Glo(TM) Luciferase reagent using a single tip of a dispenser
(BioRAPTR). Incubate the plates at room temperature for 30min.

Baseline median absolute deviation for the assay (bmad): 3.927
Response cutoff threshold used to determine hit calls: 23.559

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Antagonism of the estrogen receptor (ERa) signaling pathway is measured by bioluminescence
activity via an estrogen-responsive firefly luciferase reporter gene. Decreased luciferase activity can be used to
identify the compounds that inhibit xenoestrogenic ligand-binding and ERalpha antagonism. The cytotoxicity of
the compounds screened was tested in parallel by measuring the cell viability using CellTiter-Fluor assay in the
same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

4-hydroxytamoxifen

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

-------
3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Decreased luminescence (loss-of-signal) was measured relative to 4-hydroxytamoxifen (positive
control) signal, using DMSO (neutral control) as a baseline for luciferase induction. Response was reported as a
percent of positive control activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:

-------
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	6.815

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	lnf%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-100

Positive control well median absolute deviation, by plate: pmad	1.344

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-14.297

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

-------
Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 774.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.

Bibliography: Judson RS, Magpantay FM, Chickarmane V, Haskell C, Tania N, Taylor J, Xia M, Huang R, Rotroff
DM, Filer DL, Houck KA, Martin MT, Sipes N, Richard AM, Mansouri K, Setzer RW, Knudsen TB, Crofton KM,

-------
Thomas RS. Integrated Model of Chemical Perturbations of a Biological Pathway Using 18 In Vitro High-
Throughput Screening Assays for the Estrogen Receptor. Toxicol Sci. 2015 Nov;148(l):137-54. doi:
10.1093/toxsci/kfvl68. Epub 2015 Aug 13. PMID: 26272952; PMCID: PMC4635633., Huang R, Sakamuru S,
Martin MT, Reif DM, Judson RS, Houck KA, Casey W, Hsieh JH, Shockley KR, Ceger P, Fostel J, Witt KL, Tong W,
Rotroff DM, Zhao T, Shinn P, Simeonov A, Dix DJ, Austin CP, Kavlock RJ, Tice RR, Xia M. Profiling of the Tox21
10K compound library for agonists and antagonists of the estrogen receptor alpha signaling pathway. Sci Rep.
2014 Jul 11;4:5664. doi: 10.1038/srep05664. PMID: 25012808; PMCID: PMC4092345.

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2054

TOX21_ERa_LUC_VM7_Antagonist_0.1nM_E2_viability

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 VM7 Estrogen Receptor-alpha (ESR1) Antagonism (O.lnM E2)
Luciferase Assay

1.2	Assay Summary: TOX21 ERa LUC VM7 Antagonist O.lnM E2 is a cell-based, single-readout assay that uses
VM7, a human breast tissue cell line, with measurements taken at 48 hours after chemical dosing in a 1536-well
plate. See tox21-er-luc-bgl-4e2-antagonist-p2. TOX21_ERa_LUC_VM7_Antagonist_0.1nM_E2_viability is an
assay readout measuring cellular ATP content and detected with CellTiter-Glo Luciferase-coupled ATP
quantitation. TOX21_ERa_LUC_VM7_Antagonist_0.1nM_E2_viability used a type of viability reporter where
loss-of-signal activity can be used to understand changes in the cell viability. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves a viability function. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected VM7 cells are aliquoted into 1536-well microtiter plates
and incubated with test compounds for 24 hours prior to monitoring luminescence resulting from ER gene
expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 VM7 estrogen receptor alpha antagonism luciferase assay screened a library of diverse
environmental compounds to probe for xenoestrogenic ligand-binding and potential to inhibit estrogen-
dependent transcription, as monitored through luciferase reporter gene signal activity using an endogenous
full-length ERalpha cell line of human breast tissue (VM7) that was stably transfected with an estrogen-
responsive luciferase reporter gene plasmid (pGudLuc7ere). The assay is run in triplicate on 1536-well
microplate and bioluminescence was measured following 24 hour incubation of cells with test compounds and

-------
30 min incubation of test system with ONE-GloTM luciferase assay reagent. The bioluminescent signal was
monitored using a Promega ViewLux plate reader To measure antagonistic activity, this assay is performed with
small amounts of an ER alpha agonist (beta-estradiol, E2) added to each well and each compound is evaluated
against a known ER alpha antagonist (4-Hydroxytamoxifen) as a positive control (100 percent inhibition). Test
compounds were assayed for cytotoxicity by measuring protease activity with Promega CellTiter-Fluor with
tetraoctylammonium bromide as a positive control for cell death. Compound auto-fluorescence was monitored
in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Estrogen receptor (ER), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. Endocrine disrupting chemicals
(EDCs) and their interactions with steroid hormone receptors like ER causes disruption of normal endocrine
function. Therefore, it is important to understand the effect of environmental chemicals on the ER signaling
pathway. To identify ER agonists, VM7-Luc-4E2 cell line (provided by Dr. Michael Denison from University of
California) has been used to screen the Tox21 library of diverse environmental compounds. VM7-Luc-4E2 cell
line endogenously expresses full-length ER-alpha and is stably transfected with a plasmid containing four
estrogen responsive elements (ERE) upstream of a luciferase reporter gene. The ERalpha_LUC_VM7 assays are
qHTS format assays which measure the ability of a chemical to interact with estrogen receptor alpha (ERalpha)
by monitoring modulation of fluorescence reporter gene signals. This assay utilized a human breast (VM7-Luc-
4E2) cell line which expresses endogenous full-length ERalpha and a luciferase reporter gene (ER-luc) to quantify
xenoestrogenic activity. This assay is intended for use as a part of an integrated testing strategy, to screen a
large structurally diverse chemical library for compounds with the potential to interact with estrogen receptor
alpha mediated pathways and potentially affect endocrine systems in exposed populations. There is strong
evidence that estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse Outcome Pathway
(AOP) leading to reproductive dysfunction in oviparous vertebrates, and there is some evidence that estrogen
receptor activation is the MIE for putative AOPs leading to reduced survival due to renal failure and leading to
skewed sex ratios due to altered sexual differentiation in males (all AOPs currently under development).
Chemical-activity profiles derived from this assay can inform prioritization decisions for compound selection in
more resource intensive in vivo studies to further investigate the involvement of ER agonism in pathways leading
to hazardous outcomes in biological systems.

2.3	Experimental System: adherent VM7 cell line used. Michigan Cancer Foundation-7 (MCF-7) cells are a breast
cancer cell line originating from a 69-year old woman in 1970 (Lee et al. 2015). This is an immortalized cell line
which endogenously expresses receptors for estrogen (ESR1 and ESR2) and progesterone (Geisinger et al. 1989)
as well as growth factors EGF and IGF-1 (Baldwin et al. 1998), and so provides an alternative to breast cell lines
for estrogen-sensitive proliferation assays. vM7-Luc-4E2 cells are vMCF7 cells which are stably transfected with
plasmid containing four estrogen responsive elements upstream of a luciferase reporter gene (Rogers and
Denison 2000) to measure transactivation activity occurring along estrogen signaling pathways. This was
previously the BGl-Luc estrogen receptor transactivation test method, but BGlLuc4E2 cells are being renamed
VM7Luc4E2 cells because recent DNA analysis (STR) revealed that the original cell line used to generate the
BGlLuc4E2 cells were not human ovarian carcinoma (BG-1) cells but a variant of human breast cancer (MCF7)
cells.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Maintain cells below 85-90% confluence in culture medium. For
assay purpose, the cells should be cultured in assay medium with 10% charcoal stripped FBS for 5 days with

-------
alternate day medium changed to fresh medium. Especially while in assay culture, the cells should not reach
more than 85% confluence as they would become harder to detach if they reach over confluence. Cell Thawing
Method: Thaw a frozen vial of cells in 9ml of pre-warmed medium and seed them in T175 flask at 2 million cells.
Cell Proliferation Method: Trypsinize cells from the flask and centrifuge. Add culture medium to the pellet and
passage at 3-4 million per T-225 flask. Assay Protocol: Harvest cells from the 5-day culture in assay medium and
resuspend cells in assay medium. Dispense 4000 cells/4uL/well into 1536-well tissue treated white/solid bottom
plates using an 8 tip dispenser (Multidrop). Incubate the assay plates for 22hrs at 37C and 5% C02. First luL of
lO.OnM (final concentration) ICI-182,780 (ER-Antagonist) or assay buffer was added using two separate tips of
a dispenser (BioRAPTR). Then transfer 23nL of compounds from the library collection and positive control to the
assay plates by using a Pintool station. Incubate the assay plates for 22hrs at 37C and 5% C02. After 21hrs of
incubation, lul of CellTiter-Fluor(TM) Cell Viability Assay reagent was added using a single tip of a dispenser
(BioRAPTR). Incubate the assay plates at 37C and 5% C02 for lhr. Measure fluorescence signal by ViewLux plate
reader (Exposure time = lsec). Then add 4ul of ONE-Glo(TM) Luciferase reagent using a single tip of a dispenser
(BioRAPTR). Incubate the plates at room temperature for 30min.

Baseline median absolute deviation for the assay (bmad): 2.513
Response cutoff threshold used to determine hit calls: 20

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Antagonism of the estrogen receptor (ERa) signaling pathway is measured by bioluminescence
activity via an estrogen-responsive firefly luciferase reporter gene. Decreased luciferase activity can be used to
identify the compounds that inhibit xenoestrogenic ligand-binding and ERalpha antagonism. The cytotoxicity of
the compounds screened was tested in parallel by measuring the cell viability using CellTiter-Fluor assay in the
same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

-------
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root

-------
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	5.963

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	lnf%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

-------
Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 593.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.

Bibliography: Judson RS, Magpantay FM, Chickarmane V, Haskell C, Tania N, Taylor J, Xia M, Huang R, Rotroff
DM, Filer DL, Houck KA, Martin MT, Sipes N, Richard AM, Mansouri K, Setzer RW, Knudsen TB, Crofton KM,
Thomas RS. Integrated Model of Chemical Perturbations of a Biological Pathway Using 18 In Vitro High-

-------
Throughput Screening Assays for the Estrogen Receptor. Toxicol Sci. 2015 Nov;148(l):137-54. doi:
10.1093/toxsci/kfvl68. Epub 2015 Aug 13. PMID: 26272952; PMCID: PMC4635633., Huang R, Sakamuru S,
Martin MT, Reif DM, Judson RS, Houck KA, Casey W, Hsieh JH, Shockley KR, Ceger P, Fostel J, Witt KL, Tong W,
Rotroff DM, Zhao T, Shinn P, Simeonov A, Dix DJ, Austin CP, Kavlock RJ, Tice RR, Xia M. Profiling of the Tox21
10K compound library for agonists and antagonists of the estrogen receptor alpha signaling pathway. Sci Rep.
2014 Jul 11;4:5664. doi: 10.1038/srep05664. PMID: 25012808; PMCID: PMC4092345.

7. Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2055

TOX21_ERR_LUC_Agonist

1.	General Information

1.1	Assay Title: Tox21 ERR-HEK293T Estrogen Related Receptor (ERR) Agonism Luciferase Assay

1.2	Assay Summary: TOX21 ERR LUC is a cell-based, single-readout assay that uses ERR-HEK293T, a human kidney
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-err-pl.
TOX21_ERR_LUC_Agonist is one of one assay component(s) measured or calculated from the
TOX21_ERR_LUC_Agonist assay. It is designed to make measurements of luciferase induction, a form of
inducible reporter, as detected with bioluminescence signals by CellTiter-Glo Luciferase-coupled ATP
quantitation technology. Data from the assay component TOX21_ERR_LUC_Agonist was analyzed into 1 assay
endpoint. This assay endpoint, TOX21_ERR_LUC_Agonist, was analyzed in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene ESRRA.
Furthermore, this assay endpoint can be referred to as a primary readout, because the performed assay has
only produced 1 assay endpoint. To generalize the intended target to other relatable targets, this assay endpoint
is annotated to the nuclear receptor intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected ERR-HEK293T cells are aliquoted into 1536-well
microtiter plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission
resulting from xenobiotic ERR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ERR_LUC_Agonist was designed to measure changes to bioluminescence signals produced
from an enzymatic reaction involving the key substrate [One-Glo], Changes are indicative of changes in
transcriptional gene expression due to agonist activity regulated by the human estrogen-related receptor alpha
[GeneSymbol:ESRRA | GenelD:2101|.

The Tox21 estrogen related receptor HEK293T agonism luciferase assay serves to screen and to identify
environmental compounds that perturb the ERR signaling pathways could provide valuable information for
potential therapeutic or preventive measures in the treatment of metabolic disease. To differentiate true ERR

-------
antagonists or agonist from cytotoxic substances, the assay is multiplexed with cell viability assay. The Tox21
estrogen related receptor agonism assay screened a library of diverse environmental compounds to probe for
xenoestrogenic ligand-binding and potential to suppress estrogen-dependent transcription, monitored through
changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase between the
key substrate [One-Glo], Changes are indicative of changes in transcriptional gene expression due to agonist
activity regulated by the human estrogen-related receptor alpha. Each well contained 0.5nM beta-estradiol as
an ER stimulator and measured the gain-of-signal as compared to positive control of Genistein (100 percent
inhibition). The assay is run in triplicate on 1536-well microplates. After the assay plates were incubated at a
37C/5% C02 incubator for 6 hours, 23 nL of compounds dissolved in DMSO, positive and negative controls or
DMSO only was transferred to the assay plate by a pin tool. The plates were incubated at 37C for 18 hours. 4
ul/well of One-Glo reagent was added into the assay plates using a Flying Reagent Dispenser. After 30-minute
luminescence signals are monitored using an ViewLux plate reader and CellTiter-Glo assay reagent (Promega) is
also incubated with test system for 30 minutes before readout to detect cell viability. Compound auto-
fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for
background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Estrogen related receptors (ERRs), the first orphan nuclear
receptors discovered, play an important role in the control of cellular energy metabolism. Estrogen-related
receptor alpha (ERR-alpha) is involved in maintaining energy homeostasis in response to environmental cues. It
does so by regulating a broad range of genes important in metabolism, such as those that encode enzymes that
function in the glycolytic pathway, the tricarboxylic cycle, oxidative phosphorylation, lipid metabolism,
mitochondrial functions, and biogenesis. Dysregulation can lead to metabolic syndrome, obesity, and diabetes.
ERRs are required for high-energy production in response to the environmental challenges. ERR-alpha was also
identified as an adverse marker for breast cancer progression; ERR-alpha-positive tumors have a poor prognosis.

2.3	Experimental System: adherent ERR-HEK293T cell line used. The ERR-HEK293 line is a stably transfected cell line
containing an intact ERR-alpha signaling pathway. ERR-alpha multiple hormone response element (MHRE)
reporters were introduced into HEK293 cells that express endogenous ERR-alpha. The HEK-293 cell line is a
human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5 DNAby Frank
Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5 kilobases from the
viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic characterization
established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of
growth and transfection cells and are frequently used to produce exogenous proteins or viruses for
pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Maintain cells below 80-85% confluence in culture medium. The
cells are not to be left in Trypsin for more than 5 min at room temperature. Cell Thawing Method: Place 14 mL
of pre-warmed thaw medium into a 15 mL conical tube. Remove the vial of cells to be thawed from liquid
nitrogen and thaw rapidly by placing at 37C in a water bath. Mix the entire content of the vial to 14 ml of pre-
warmed medium and centrifuge to remove DMSO. Discard the supernatant and reconstitute the pellet using
pre-warmed thaw medium. Transfer the cells to T75 flask Cell Proliferation Method: Detach the cells from the
flask using Trypsin-EDTA (0.05%). The cells in growth medium are re-seeded in T225 flask. Assay Protocol:
Harvest cells and re-suspend in assay medium and adjust the required cell density. Dispense 2500 cells/5 uL/well
into 1536 well tissue treated white plates using a Multidrop dispenser. Incubate the plates at a 37C, 5% C02

-------
incubator for 6 h. Transfer 23nL of compounds and positive control to the assay plate by a pin tool. Incubate the
assay plates at 37C, 5% C02 for 17.5 h. Add 1 uL of CellTiter-Fluor regent using a BioRAPTR. Incubate at 37C, 5%
C02 for 17.5 h. Read the fluorescence intensity in ViewLux plate reader. Add 4uL of OneGlo and incubate the
plates at room temperature for 0.5 h. Read the luminescence intensity in ViewLux plate reader.

Baseline median absolute deviation for the assay (bmad): 2.205
Response cutoff threshold used to determine hit calls: 20

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Agonism of the estrogen-related receptor (ERR) signaling pathway is measured by bioluminescence
activity via an estrogen-related-receptor firefly luciferase reporter gene. Decreased luciferase activity can be
used to identify the compounds that inhibit xenoestrogenic ligand-binding and ERR agonism. The cytotoxicity of
the compounds screened was tested in parallel by measuring the cell viability using CellTiter-Fluor assay in the
same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Increased luminescence (gain-of-signal) was measured relative to Genistein (positive control)
signal, using DMSO (neutral control) as a baseline for luciferase induction. Response was reported as a percent
of positive control activity.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

Genistein

I arget (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

NA

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest

-------
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	0.139

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

0

2.698
lnf%

15.543
113.713

NA

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 782.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2057

TOX21_ERR_LUC_Antagonist

1.	General Information

1.1	Assay Title: Tox21 ERR-HEK293T Estrogen Related Receptor (ERR) Antagonism Luciferase Assay

1.2	Assay Summary: TOX21 ERR LUC is a cell-based, single-readout assay that uses ERR-HEK293T, a human kidney
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-err-pl.
TOX21_ERR_LUC_Antagonist is one of one assay component(s) measured or calculated from the
TOX21_ERR_LUC_Agonist assay. It is designed to make measurements of luciferase induction, a form of
inducible reporter, as detected with bioluminescence signals by CellTiter-Glo Luciferase-coupled ATP
quantitation technology. Data from the assay component TOX21_ERR_LUC_Antagonist was analyzed into 1
assay endpoint. This assay endpoint, TOX21_ERR_LUC_Antagonist, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter,
loss-of-signal activity can be used to understand changes in the reporter gene as they relate to the gene ESRRA.
Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has produced
multiple assay endpoints where this one serves a reporter gene function. To generalize the intended target to
other relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected ERR-HEK293T cells are aliquoted into 1536-well
microtiter plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission
resulting from xenobiotic ERR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ERR_LUC_Antagonist was designed to measure changes to bioluminescence signals
produced from an enzymatic reaction involving the key substrate [One-Glo], Changes are indicative of changes
in transcriptional gene expression due to agonist activity regulated by the human estrogen-related receptor
alpha [GeneSymbol:ESRRA | GenelD:2101|.

The Tox21 estrogen related receptor HEK293T antagonism luciferase assay serves to screen and to identify
environmental compounds that perturb the ERR signaling pathways could provide valuable information for
potential therapeutic or preventive measures in the treatment of metabolic disease. To differentiate true ERR

-------
antagonists or agonist from cytotoxic substances, the assay is multiplexed with cell viability assay. The Tox21
estrogen related receptor agonism assay screened a library of diverse environmental compounds to probe for
xenoestrogenic ligand-binding and potential to suppress estrogen-dependent transcription, monitored through
changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase between the
key substrate [One-Glo], Changes are indicative of changes in transcriptional gene expression due to agonist
activity regulated by the human estrogen-related receptor alpha. Each well contained 0.5nM beta-estradiol as
an ER stimulator and measured the loss-of-signal as compared to positive control of XCT790 (100 percent
inhibition). The assay is run in triplicate on 1536-well microplates. After the assay plates were incubated at a
37C/5% C02 incubator for 6 hours, 23 nL of compounds dissolved in DMSO, positive and negative controls or
DMSO only was transferred to the assay plate by a pin tool. The plates were incubated at 37C for 18 hours. 4
ul/well of One-Glo reagent was added into the assay plates using a Flying Reagent Dispenser. After 30-minute
luminescence signals are monitored using an ViewLux plate reader and CellTiter-Glo assay reagent (Promega) is
also incubated with test system for 30 minutes before readout to detect cell viability. Compound auto-
fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for
background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Estrogen related receptors (ERRs), the first orphan nuclear
receptors discovered, play an important role in the control of cellular energy metabolism. Estrogen-related
receptor alpha (ERR-alpha) is involved in maintaining energy homeostasis in response to environmental cues. It
does so by regulating a broad range of genes important in metabolism, such as those that encode enzymes that
function in the glycolytic pathway, the tricarboxylic cycle, oxidative phosphorylation, lipid metabolism,
mitochondrial functions, and biogenesis. Dysregulation can lead to metabolic syndrome, obesity, and diabetes.
ERRs are required for high-energy production in response to the environmental challenges. ERR-alpha was also
identified as an adverse marker for breast cancer progression; ERR-alpha-positive tumors have a poor prognosis.

2.3	Experimental System: adherent ERR-HEK293T cell line used. The ERR-HEK293 line is a stably transfected cell line
containing an intact ERR-alpha signaling pathway. ERR-alpha multiple hormone response element (MHRE)
reporters were introduced into HEK293 cells that express endogenous ERR-alpha. The HEK-293 cell line is a
human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5 DNAby Frank
Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5 kilobases from the
viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic characterization
established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of
growth and transfection cells and are frequently used to produce exogenous proteins or viruses for
pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Maintain cells below 80-85% confluence in culture medium. The
cells are not to be left in Trypsin for more than 5 min at room temperature. Cell Thawing Method: Place 14 mL
of pre-warmed thaw medium into a 15 mL conical tube. Remove the vial of cells to be thawed from liquid
nitrogen and thaw rapidly by placing at 37C in a water bath. Mix the entire content of the vial to 14 ml of pre-
warmed medium and centrifuge to remove DMSO. Discard the supernatant and reconstitute the pellet using
pre-warmed thaw medium. Transfer the cells to T75 flask Cell Proliferation Method: Detach the cells from the
flask using Trypsin-EDTA (0.05%). The cells in growth medium are re-seeded in T225 flask. Assay Protocol:
Harvest cells and re-suspend in assay medium and adjust the required cell density. Dispense 2500 cells/5 uL/well
into 1536 well tissue treated white plates using a Multidrop dispenser. Incubate the plates at a 37C, 5% C02

-------
incubator for 6 h. Transfer 23nL of compounds and positive control to the assay plate by a pin tool. Incubate the
assay plates at 37C, 5% C02 for 17.5 h. Add 1 uL of CellTiter-Fluor regent using a BioRAPTR. Incubate at 37C, 5%
C02 for 17.5 h. Read the fluorescence intensity in ViewLux plate reader. Add 4uL of OneGlo and incubate the
plates at room temperature for 0.5 h. Read the luminescence intensity in ViewLux plate reader.

Baseline median absolute deviation for the assay (bmad): 3.522
Response cutoff threshold used to determine hit calls: 21.131

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Antagonism of the estrogen-related receptor (ERR) signaling pathway is measured by
bioluminescence activity via an estrogen-related-receptor firefly luciferase reporter gene. Decreased luciferase
activity can be used to identify the compounds that inhibit xenoestrogenic ligand-binding and ERR agonism. The
cytotoxicity of the compounds screened was tested in parallel by measuring the cell viability using CellTiter-
Fluor assay in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Decreased luminescence (loss-of-signal) was measured relative to XCT790 (positive control)
signal, using DMSO (neutral control) as a baseline for luciferase induction. Response was reported as a percent
of positive control activity.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

XCT790

I arget (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

NA

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest

-------
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	0.139

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

0

4.241
lnf%

26.897
186.31

NA

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 871.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2059

TOX21_ERR_LUC_viability

1.	General Information

1.1	Assay Title: Viability Assessment in theTox21 ERR-HEK293T Estrogen Related Receptor (ERR) Agonism Luciferase
Assay

1.2	Assay Summary: TOX21 ERR LUC is a cell-based, single-readout assay that uses ERR-HEK293T, a human kidney
cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-err-pl.
TOX21_ERR_LUC_viability is an assay readout measuring cellular ATP content and detected with CellTiter-Glo
Luciferase-coupled ATP quantitation. TOX21_ERR_LUC_viability used a type of viability reporter where loss-of-
signal activity can be used to understand changes in the cell viability. Furthermore, this assay endpoint can be
referred to as a secondary readout, because this assay has produced multiple assay endpoints where this one
serves a viability function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected ERR-HEK293T cells are aliquoted into 1536-well
microtiter plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission
resulting from xenobiotic ERR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 estrogen related receptor HEK293T agonism luciferase assay serves to screen and to identify
environmental compounds that perturb the ERR signaling pathways could provide valuable information for
potential therapeutic or preventive measures in the treatment of metabolic disease. To differentiate true ERR
antagonists or agonist from cytotoxic substances, the assay is multiplexed with cell viability assay. The Tox21
estrogen related receptor agonism assay screened a library of diverse environmental compounds to probe for
xenoestrogenic ligand-binding and potential to suppress estrogen-dependent transcription, monitored through
changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase between the

-------
key substrate [One-Glo], Changes are indicative of changes in transcriptional gene expression due to agonist
activity regulated by the human estrogen-related receptor alpha. Each well contained 0.5nM beta-estradiol as
an ER stimulator and measured the gain-of-signal as compared to positive control of Genistein (100 percent
inhibition). The assay is run in triplicate on 1536-well microplates. After the assay plates were incubated at a
37C/5% C02 incubator for 6 hours, 23 nL of compounds dissolved in DMSO, positive and negative controls or
DMSO only was transferred to the assay plate by a pin tool. The plates were incubated at 37C for 18 hours. 4
ul/well of One-Glo reagent was added into the assay plates using a Flying Reagent Dispenser. After 30-minute
luminescence signals are monitored using an ViewLux plate reader and CellTiter-Glo assay reagent (Promega) is
also incubated with test system for 30 minutes before readout to detect cell viability. Compound auto-
fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering wavelengths to allow for
background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Estrogen related receptors (ERRs), the first orphan nuclear
receptors discovered, play an important role in the control of cellular energy metabolism. Estrogen-related
receptor alpha (ERR-alpha) is involved in maintaining energy homeostasis in response to environmental cues. It
does so by regulating a broad range of genes important in metabolism, such as those that encode enzymes that
function in the glycolytic pathway, the tricarboxylic cycle, oxidative phosphorylation, lipid metabolism,
mitochondrial functions, and biogenesis. Dysregulation can lead to metabolic syndrome, obesity, and diabetes.
ERRs are required for high-energy production in response to the environmental challenges. ERR-alpha was also
identified as an adverse marker for breast cancer progression; ERR-alpha-positive tumors have a poor prognosis.

2.3	Experimental System: adherent ERR-HEK293T cell line used. The ERR-HEK293 line is a stably transfected cell line
containing an intact ERR-alpha signaling pathway. ERR-alpha multiple hormone response element (MHRE)
reporters were introduced into HEK293 cells that express endogenous ERR-alpha. The HEK-293 cell line is a
human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5 DNAby Frank
Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5 kilobases from the
viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic characterization
established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of
growth and transfection cells and are frequently used to produce exogenous proteins or viruses for
pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Maintain cells below 80-85% confluence in culture medium. The
cells are not to be left in Trypsin for more than 5 min at room temperature. Cell Thawing Method: Place 14 mL
of pre-warmed thaw medium into a 15 mL conical tube. Remove the vial of cells to be thawed from liquid
nitrogen and thaw rapidly by placing at 37C in a water bath. Mix the entire content of the vial to 14 ml of pre-
warmed medium and centrifuge to remove DMSO. Discard the supernatant and reconstitute the pellet using
pre-warmed thaw medium. Transfer the cells to T75 flask Cell Proliferation Method: Detach the cells from the
flask using Trypsin-EDTA (0.05%). The cells in growth medium are re-seeded in T225 flask. Assay Protocol:
Harvest cells and re-suspend in assay medium and adjust the required cell density. Dispense 2500 cells/5 uL/well
into 1536 well tissue treated white plates using a Multidrop dispenser. Incubate the plates at a 37C, 5% C02
incubator for 6 h. Transfer 23nL of compounds and positive control to the assay plate by a pin tool. Incubate the
assay plates at 37C, 5% C02 for 17.5 h. Add 1 uL of CellTiter-Fluor regent using a BioRAPTR. Incubate at 37C, 5%
C02 for 17.5 h. Read the fluorescence intensity in ViewLux plate reader. Add 4uL of OneGlo and incubate the
plates at room temperature for 0.5 h. Read the luminescence intensity in ViewLux plate reader.

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 2.254
Response cutoff threshold used to determine hit calls: 20

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Agonism of the estrogen-related receptor (ERR) signaling pathway is measured by bioluminescence
activity via an estrogen-related-receptor firefly luciferase reporter gene. Increased luciferase activity can be
used to identify the compounds that promote xenoestrogenic ligand-binding and ERR agonism. The cytotoxicity
of the compounds screened was tested in parallel by measuring the cell viability using CellTiter-Fluor assay in
the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where

-------
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

0

3.484
lnf%

NA
NA

NA

4.2 Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

-------
4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 633.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2113

TOX21_ERb_BlA_Agonist_chl

1.	General Information

1.1	Assay Title: Tox21 HEK293T Estrogen Receptor-beta (ESR2) Agonism Beta-lactamase Assay, Channel 1 Readout
of Uncleaved Substrate

1.2	Assay Summary: TOX21_ERb_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-erb-
bla-pl. TOX21_ERb_BLA_Agonist_chl is an assay readout measuring reporter gene via receptor activity and
designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter gene.
The signal is derived from the uncleaved reporter gene substrate and is used to calculate the ratio of cleaved
(ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay component
TOX21_ERb_BLA_Agonist_chl was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_ERb_BLA_Agonist_chl, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene ESR2. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic ERbeta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ERb_BLA_Agonist_chl was designed to measure uncleaved reporter gene substrate to
target nuclear receptor activity at the protein (receptor) level, specifically mapping to ESR2 gene(s) using a
positive control of 17b-estradiol

The Tox21 estrogen receptor-beta beta-lactamase agonism viability assay screened a library of diverse
environmental compounds to probe for xenoestrogenic ligand-binding and potential to suppress estrogen-
dependent transcription, monitored through changes to bioluminescence signals produced from an enzymatic

-------
reaction catalyzed by luciferase between the key substrate [CellTiter-Glo] and the target cofactor [ATP], Each
well contained 0.5nM -beta-estradiol as an ER stimulator and measured the loss-of-signal as compared to
positive control of tetraoctylammonium bromide (100 percent inhibition). The assay is run in triplicate on 1536-
well microplates. Following 18 hour incubation of cells with test compounds, a membrane-permeable FRET-
based substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap
the negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of
blue (product) to green (substrate) fluorescence. Fluorescence signals are monitored using an Envision plate
reader and CellTiter-Glo assay reagent (Promega) is also incubated with test system for 30 minutes before
readout to detect cell viability. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer ER beta-UAS-bla GripTite cells contain the
ligand-binding domain of the human estrogen receptor beta (ERbeta) fused to the DNA-binding domain of GAL4
stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). The HEK-293 cell line is a human embryonic
kidney cell line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973
(Graham et al. 1977). The transformation incorporated approximately 4.5 kilobases from the viral genome into
human chromosome 19 of the HEK cells, and subsequent cytogenetic characterization established that the 293
line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of growth and transfection
cells and are frequently used to produce exogenous proteins or viruses for pharmaceutical and biomedical
research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: ER-beta-UAS-bla HEK293T cells were dispensed at 2,000 cells/5 uL of assay medium
containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a Multidrop
Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a 37C and
5% C02 for 6 hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred to the
assay plate by a Pintool station (Kalypsys, San Diego, CA). The plates were incubated at 37C and 5% C02 for 16
hr. Then 1 uL of LiveBLAzerTM B/G FRET substrate was added using a BioRAPTR FRD (Flying Reagent Dispenser,
Aurora Discovery, San Diego, CA). After the plates were incubated at room temperature for 2 hours,
fluorescence intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

17b-Estradiol

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 4.446
Response cutoff threshold used to determine hit calls: 26.674
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6 Response: Estrogen receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Increased luminescence (gain-of-signal) was measured relative to 17beta-estradiol (positive
control) signal, using DMSO (neutral control) as a baseline for luciferase induction. Response was reported as a
percent of positive control activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

-------
2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
156

Inactive hit count: 0
-------
quadratic-polynomialfpoly2) model: 720

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

481

4

235

2535

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

13.565

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	lnf%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-21.728

Positive control well median absolute deviation, by plate: pmad	13.186

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.179

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 481.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2114

TOX21_ERb_BlA_Agonist_ch2

1.	General Information

1.1	Assay Title: Tox21 HEK293T Estrogen Receptor-beta (ESR2) Agonism Beta-lactamase Assay, Channel 2 Readout
of Cleaved Substrate

1.2	Assay Summary: TOX21_ERb_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-erb-
bla-pl. TOX21_ERb_BLA_Agonist_ch2 is an assay readout measuring reporter gene via receptor activity and
designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter gene.
The signal is derived from the cleaved reporter gene substrate and is used to calculate the ratio of cleaved (ch2)
to uncleaved (chl) substrate used as the measure of target activity. Data from the assay component
TOX21_ERb_BLA_Agonist_ch2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_ERb_BLA_Agonist_ch2, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene ESR2. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic ERbeta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ERb_BLA_Agonist_ch2 was designed to measure cleaved reporter gene substrate to target
nuclear receptor activity at the protein (receptor) level, specifically mapping to ESR2 gene(s) using a positive
control of 17b-estradiol

The Tox21 estrogen receptor-beta beta-lactamase agonism viability assay screened a library of diverse
environmental compounds to probe for xenoestrogenic ligand-binding and potential to suppress estrogen-
dependent transcription, monitored through changes to bioluminescence signals produced from an enzymatic

-------
reaction catalyzed by luciferase between the key substrate [CellTiter-Glo] and the target cofactor [ATP], Each
well contained 0.5nM -beta-estradiol as an ER stimulator and measured the loss-of-signal as compared to
positive control of tetraoctylammonium bromide (100 percent inhibition). The assay is run in triplicate on 1536-
well microplates. Following 18 hour incubation of cells with test compounds, a membrane-permeable FRET-
based substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap
the negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of
blue (product) to green (substrate) fluorescence. Fluorescence signals are monitored using an Envision plate
reader and CellTiter-Glo assay reagent (Promega) is also incubated with test system for 30 minutes before
readout to detect cell viability. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer ER beta-UAS-bla GripTite cells contain the
ligand-binding domain of the human estrogen receptor beta (ERbeta) fused to the DNA-binding domain of GAL4
stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). The HEK-293 cell line is a human embryonic
kidney cell line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973
(Graham et al. 1977). The transformation incorporated approximately 4.5 kilobases from the viral genome into
human chromosome 19 of the HEK cells, and subsequent cytogenetic characterization established that the 293
line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of growth and transfection
cells and are frequently used to produce exogenous proteins or viruses for pharmaceutical and biomedical
research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: ER-beta-UAS-bla HEK293T cells were dispensed at 2,000 cells/5 uL of assay medium
containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a Multidrop
Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a 37C and
5% C02 for 6 hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred to the
assay plate by a Pintool station (Kalypsys, San Diego, CA). The plates were incubated at 37C and 5% C02 for 16
hr. Then 1 uL of LiveBLAzerTM B/G FRET substrate was added using a BioRAPTR FRD (Flying Reagent Dispenser,
Aurora Discovery, San Diego, CA). After the plates were incubated at room temperature for 2 hours,
fluorescence intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

17b-Estradiol

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 1.201

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6 Response: Estrogen receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Increased luminescence (gain-of-signal) was measured relative to 17beta-estradiol (positive
control) signal, using DMSO (neutral control) as a baseline for luciferase induction. Response was reported as a
percent of positive control activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

-------
2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
235

Inactive hit count: 0
-------
quadratic-polynomial(poly2) model:	1219

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

356

9

2031

618

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0

Neutral control median absolute deviation, by plate: nmad

2.942

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	lnf%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	100

Positive control well median absolute deviation, by plate: pmad	25.228

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	3.919

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 618.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2115

TOX21_ERb_BLA_Agonist_ratio

1.	General Information

1.1	Assay Title: Tox21 HEK293T Estrogen Receptor-beta (ESR2) Agonism Beta-lactamase Assay, Ratio

1.2	Assay Summary: TOX21_ERb_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-erb-
bla-pl. TOX21_ERb_BLA_Agonist_ratio is an assay readout measuring reporter gene via receptor activity and
designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter gene.
The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) reporter gene substrate used as the
measure of target activity. Data from the assay component TOX21_ERb_BLA_Agonist_ratio was analyzed into
1 assay endpoint. This assay endpoint, TOX21_ERb_BLA_Agonist_ratio, was analyzed in the positive analysis
fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, gain-of-signal activity can be used to understand changes in the reporter gene as they relate to the
gene ESR2. Furthermore, this assay endpoint can be referred to as a primary readout, because this assay has
produced multiple assay endpoints and this ratio serves a reporter gene function to understand target activity.
To generalize the intended target to other relatable targets, this assay endpoint is annotated to the nuclear
receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic ERbeta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ERb_BLA_Agonist_ch2 was designed to target nuclear receptor activity at the protein
(receptor) level, specifically mapping to ESR2 gene(s) using a positive control of 17b-estradiol

The Tox21 estrogen receptor-beta beta-lactamase agonism viability assay screened a library of diverse
environmental compounds to probe for xenoestrogenic ligand-binding and potential to suppress estrogen-
dependent transcription, monitored through changes to bioluminescence signals produced from an enzymatic
reaction catalyzed by luciferase between the key substrate [CellTiter-Glo] and the target cofactor [ATP], Each

-------
well contained 0.5nM -beta-estradiol as an ER stimulator and measured the loss-of-signal as compared to
positive control of tetraoctylammonium bromide (100 percent inhibition). The assay is run in triplicate on 1536-
well microplates. Following 18 hour incubation of cells with test compounds, a membrane-permeable FRET-
based substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap
the negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of
blue (product) to green (substrate) fluorescence. Fluorescence signals are monitored using an Envision plate
reader and CellTiter-Glo assay reagent (Promega) is also incubated with test system for 30 minutes before
readout to detect cell viability. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Estrogen receptor (ER), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. Endocrine disrupting chemicals
(EDCs) and their interactions with steroid hormone receptors like ER causes disruption of normal endocrine
function. Therefore, it is important to understand the effect of environmental chemicals on the ER signaling
pathway. To identify ER antagonists, GeneBLAzer ERbeta-UAS-bla GripTite cell line (Invitrogen) has been used
to screen the Tox21 library of diverse environmental compounds. ERbeta-UAS-bla cell line expresses a partial
ERbeta one-hybrid GAL4 system and is stably transfected with a beta-lactamase reporter gene. The Tox21
ERbeta bla assays are qHTS format assays which measured the ability of a chemical to inhibit estrogen receptor
alpha (ERbeta) signaling pathways by monitoring modulation of fluorescence reporter gene signals. This assay
utilized a human embryonic kidney cell line (HEK293T) which expresses a partial ERbeta and a one-hybrid GAL4
system to quantify xenoestrogenic activity. This assay is intended for use as a part of an integrated testing
strategy, to screen a large structurally diverse chemical library for compounds with the potential to interact with
estrogen receptor alpha mediated pathways and potentially affect endocrine systems in exposed populations.
There is strong evidence that estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse
Outcome Pathway (AOP) leading to reproductive dysfunction in oviparous vertebrates, and there is some
evidence that estrogen receptor activation is the MIE for putative AOPs leading to reduced survival due to renal
failure and leading to skewed sex ratios due to altered sexual differentiation in males (all AOPs currently under
development). Chemical-activity profiles derived from this assay can inform prioritization decisions for
compound selection in more resource intensive in vivo studies to further investigate the involvement of ER
agonism in pathways leading to hazardous outcomes in biological systems.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer ER beta-UAS-bla GripTite cells contain the
ligand-binding domain of the human estrogen receptor beta (ERbeta) fused to the DNA-binding domain of GAL4
stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). The HEK-293 cell line is a human embryonic
kidney cell line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973
(Graham et al. 1977). The transformation incorporated approximately 4.5 kilobases from the viral genome into
human chromosome 19 of the HEK cells, and subsequent cytogenetic characterization established that the 293
line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of growth and transfection
cells and are frequently used to produce exogenous proteins or viruses for pharmaceutical and biomedical
research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

-------
2.5 Exposure Regime: ER-beta-UAS-bla HEK293T cells were dispensed at 2,000 cells/5 uL of assay medium
containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a Multidrop
Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a 37C and
5% C02 for 6 hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred to the
assay plate by a Pintool station (Kalypsys, San Diego, CA). The plates were incubated at 37C and 5% C02 for 16
hr. Then 1 uL of LiveBLAzerTM B/G FRET substrate was added using a BioRAPTR FRD (Flying Reagent Dispenser,
Aurora Discovery, San Diego, CA). After the plates were incubated at room temperature for 2 hours,
fluorescence intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT).

Baseline median absolute deviation for the assay (bmad): 0.952

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Estrogen receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

17b-Estradiol

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

NA

-------
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
17beta-estradiol was used as a positive ERb agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50

-------
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	2.332

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	lnf%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	100

Positive control well median absolute deviation, by plate: pmad	4.737

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	17.885

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

-------
Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 646.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.

Supporting Information:

-------
More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2116

T0X2 l_ERb_B LA_Agon ist_via bi lity

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HEK293T Estrogen Receptor-beta (ESR2) Agonism Beta-lactamase
Assay

1.2	Assay Summary: TOX21_ERb_BLA_Agonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-erb-
bla-pl. TOX21_ERb_BLA_Agonist_viability is an assay readout measuring cellular ATP content and detected with
CellTiter-Glo Luciferase-coupled ATP quantitation. TOX21_ERb_BLA_Agonist_viability used a type of viability
reporter where loss-of-signal activity can be used to understand changes in the cell viability. Furthermore, this
assay endpoint can be referred to as a secondary readout, because this assay has produced multiple assay
endpoints where this one serves a viability function. To generalize the intended target to other relatable targets,
this assay endpoint is annotated to the cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic ERbeta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 estrogen receptor-beta beta-lactamase agonism viability assay screened a library of diverse
environmental compounds to probe for xenoestrogenic ligand-binding and potential to suppress estrogen-
dependent transcription, monitored through changes to bioluminescence signals produced from an enzymatic
reaction catalyzed by luciferase between the key substrate [CellTiter-Glo] and the target cofactor [ATP], Each
well contained 0.5nM -beta-estradiol as an ER stimulator and measured the loss-of-signal as compared to
positive control of tetraoctylammonium bromide (100 percent inhibition). The assay is run in triplicate on 1536-
well microplates. Following 18 hour incubation of cells with test compounds, a membrane-permeable FRET-

-------
based substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases trap
the negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of
blue (product) to green (substrate) fluorescence. Fluorescence signals are monitored using an Envision plate
reader and CellTiter-Glo assay reagent (Promega) is also incubated with test system for 30 minutes before
readout to detect cell viability. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Estrogen receptor (ER), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. Endocrine disrupting chemicals
(EDCs) and their interactions with steroid hormone receptors like ER causes disruption of normal endocrine
function. Therefore, it is important to understand the effect of environmental chemicals on the ER signaling
pathway. To identify ER antagonists, GeneBLAzer ERbeta-UAS-bla GripTite cell line (Invitrogen) has been used
to screen the Tox21 library of diverse environmental compounds. ERbeta-UAS-bla cell line expresses a partial
ERbeta one-hybrid GAL4 system and is stably transfected with a beta-lactamase reporter gene. The Tox21
ERbeta bla assays are qHTS format assays which measured the ability of a chemical to inhibit estrogen receptor
alpha (ERbeta) signaling pathways by monitoring modulation of fluorescence reporter gene signals. This assay
utilized a human embryonic kidney cell line (HEK293T) which expresses a partial ERbeta and a one-hybrid GAL4
system to quantify xenoestrogenic activity. This assay is intended for use as a part of an integrated testing
strategy, to screen a large structurally diverse chemical library for compounds with the potential to interact with
estrogen receptor alpha mediated pathways and potentially affect endocrine systems in exposed populations.
There is strong evidence that estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse
Outcome Pathway (AOP) leading to reproductive dysfunction in oviparous vertebrates, and there is some
evidence that estrogen receptor activation is the MIE for putative AOPs leading to reduced survival due to renal
failure and leading to skewed sex ratios due to altered sexual differentiation in males (all AOPs currently under
development). Chemical-activity profiles derived from this assay can inform prioritization decisions for
compound selection in more resource intensive in vivo studies to further investigate the involvement of ER
agonism in pathways leading to hazardous outcomes in biological systems.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer ER beta-UAS-bla GripTite cells contain the
ligand-binding domain of the human estrogen receptor beta (ERbeta) fused to the DNA-binding domain of GAL4
stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). The HEK-293 cell line is a human embryonic
kidney cell line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973
(Graham et al. 1977). The transformation incorporated approximately 4.5 kilobases from the viral genome into
human chromosome 19 of the HEK cells, and subsequent cytogenetic characterization established that the 293
line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of growth and transfection
cells and are frequently used to produce exogenous proteins or viruses for pharmaceutical and biomedical
research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: ER-beta-UAS-bla HEK293T cells were dispensed at 2,000 cells/5 uL of assay medium
containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a Multidrop
Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a 37C and

-------
5% C02 for 6 hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred to the
assay plate by a Pintool station (Kalypsys, San Diego, CA). The plates were incubated at 37C and 5% C02 for 16
hr. Then 1 uL of LiveBLAzerTM B/G FRET substrate was added using a BioRAPTR FRD (Flying Reagent Dispenser,
Aurora Discovery, San Diego, CA) and then the plates were incubated at room temperature for 2 hours, and
fluorescence intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT). For cell viability
readout that measures cytotoxicity, 4 uL/well of CellTiter-Glo reagent was added into the assay plates using a
FRD. After 30 min incubation at room temperature, the luminescence intensity in the plates was measured using
a ViewLux (PerkinElmer) plate reader.

Baseline median absolute deviation for the assay (bmad): 3.425
Response cutoff threshold used to determine hit calls: 20.55

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Estrogen receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase gene
expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell viability
using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical

-------
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	0.832

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

0

7.489
lnf%

8.403
6.414

NA

-------
(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 758.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2117

TOX21_ERb_BLA_Antagonist_chl

1.	General Information

1.1	Assay Title: Tox21 HEK293T Estrogen Receptor-beta (ESR2) Antagonism Beta-lactamase Assay, Channel 1
Readout of Uncleaved Substrate

1.2	Assay Summary: TOX21 ERb BLA Antagonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-erb-
bla-antagonist-pl. TOX21_ERb_BLA_Antagonist_chl is an assay readout measuring reporter gene via receptor
activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase
reporter gene. The signal is derived from the uncleaved reporter gene substrate and is used to calculate the
ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay
component TOX21_ERb_BLA_Antagonist_chl was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_ERb_BLA_Antagonist_chl, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, increased activity can be used to
understand changes in the reporter gene as they relate to the gene ESR2. Furthermore, this assay endpoint can
be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic ERbeta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ERb_BLA_Antagonist_chl was designed to measure uncleaved reporter gene substrate to
target nuclear receptor activity at the protein (receptor) level, specifically mapping to ESR2 gene(s) using a
positive control of 4-hydroxytamoxifen

TheTox21 estrogen receptor-beta antagonism beta-lactamase assay screened a library of diverse environmental
compounds to probe for xenoestrogenic ligand-binding and potential to suppress estrogen-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4

-------
system. Each well contained 0.5nM -beta-estradiol as an ER stimulator and measured the loss-of-signal as
compared to positive control of 4-Hydroxytamoxifen (100 percent inhibition). The assay is run in triplicate on
1536-well microplates. Following 18 hour incubation of cells with test compounds, a membrane-permeable
FRET-based substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases
trap the negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio
of blue (product) to green (substrate) fluorescence. Fluorescence signals are monitored using an Envision plate
reader and CellTiter-Glo assay reagent (Promega) is also incubated with test system for 30 minutes before
readout to detect cell viability. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer ER beta-UAS-bla GripTite cells contain the
ligand-binding domain of the human estrogen receptor beta (ERbeta) fused to the DNA-binding domain of GAL4
stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). The HEK-293 cell line is a human embryonic
kidney cell line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973
(Graham et al. 1977). The transformation incorporated approximately 4.5 kilobases from the viral genome into
human chromosome 19 of the HEK cells, and subsequent cytogenetic characterization established that the 293
line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of growth and transfection
cells and are frequently used to produce exogenous proteins or viruses for pharmaceutical and biomedical
research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: ER-beta-UAS-bla HEK293T cells were dispensed at 2,000 cells/4 uL of assay medium
containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a Multidrop
Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a 37C and
5% C02 for 6 hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred to the
assay plate by a Pintool station (Kalypsys, San Diego, CA), followed by addition of 1 uL (InM as final
concentration) of 17-beta-estradiol (E2) in assay medium using a BioRAPTR FRD (Flying Reagent Dispenser,
Aurora Discovery, San Diego, CA). The plates were incubated at 37C and 5% C02 for 16 hr. Then 1 uL of
LiveBLAzerTM B/G FRET substrate was added using a Flying Reagent Dispenser, the plates were incubated at
room temperature for 2 hours, and fluorescence intensity was measured by an Envision plate reader
(PerkinElmer, Shelton, CT).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

4-hydroxytamoxifen

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 4.857
Response cutoff threshold used to determine hit calls: 29.141
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6 Response: Estrogen receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
4-hydroxytamoxifen was used as a positive ERb antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

-------
Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667

Number of chemicals tested: 7871

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
291

Inactive hit count: Oihitc 0.9
7297

WINING MODEL SELECTION

NA hit count: hitc^O
2079

Number of sample-assay endpoints with winning hill model:

229
392

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

4338

386

quadratic-polynomial(poly2) model: 721

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

472

224

2891

14

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	19.902

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	lnf%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	19.045

Positive control well median absolute deviation, by plate: pmad	28.589

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	0.543

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 472.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2118

TOX21_ERb_BLA_Antagonist_ch2

1.	General Information

1.1	Assay Title: Tox21 HEK293T Estrogen Receptor-beta (ESR2) Antagonism Beta-lactamase Assay, Channel 2
Readout of Cleaved Substrate

1.2	Assay Summary: TOX21 ERb BLA Antagonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-erb-
bla-antagonist-pl. TOX21_ERb_BLA_Antagonist_ch2 is an assay readout measuring reporter gene via receptor
activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase
reporter gene. The signal is derived from the cleaved reporter gene substrate and is used to calculate the ratio
of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay
component TOX21_ERb_BLA_Antagonist_ch2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_ERb_BLA_Antagonist_ch2, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene ESR2. Furthermore, this assay endpoint
can be referred to as a secondary readout, because this assay has produced multiple assay endpoints where this
one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic ERbeta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ERb_BLA_Antagonist_ch2 was designed to measure cleaved reporter gene substrate to
target nuclear receptor activity at the protein (receptor) level, specifically mapping to ESR2 gene(s) using a
positive control of 4-hydroxytamoxifen

TheTox21 estrogen receptor-beta antagonism beta-lactamase assay screened a library of diverse environmental
compounds to probe for xenoestrogenic ligand-binding and potential to suppress estrogen-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4

-------
system. Each well contained 0.5nM -beta-estradiol as an ER stimulator and measured the loss-of-signal as
compared to positive control of 4-Hydroxytamoxifen (100 percent inhibition). The assay is run in triplicate on
1536-well microplates. Following 18 hour incubation of cells with test compounds, a membrane-permeable
FRET-based substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases
trap the negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio
of blue (product) to green (substrate) fluorescence. Fluorescence signals are monitored using an Envision plate
reader and CellTiter-Glo assay reagent (Promega) is also incubated with test system for 30 minutes before
readout to detect cell viability. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer ER beta-UAS-bla GripTite cells contain the
ligand-binding domain of the human estrogen receptor beta (ERbeta) fused to the DNA-binding domain of GAL4
stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). The HEK-293 cell line is a human embryonic
kidney cell line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973
(Graham et al. 1977). The transformation incorporated approximately 4.5 kilobases from the viral genome into
human chromosome 19 of the HEK cells, and subsequent cytogenetic characterization established that the 293
line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of growth and transfection
cells and are frequently used to produce exogenous proteins or viruses for pharmaceutical and biomedical
research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: ER-beta-UAS-bla HEK293T cells were dispensed at 2,000 cells/4 uL of assay medium
containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a Multidrop
Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a 37C and
5% C02 for 6 hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred to the
assay plate by a Pintool station (Kalypsys, San Diego, CA), followed by addition of 1 uL (InM as final
concentration) of 17-beta-estradiol (E2) in assay medium using a BioRAPTR FRD (Flying Reagent Dispenser,
Aurora Discovery, San Diego, CA). The plates were incubated at 37C and 5% C02 for 16 hr. Then 1 uL of
LiveBLAzerTM B/G FRET substrate was added using a Flying Reagent Dispenser, the plates were incubated at
room temperature for 2 hours, and fluorescence intensity was measured by an Envision plate reader
(PerkinElmer, Shelton, CT).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

4-hydroxytamoxifen

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 6.097
Response cutoff threshold used to determine hit calls: 36.582
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6 Response: Estrogen receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
4-hydroxytamoxifen was used as a positive ERb antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

-------
Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667

Number of chemicals tested: 7871

-------
ACTIVITY HIT CALLS

Active hit count: hitc>0.9
971

Inactive hit count: Oihitc 0.9
5101

WINING MODEL SELECTION

NA hit count: hitc^O
3595

Number of sample-assay endpoints with winning hill model:

371
409

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

4434

587

quadratic-polynomialfpoly2) model: 947

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

332

668

18

1901

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	20.222

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	lnf%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-100

Positive control well median absolute deviation, by plate: pmad	5.875

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-4.74

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 668.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-

-------
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2119

TOX21_ERb_BLA_Antagonist_ratio

1.	General Information

1.1	Assay Title: Tox21 HEK293T Estrogen Receptor-beta (ESR2) Antagonism Beta-lactamase Assay, Ratio

1.2	Assay Summary: TOX21 ERb BLA Antagonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-erb-
bla-antagonist-pl. TOX21_ERb_BLA_Antagonist_ratio is an assay readout measuring reporter gene via receptor
activity and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase
reporter gene. The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) reporter gene substrate
used as the measure of target activity. Data from the assay component TOX21_ERb_BLA_Antagonist_ratio was
analyzed into 1 assay endpoint. This assay endpoint, TOX21_ERb_BLA_Antagonist_ratio, was analyzed in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of inducible reporter, loss-of-signal activity can be used to understand changes in the reporter gene as they
relate to the gene ESR2. Furthermore, this assay endpoint can be referred to as a primary readout, because this
assay has produced multiple assay endpoints and this ratio serves a reporter gene function to understand target
activity. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
nuclear receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic ERbeta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ERb_BLA_Antagonist_ratio was designed to target nuclear receptor activity at the protein
(receptor) level, specifically mapping to ESR2 gene(s) using a positive control of 4-hydroxytamoxifen

TheTox21 estrogen receptor-beta antagonism beta-lactamase assay screened a library of diverse environmental
compounds to probe for xenoestrogenic ligand-binding and potential to suppress estrogen-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. Each well contained 0.5nM -beta-estradiol as an ER stimulator and measured the loss-of-signal as

-------
compared to positive control of 4-Hydroxytamoxifen (100 percent inhibition). The assay is run in triplicate on
1536-well microplates. Following 18 hour incubation of cells with test compounds, a membrane-permeable
FRET-based substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases
trap the negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio
of blue (product) to green (substrate) fluorescence. Fluorescence signals are monitored using an Envision plate
reader and CellTiter-Glo assay reagent (Promega) is also incubated with test system for 30 minutes before
readout to detect cell viability. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Estrogen receptor (ER), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. Endocrine disrupting chemicals
(EDCs) and their interactions with steroid hormone receptors like ER causes disruption of normal endocrine
function. Therefore, it is important to understand the effect of environmental chemicals on the ER signaling
pathway. To identify ER antagonists, GeneBLAzer ERbeta-UAS-bla GripTite cell line (Invitrogen) has been used
to screen the Tox21 library of diverse environmental compounds. ERbeta-UAS-bla cell line expresses a partial
ERbeta one-hybrid GAL4 system and is stably transfected with a beta-lactamase reporter gene. The Tox21
ERbeta bla assays are qHTS format assays which measured the ability of a chemical to inhibit estrogen receptor
alpha (ERbeta) signaling pathways by monitoring modulation of fluorescence reporter gene signals. This assay
utilized a human embryonic kidney cell line (HEK293T) which expresses a partial ERbeta and a one-hybrid GAL4
system to quantify xenoestrogenic activity. This assay is intended for use as a part of an integrated testing
strategy, to screen a large structurally diverse chemical library for compounds with the potential to interact with
estrogen receptor alpha mediated pathways and potentially affect endocrine systems in exposed populations.
There is strong evidence that estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse
Outcome Pathway (AOP) leading to reproductive dysfunction in oviparous vertebrates, and there is some
evidence that estrogen receptor activation is the MIE for putative AOPs leading to reduced survival due to renal
failure and leading to skewed sex ratios due to altered sexual differentiation in males (all AOPs currently under
development). Chemical-activity profiles derived from this assay can inform prioritization decisions for
compound selection in more resource intensive in vivo studies to further investigate the involvement of ER
agonism in pathways leading to hazardous outcomes in biological systems.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer ER beta-UAS-bla GripTite cells contain the
ligand-binding domain of the human estrogen receptor beta (ERbeta) fused to the DNA-binding domain of GAL4
stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). The HEK-293 cell line is a human embryonic
kidney cell line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973
(Graham et al. 1977). The transformation incorporated approximately 4.5 kilobases from the viral genome into
human chromosome 19 of the HEK cells, and subsequent cytogenetic characterization established that the 293
line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of growth and transfection
cells and are frequently used to produce exogenous proteins or viruses for pharmaceutical and biomedical
research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

-------
2.5 Exposure Regime: ER-beta-UAS-bla HEK293T cells were dispensed at 2,000 cells/4 uL of assay medium
containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a Multidrop
Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a 37C and
5% C02 for 6 hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred to the
assay plate by a Pintool station (Kalypsys, San Diego, CA), followed by addition of 1 uL (InM as final
concentration) of 17-beta-estradiol (E2) in assay medium using a BioRAPTR FRD (Flying Reagent Dispenser,
Aurora Discovery, San Diego, CA). The plates were incubated at 37C and 5% C02 for 16 hr. Then 1 uL of
LiveBLAzerTM B/G FRET substrate was added using a Flying Reagent Dispenser, the plates were incubated at
room temperature for 2 hours, and fluorescence intensity was measured by an Envision plate reader
(PerkinElmer, Shelton, CT).

Baseline median absolute deviation for the assay (bmad): 4.042
Response cutoff threshold used to determine hit calls: 24.253
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Estrogen receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

4-hydroxytamoxifen

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

NA

-------
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = | (Vcompound - Vdmso)/(Vpos - Vdmso)] x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
4-hydroxytamoxifen was used as a positive ERb antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where

-------
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	12.416

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	lnf%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-100

Positive control well median absolute deviation, by plate: pmad	1&21

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-6.785

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

-------
Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 875.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.

Bibliography: NA

-------
7. Supporting Information:

More information on the ToxCast program can be found at: httpsi//www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www,epa,gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2120

TOX21_ERb_BLA_Antagonist_viability

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HEK293T Estrogen Receptor-beta (ESR2) Antagonism Beta-
lactamase Assay

1.2	Assay Summary: TOX21 ERb BLA Antagonist is a cell-based, single-readout assay that uses HEK293T, a human
kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. See tox21-erb-
bla-antagonist-pl. TOX21_ERb_BLA_Antagonist_viability is an assay readout measuring cellular ATP content
and detected with CellTiter-Glo Luciferase-coupled ATP quantitation. TOX21_ERb_BLA_Antagonist_viability
used a type of viability reporter where loss-of-signal activity can be used to understand changes in the cell
viability. Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has
produced multiple assay endpoints where this one serves a viability function. To generalize the intended target
to other relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the
subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
xenobiotic ERbeta gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

TheTox21 estrogen receptor-beta antagonism beta-lactamase assay screened a library of diverse environmental
compounds to probe for xenoestrogenic ligand-binding and potential to suppress estrogen-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. Each well contained 0.5nM -beta-estradiol as an ER stimulator and measured the loss-of-signal as
compared to positive control of 4-Hydroxytamoxifen (100 percent inhibition). The assay is run in triplicate on
1536-well microplates. Following 18 hour incubation of cells with test compounds, a membrane-permeable

-------
FRET-based substrate CCF4-AM is introduced and incubated for 2 hours. Once in the cell, cytoplasmic esterases
trap the negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio
of blue (product) to green (substrate) fluorescence. Fluorescence signals are monitored using an Envision plate
reader and CellTiter-Glo assay reagent (Promega) is also incubated with test system for 30 minutes before
readout to detect cell viability. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Estrogen receptor (ER), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. Endocrine disrupting chemicals
(EDCs) and their interactions with steroid hormone receptors like ER causes disruption of normal endocrine
function. Therefore, it is important to understand the effect of environmental chemicals on the ER signaling
pathway. To identify ER antagonists, GeneBLAzer ERbeta-UAS-bla GripTite cell line (Invitrogen) has been used
to screen the Tox21 library of diverse environmental compounds. ERbeta-UAS-bla cell line expresses a partial
ERbeta one-hybrid GAL4 system and is stably transfected with a beta-lactamase reporter gene. The Tox21
ERbeta bla assays are qHTS format assays which measured the ability of a chemical to inhibit estrogen receptor
alpha (ERbeta) signaling pathways by monitoring modulation of fluorescence reporter gene signals. This assay
utilized a human embryonic kidney cell line (HEK293T) which expresses a partial ERbeta and a one-hybrid GAL4
system to quantify xenoestrogenic activity. This assay is intended for use as a part of an integrated testing
strategy, to screen a large structurally diverse chemical library for compounds with the potential to interact with
estrogen receptor alpha mediated pathways and potentially affect endocrine systems in exposed populations.
There is strong evidence that estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse
Outcome Pathway (AOP) leading to reproductive dysfunction in oviparous vertebrates, and there is some
evidence that estrogen receptor activation is the MIE for putative AOPs leading to reduced survival due to renal
failure and leading to skewed sex ratios due to altered sexual differentiation in males (all AOPs currently under
development). Chemical-activity profiles derived from this assay can inform prioritization decisions for
compound selection in more resource intensive in vivo studies to further investigate the involvement of ER
agonism in pathways leading to hazardous outcomes in biological systems.

2.3	Experimental System: adherent HEK293T cell line used. GeneBLAzer ER beta-UAS-bla GripTite cells contain the
ligand-binding domain of the human estrogen receptor beta (ERbeta) fused to the DNA-binding domain of GAL4
stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). The HEK-293 cell line is a human embryonic
kidney cell line (of unknown parentage) transformed with sheared adenovirus 5 DNA by Frank Graham in 1973
(Graham et al. 1977). The transformation incorporated approximately 4.5 kilobases from the viral genome into
human chromosome 19 of the HEK cells, and subsequent cytogenetic characterization established that the 293
line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular for their ease of growth and transfection
cells and are frequently used to produce exogenous proteins or viruses for pharmaceutical and biomedical
research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: ER-beta-UAS-bla HEK293T cells were dispensed at 2,000 cells/4 uL of assay medium
containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a Multidrop
Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a 37C and

-------
5% C02 for 6 hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred to the
assay plate by a Pintool station (Kalypsys, San Diego, CA), followed by addition of 1 uL (InM as final
concentration) of 17-beta-estradiol (E2) in assay medium using a Flying Reagent Dispenser (FRD, Aurora
Discovery, San Diego, CA). The plates were incubated at 37C and 5% C02 for 16 hr. Then 1 uL of LiveBLAzer B/G
FRET substrate was added using a Flying Reagent Dispenser, the plates were incubated at room temperature
for 2 hours, and fluorescence intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT).
For cell viability readout that measures cytotoxicity, 4 uL/well of CellTiter-Glo reagent was added into the assay
plates using a FRD. After 30 min incubation at room temperature, the luminescence intensity in the plates was
measured using a ViewLux (PerkinElmer) plate reader.

Baseline median absolute deviation for the assay (bmad): 3.512
Response cutoff threshold used to determine hit calls: 21.07

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Estrogen receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

-------
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested

-------
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	7.567

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	lnf%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

-------
((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 681.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2121

TOX21_PR_BLA_Agonist_chl

1.	General Information

1.1	Assay Title: Tox21 HEK293 Progesterone Receptor (PR) Agonism Beta-lactamase Assay, Channel 1 Readout of
Uncleaved Substrate

1.2	Assay Summary: TOX21 PR BLA Agonist is a cell-based, single-readout assay that uses transfected HEK293T, a
human kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. To
identify the compounds that stimulate PR signaling, a PR-UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T
(Invitrogen, Carlsbad, CA, USA) containing a beta-lactamase reporter gene under the control of an UAS response
element was used to screen Tox21 10K compound library. The cytotoxicity of the Tox21 compound library
against the PR-bla cell line was tested in parallel by measuring the cell viability using CellTiter-Glo assay in the
same wells. See tox21-pr-bla-agonist-pl. TOX21_PR_BLA_Agonist_chl is an assay readout measuring reporter
gene via receptor activity and designed using inducible reporter (beta lactamase induction) detected with GAL4
b-lactamase reporter gene. The signal is derived from the uncleaved reporter gene substrate and is used to
calculate the ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data
from the assay component TOX21_PR_BLA_Agonist_chl was analyzed into 1 assay endpoint. This assay
endpoint, TOX21_PR_BLA_Agonist_chl, was analyzed in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be
used to understand changes in the reporter gene as they relate to the gene NR3C3. Furthermore, this assay
endpoint can be referred to as a secondary readout, because this assay has produced multiple assay endpoints
where this one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Progesterone receptor (PR), a nuclear hormone receptor, plays an important role in development,
metabolic homeostasis and reproduction. It is activated by the steroid hormone progesterone. Endocrine

-------
disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like PR causes disruption of
normal endocrine function.

The Tox21 progesterone receptor (PR) agonism beta-lactamase assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to induce PR-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the assay
and following 16-hour incubation of cells with test compounds a membrane-permeable FRET-based substrate
CCF4-AM is introduced and incubated for an additional hour. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Following CCF4 incubation and detection, luL of CellTiter-Glo reagent is added to each
well and incubated for 30 minutes before cytotoxicity readout is measured on a ViewLux microtiter plate reader.
Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent PR-UAS-bla-HEK293T cell line used. GeneBLAzer PR-UAS-bla GripTite cells
contain the ligand-binding domain of the human progesterone receptor (PR) fused to the DNA-binding domain
of GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Assay Protocol: PR-UAS-bla HEK 293T cells were dispensed at 3,000 cells/5 uL of assay
medium containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a
Multidrop Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a
37C and 5% C02 for 6hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred
to the assay plate by a Pintool station (Kalypsys, San Diego, CA). The plates were incubated at 37C and 5% C02
for 16hr. Then 1 uL of LiveBLAzer B/G FRET substrate was added using a BioRAPTR FRD (Flying Reagent
Dispenser, Aurora Discovery, San Diego, CA). After the plates were incubated at room temperature for 2 hours,
fluorescence intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

promegestone (R5020)

Baseline median absolute deviation for the assay (bmad): 4.232

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

Response cutoff threshold used to determine hit calls: 25.395
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

-------
2.6	Response: Progesterone receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Promegestone (R5020) was used as a positive PR agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must

-------
occur through the database and should come from the available list of methods for each processing level.

Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667

Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
350

Inactive hit count: 0
-------
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-2.21 A

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

0

14.616
lnf%

-38.876
8.382

NA

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 423.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2122

TOX21_PR_BLA_Agonist_ch2

1.	General Information

1.1	Assay Title: Tox21 HEK293 Progesterone Receptor (PR) Agonism Beta-lactamase Assay, Channel 2 Readout of
Cleaved Substrate

1.2	Assay Summary: TOX21 PR BLA Agonist is a cell-based, single-readout assay that uses transfected HEK293T, a
human kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. To
identify the compounds that stimulate PR signaling, a PR-UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T
(Invitrogen, Carlsbad, CA, USA) containing a beta-lactamase reporter gene under the control of an UAS response
element was used to screen Tox21 10K compound library. The cytotoxicity of the Tox21 compound library
against the PR-bla cell line was tested in parallel by measuring the cell viability using CellTiter-Glo assay in the
same wells. See tox21-pr-bla-agonist-pl. TOX21_PR_BLA_Agonist_ch2 is an assay readout measuring reporter
gene via receptor activity and designed using inducible reporter (beta lactamase induction) detected with GAL4
b-lactamase reporter gene. The signal is derived from the cleaved reporter gene substrate and is used to
calculate the ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data
from the assay component TOX21_PR_BLA_Agonist_ch2 was analyzed into 1 assay endpoint. This assay
endpoint, TOX21_PR_BLA_Agonist_ch2, was analyzed in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be
used to understand changes in the reporter gene as they relate to the gene NR3C3. Furthermore, this assay
endpoint can be referred to as a secondary readout, because this assay has produced multiple assay endpoints
where this one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Progesterone receptor (PR), a nuclear hormone receptor, plays an important role in development,
metabolic homeostasis and reproduction. It is activated by the steroid hormone progesterone. Endocrine

-------
disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like PR causes disruption of
normal endocrine function.

The Tox21 progesterone receptor (PR) agonism beta-lactamase assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to induce PR-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the assay
and following 16-hour incubation of cells with test compounds a membrane-permeable FRET-based substrate
CCF4-AM is introduced and incubated for an additional hour. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Following CCF4 incubation and detection, luL of CellTiter-Glo reagent is added to each
well and incubated for 30 minutes before cytotoxicity readout is measured on a ViewLux microtiter plate reader.
Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent PR-UAS-bla-HEK293T cell line used. GeneBLAzer PR-UAS-bla GripTite cells
contain the ligand-binding domain of the human progesterone receptor (PR) fused to the DNA-binding domain
of GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Assay Protocol: PR-UAS-bla HEK 293T cells were dispensed at 3,000 cells/5 uL of assay
medium containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a
Multidrop Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a
37C and 5% C02 for 6hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred
to the assay plate by a Pintool station (Kalypsys, San Diego, CA). The plates were incubated at 37C and 5% C02
for 16hr. Then 1 uL of LiveBLAzer B/G FRET substrate was added using a BioRAPTR FRD (Flying Reagent
Dispenser, Aurora Discovery, San Diego, CA). After the plates were incubated at room temperature for 2 hours,
fluorescence intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

promegestone (R5020)

Baseline median absolute deviation for the assay (bmad): 2.834

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

-------
2.6	Response: Progesterone receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Promegestone (R5020) was used as a positive PR agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must

-------
occur through the database and should come from the available list of methods for each processing level.

Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667

Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
304

Inactive hit count: 0
-------
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	3.512

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

0

8.085
lnf%

100
26.98

NA

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 533.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2123

TOX21_PR_BLA_Agonist_ratio

1.	General Information

1.1	Assay Title: Tox21 HEK293 Progesterone Receptor (PR) Agonism Beta-lactamase Assay, Ratio

1.2	Assay Summary: TOX21 PR BLA Agonist is a cell-based, single-readout assay that uses transfected HEK293T, a
human kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. To
identify the compounds that stimulate PR signaling, a PR-UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T
(Invitrogen, Carlsbad, CA, USA) containing a beta-lactamase reporter gene under the control of an UAS response
element was used to screen Tox21 10K compound library. The cytotoxicity of the Tox21 compound library
against the PR-bla cell line was tested in parallel by measuring the cell viability using CellTiter-Glo assay in the
same wells. See tox21-pr-bla-agonist-pl. TOX21_PR_BLA_Agonist_ratio is an assay readout measuring reporter
gene via receptor activity and designed using inducible reporter (beta lactamase induction) detected with GAL4
b-lactamase reportergene. The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) reportergene
substrate used as the measure of target activity. Data from the assay component TOX21_PR_BLA_Agonist_ratio
was analyzed into 1 assay endpoint. This assay endpoint, TOX21_PR_BLA_Agonist_ratio, was analyzed in the
positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type
of inducible reporter, gain-of-signal activity can be used to understand the reporter gene at the pathway-level
as they relate to the gene NR3C3. Furthermore, this assay endpoint can be referred to as a primary readout,
because this assay has produced multiple assay endpoints where this one serves a reporter gene function. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the nuclear
receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Progesterone receptor (PR), a nuclear hormone receptor, plays an important role in development,
metabolic homeostasis and reproduction. It is activated by the steroid hormone progesterone. Endocrine

-------
disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like PR causes disruption of
normal endocrine function.

The Tox21 progesterone receptor (PR) agonism beta-lactamase assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to induce PR-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the assay
and following 16-hour incubation of cells with test compounds a membrane-permeable FRET-based substrate
CCF4-AM is introduced and incubated for an additional hour. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Following CCF4 incubation and detection, luL of CellTiter-Glo reagent is added to each
well and incubated for 30 minutes before cytotoxicity readout is measured on a ViewLux microtiter plate reader.
Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Progesterone receptor (PR), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. It is activated by the steroid hormone
progesterone. Endocrine disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like
PR causes disruption of normal endocrine function. To identify the compounds that activate PR signaling, a PR-
UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T (Invitrogen, Carlsbad, CA, USA) containing a beta-lactamase
reporter gene under the control of an Upstream Activator Sequence (UAS) was used to screen Tox21 10K
compound library.

2.3	Experimental System: adherent PR-UAS-bla-HEK293T cell line used. GeneBLAzer PR-UAS-bla GripTite cells
contain the ligand-binding domain of the human progesterone receptor (PR) fused to the DNA-binding domain
of GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Assay Protocol: PR-UAS-bla HEK 293T cells were dispensed at 3,000 cells/5 uL of assay
medium containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a
Multidrop Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a
37C and 5% C02 for 6hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred
to the assay plate by a Pintool station (Kalypsys, San Diego, CA). The plates were incubated at 37C and 5% C02
for 16hr. Then 1 uL of LiveBLAzer B/G FRET substrate was added using a BioRAPTR FRD (Flying Reagent

-------
Dispenser, Aurora Discovery, San Diego, CA). After the plates were incubated at room temperature for 2 hours,
fluorescence intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

promegestone (R5020)

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 0.845

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Progesterone receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control

NA

-------
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
Promegestone (R5020) was used as a positive PR agonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were

-------
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	22.922

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

0

1.997
lnf%

100
3.672

NA

-------
((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):

(mmed/nmed)

NA

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 573.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2124

T0X21_PR_B LA_Agon ist_via bi lity

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HEK293 Progesterone Receptor (PR) Agonism Beta-lactamase
Assay

1.2	Assay Summary: TOX21 PR BLA Agonist is a cell-based, single-readout assay that uses transfected HEK293T, a
human kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well plate. To
identify the compounds that stimulate PR signaling, a PR-UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T
(Invitrogen, Carlsbad, CA, USA) containing a beta-lactamase reporter gene under the control of an UAS response
element was used to screen Tox21 10K compound library. The cytotoxicity of the Tox21 compound library
against the PR-bla cell line was tested in parallel by measuring the cell viability using CellTiter-Glo assay in the
same wells. See tox21-pr-bla-agonist-pl. TOX21_PR_BLA_Agonist_viability is a component of the
TOX21_PR_BLA_Agonist assay. This component measures cell viability using the CellTiter-Glo Luciferase-
coupled ATP quantitation. TOX21_PR_BLA_Agonist_viability used a type of viability reporter where loss-of-
signal activity can be used to understand changes in the cell viability. Furthermore, this assay endpoint can be
referred to as a secondary readout, because this assay has produced multiple assay endpoints where this one
serves a viability function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 progesterone receptor (PR) agonism beta-lactamase assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to induce PR-dependent

-------
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the assay
and following 16-hour incubation of cells with test compounds a membrane-permeable FRET-based substrate
CCF4-AM is introduced and incubated for an additional hour. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Following CCF4 incubation and detection, luL of CellTiter-Glo reagent is added to each
well and incubated for 30 minutes before cytotoxicity readout is measured on a ViewLux microtiter plate reader.
Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Progesterone receptor (PR), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. It is activated by the steroid hormone
progesterone. Endocrine disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like
PR causes disruption of normal endocrine function. To identify the compounds that activate PR signaling, a PR-
UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T (Invitrogen, Carlsbad, CA, USA) containing a beta-lactamase
reporter gene under the control of an Upstream Activator Sequence (UAS) was used to screen Tox21 10K
compound library.

2.3	Experimental System: adherent PR-UAS-bla-HEK293T cell line used. GeneBLAzer PR-UAS-bla GripTite cells
contain the ligand-binding domain of the human progesterone receptor (PR) fused to the DNA-binding domain
of GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Assay Protocol: PR-UAS-bla HEK293T cells were dispensed at 3,000 cells/5 uLof assay medium
containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a Multidrop
Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a 37C and
5% C02 for 6 hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred to the
assay plate by a Pintool station (Kalypsys, San Diego, CA). The plates were incubated at 37C and 5% C02 for 16
hr. Then 1 uL of LiveBLAzer B/G FRET substrate was added using a Flying Reagent Dispenser (FRD, Aurora
Discovery, San Diego, CA), the plates were incubated at room temperature for 2 hours, and fluorescence
intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT). For cell viability readout that
measures cytotoxicity, 4 uL/well of CellTiter-Glo reagent was added into the assay plates using a FRD. After 30
min incubation at room temperature, the luminescence intensity in the plates was measured using a ViewLux
(PerkinElmer) plate reader.

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 5.54
Response cutoff threshold used to determine hit calls: 33.24

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Progesterone receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute

-------
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	1.446

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

0

13.493
lnf%

28.745
13.703

NA

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 716.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2125

TOX21_PR_BLA_Antagonist_chl

1.	General Information

1.1	Assay Title: Tox21 HEK293 Progesterone Receptor (PR) Antagonism Beta-lactamase Assay, Channel 1 Readout
of Uncleaved Substrate

1.2	Assay Summary: TOX21_PR_BLA_Antagonist is a cell-based, single-readout assay that uses transfected
HEK293T, a human kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well
plate. To identify the compounds that stimulate PR signaling, a PR-UAS-bla GripTite cell line i.e. PR-UAS-bla
HEK293T (Invitrogen, Carlsbad, CA, USA) containing a beta-lactamase reporter gene under the control of an UAS
response element was used to screen Tox21 10K compound library. The cytotoxicity of the Tox21 compound
library against the PR-bla cell line was tested in parallel by measuring the cell viability using CellTiter-Glo assay
in the same wells. See tox21-pr-bla-antagonist-pl. TOX21_PR_BLA_Antagonist_chl is an assay readout
measuring reporter gene via receptor activity and designed using inducible reporter (beta lactamase induction)
detected with GAL4 b-lactamase reporter gene. The signal is derived from the uncleaved reporter gene
substrate and is used to calculate the ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure
of target activity. Data from the assay component TOX21_PR_BLA_Antagonist_chl was analyzed into 1 assay
endpoint. This assay endpoint, TOX21_PR_BLA_Antagonist_chl, was analyzed in the positive analysis fitting
direction relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter,
increased activity can be used to understand changes in the reporter gene as they relate to the gene NR3C3.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Progesterone receptor (PR), a nuclear hormone receptor, plays an important role in development,
metabolic homeostasis and reproduction. It is activated by the steroid hormone progesterone. Endocrine

-------
disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like PR causes disruption of
normal endocrine function.

The Tox21 progesterone receptor (PR) antagonism beta-lactamase assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to suppress PR-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the assay
and following 16-hour incubation of cells with test compounds a membrane-permeable FRET-based substrate
CCF4-AM is introduced and incubated for an additional hour. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Following CCF4 incubation and detection, luL of CellTiter-Glo reagent is added to each
well and incubated for 30 minutes before cytotoxicity readout is measured on a ViewLux microtiter plate reader.
Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent PR-UAS-bla-HEK293T cell line used. GeneBLAzer PR-UAS-bla GripTite cells
contain the ligand-binding domain of the human progesterone receptor (PR) fused to the DNA-binding domain
of GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Assay Protocol: PR-UAS-bla HEK293T cells were dispensed at 3,000 cells/4 uL of assay medium
containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a Multidrop
Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a 37C and
5% C02 for 6 hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred to the
assay plate by a Pintool station (Kalypsys, San Diego, CA), followed by addition of 1 uL (InM as final
concentration) of Promegestone (R5020) in assay medium using a BioRAPTR FRD (Flying Reagent Dispenser,
Aurora Discovery, San Diego, CA). The plates were incubated at 37C and 5% C02 for 16 hr. Then 1 uL of
LiveBLAzer B/G FRET substrate was added using a FRD. After the plates were incubated at room temperature
for 2 hours, fluorescence intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

RU486

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 4.748
Response cutoff threshold used to determine hit calls: 28.487

-------
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Progesterone receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
GW9662 was used as a positive PPARg antagonist control. RU486 was used as a positive PR antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must

-------
occur through the database and should come from the available list of methods for each processing level.

Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667

Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
306

Inactive hit count: 0
-------
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	2.3

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

0
8.61
lnf%

53.795
21.413

NA

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 457.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2126

TOX21_PR_BLA_Antagonist_ch2

1.	General Information

1.1	Assay Title: Tox21 HEK293 Progesterone Receptor (PR) Antagonism Beta-lactamase Assay, Channel 2 Readout
of Cleaved Substrate

1.2	Assay Summary: TOX21_PR_BLA_Antagonist is a cell-based, single-readout assay that uses transfected
HEK293T, a human kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well
plate. To identify the compounds that stimulate PR signaling, a PR-UAS-bla GripTite cell line i.e. PR-UAS-bla
HEK293T (Invitrogen, Carlsbad, CA, USA) containing a beta-lactamase reporter gene under the control of an UAS
response element was used to screen Tox21 10K compound library. The cytotoxicity of the Tox21 compound
library against the PR-bla cell line was tested in parallel by measuring the cell viability using CellTiter-Glo assay
in the same wells. See tox21-pr-bla-antagonist-pl. TOX21_PR_BLA_Antagonist_ch2 is an assay readout
measuring reporter gene via receptor activity and designed using inducible reporter (beta lactamase induction)
detected with GAL4 b-lactamase reporter gene. The signal is derived from the cleaved reporter gene substrate
and is used to calculate the ratio of cleaved (ch2) to uncleaved (chl) substrate used as the measure of target
activity. Data from the assay component TOX21_PR_BLA_Antagonist_ch2 was analyzed into 1 assay endpoint.
This assay endpoint, TOX21_PR_BLA_Antagonist_ch2, was analyzed in the positive analysis fitting direction
relative to DMSO as the negative control and baseline of activity. Using a type of inducible reporter, loss-of-
signal activity can be used to understand changes in the reporter gene as they relate to the gene NR3C3.
Furthermore, this assay endpoint can be referred to as a secondary readout, because this assay has produced
multiple assay endpoints where this one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Progesterone receptor (PR), a nuclear hormone receptor, plays an important role in development,
metabolic homeostasis and reproduction. It is activated by the steroid hormone progesterone. Endocrine

-------
disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like PR causes disruption of
normal endocrine function.

The Tox21 progesterone receptor (PR) antagonism beta-lactamase assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to suppress PR-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the assay
and following 16-hour incubation of cells with test compounds a membrane-permeable FRET-based substrate
CCF4-AM is introduced and incubated for an additional hour. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Following CCF4 incubation and detection, luL of CellTiter-Glo reagent is added to each
well and incubated for 30 minutes before cytotoxicity readout is measured on a ViewLux microtiter plate reader.
Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent PR-UAS-bla-HEK293T cell line used. GeneBLAzer PR-UAS-bla GripTite cells
contain the ligand-binding domain of the human progesterone receptor (PR) fused to the DNA-binding domain
of GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Assay Protocol: PR-UAS-bla HEK293T cells were dispensed at 3,000 cells/4 uL of assay medium
containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a Multidrop
Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a 37C and
5% C02 for 6 hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred to the
assay plate by a Pintool station (Kalypsys, San Diego, CA), followed by addition of 1 uL (InM as final
concentration) of Promegestone (R5020) in assay medium using a BioRAPTR FRD (Flying Reagent Dispenser,
Aurora Discovery, San Diego, CA). The plates were incubated at 37C and 5% C02 for 16 hr. Then 1 uL of
LiveBLAzer B/G FRET substrate was added using a FRD. After the plates were incubated at room temperature
for 2 hours, fluorescence intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

RU486

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 6.924
Response cutoff threshold used to determine hit calls: 41.545

-------
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Progesterone receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
GW9662 was used as a positive PPARg antagonist control. RU486 was used as a positive PR antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must

-------
occur through the database and should come from the available list of methods for each processing level.

Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667

Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
1270

Inactive hit count: 0
-------
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-6.552

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

0

14.337
lnf%

-100
5.509

NA

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 684.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2127

TOX21_PR_BLA_Antagonist_ratio

1.	General Information

1.1	Assay Title: Tox21 HEK293 Progesterone Receptor (PR) Antagonism Beta-lactamase Assay, Ratio

1.2	Assay Summary: TOX21_PR_BLA_Antagonist is a cell-based, single-readout assay that uses transfected
HEK293T, a human kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well
plate. To identify the compounds that stimulate PR signaling, a PR-UAS-bla GripTite cell line i.e. PR-UAS-bla
HEK293T (Invitrogen, Carlsbad, CA, USA) containing a beta-lactamase reporter gene under the control of an UAS
response element was used to screen Tox21 10K compound library. The cytotoxicity of the Tox21 compound
library against the PR-bla cell line was tested in parallel by measuring the cell viability using CellTiter-Glo assay
in the same wells. See tox21-pr-bla-antagonist-pl. TOX21_PR_BLA_Antagonist_ratio is an assay readout
measuring reporter gene via receptor activity and designed using inducible reporter (beta lactamase induction)
detected with GAL4 b-lactamase reportergene. The signal is derived from the ratio of cleaved (ch2) to uncleaved
(chl) reporter gene substrate used as the measure of target activity. Data from the assay component
TOX21_PR_BLA_Antagonist_ratio was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_PR_BLA_Antagonist_ratio, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity an be used to
understand changes in the reporter gene as they relate to the gene NR3C3. Furthermore, this assay endpoint
can be referred to as a primary readout, because this assay has produced multiple assay endpoints and this ratio
serves a reporter gene function to understand target activity. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the
subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Progesterone receptor (PR), a nuclear hormone receptor, plays an important role in development,
metabolic homeostasis and reproduction. It is activated by the steroid hormone progesterone. Endocrine

-------
disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like PR causes disruption of
normal endocrine function.

The Tox21 progesterone receptor (PR) antagonism beta-lactamase assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to suppress PR-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the assay
and following 16-hour incubation of cells with test compounds a membrane-permeable FRET-based substrate
CCF4-AM is introduced and incubated for an additional hour. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Following CCF4 incubation and detection, luL of CellTiter-Glo reagent is added to each
well and incubated for 30 minutes before cytotoxicity readout is measured on a ViewLux microtiter plate reader.
Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Progesterone receptor (PR), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. It is activated by the steroid hormone
progesterone. Endocrine disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like
PR causes disruption of normal endocrine function. To identify the compounds that inhibit PR signaling, a PR-
UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T (Invitrogen, Carlsbad, CA, USA) containing a beta-lactamase
reporter gene under the control of an Upstream Activator Sequence (UAS) was used to screen Tox21 10K
compound library.

2.3	Experimental System: adherent PR-UAS-bla-HEK293T cell line used. GeneBLAzer PR-UAS-bla GripTite cells
contain the ligand-binding domain of the human progesterone receptor (PR) fused to the DNA-binding domain
of GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Assay Protocol: PR-UAS-bla HEK293T cells were dispensed at 3,000 cells/4 uLof assay medium
containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a Multidrop
Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a 37C and
5% C02 for 6 hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred to the
assay plate by a Pintool station (Kalypsys, San Diego, CA), followed by addition of 1 uL (InM as final
concentration) of Promegestone (R5020) in assay medium using a BioRAPTR FRD (Flying Reagent Dispenser,

-------
Aurora Discovery, San Diego, CA). The plates were incubated at 37C and 5% C02 for 16 hr. Then 1 uL of
LiveBLAzer B/G FRET substrate was added using a FRD. After the plates were incubated at room temperature
for 2 hours, fluorescence intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

RU486

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 2.512

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Progesterone receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100

NA

-------
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
GW9662 was used as a positive PPARg antagonist control. RU486 was used as a positive PR antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates

-------
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-21.39

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

0

4.075
lnf%

-100
1.971

NA

-------
Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 970.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
-------
Assay Endpoint ID: 2128

T0X2 l_PR_BLA_Antagon ist_via bi I ity

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HEK293 Progesterone Receptor (PR) Antagonism Beta-lactamase
Assay

1.2	Assay Summary: TOX21_PR_BLA_Antagonist is a cell-based, single-readout assay that uses transfected
HEK293T, a human kidney cell line, with measurements taken at 24 hours after chemical dosing in a 1536-well
plate. To identify the compounds that stimulate PR signaling, a PR-UAS-bla GripTite cell line i.e. PR-UAS-bla
HEK293T (Invitrogen, Carlsbad, CA, USA) containing a beta-lactamase reporter gene under the control of an UAS
response element was used to screen Tox21 10K compound library. The cytotoxicity of the Tox21 compound
library against the PR-bla cell line was tested in parallel by measuring the cell viability using CellTiter-Glo assay
in the same wells. See tox21-pr-bla-antagonist-pl. TOX21_PR_BLA_Antagonist_viability is a component of the
TOX21_PR_BLA_Antagonist assay. This component measures cell viability using the CellTiter-Glo Luciferase-
coupled ATP quantitation. TOX21_PR_BLA_Antagonist_viability used a type of viability reporter where loss-of-
signal activity can be used to understand changes in the cell viability. Furthermore, this assay endpoint can be
referred to as a secondary readout, because this assay has produced multiple assay endpoints where this one
serves a viability function. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 progesterone receptor (PR) antagonism beta-lactamase assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to suppress PR-dependent

-------
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the assay
and following 16-hour incubation of cells with test compounds a membrane-permeable FRET-based substrate
CCF4-AM is introduced and incubated for an additional hour. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Following CCF4 incubation and detection, luL of CellTiter-Glo reagent is added to each
well and incubated for 30 minutes before cytotoxicity readout is measured on a ViewLux microtiter plate reader.
Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Progesterone receptor (PR), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. It is activated by the steroid hormone
progesterone. Endocrine disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like
PR causes disruption of normal endocrine function. To identify the compounds that inhibit PR signaling, a PR-
UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T (Invitrogen, Carlsbad, CA, USA) containing a beta-lactamase
reporter gene under the control of an Upstream Activator Sequence (UAS) was used to screen Tox21 10K
compound library.

2.3	Experimental System: adherent PR-UAS-bla-HEK293T cell line used. GeneBLAzer PR-UAS-bla GripTite cells
contain the ligand-binding domain of the human progesterone receptor (PR) fused to the DNA-binding domain
of GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Assay Protocol: PR-UAS-bla HEK293T cells were dispensed at 3,000 cells/4 uLof assay medium
containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a Multidrop
Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a 37C and
5% C02 for 4 hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred to the
assay plate by a Pintool station (Kalypsys, San Diego, CA), followed by addition of 1 uL of R5020 in assay medium
using a Flying Reagent Dispenser (FRD, Aurora Discovery, San Diego, CA). The plates were incubated at 37C and
5% C02 for 16 hr. Then 1 uL of LiveBLAzer B/G FRET substrate was added using a Flying Reagent Dispenser, the
plates were incubated at room temperature for 2 hours, and fluorescence intensity was measured by an Envision
plate reader (PerkinElmer, Shelton, CT). For cell viability readout that measures cytotoxicity, 4 uL/well of
CellTiter-Glo reagent was added into the assay plates using a FRD. After 30 min incubation at room temperature,
the luminescence intensity in the plates was measured using a ViewLux (PerkinElmer) plate reader.

-------
ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 5.48
Response cutoff threshold used to determine hit calls: 32.879

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Progesterone receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute

-------
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

0

11.745
lnf%

NA
NA

NA

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 642.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2211

TOX21_ERa_LUC_VM7_Agonist_10nM_ICI182780

1.	General Information

1.1	Assay Title: Tox21 VM7 Estrogen Receptor-alpha (ESR1) Agonism (lOnM ICI182780) Luciferase Assay

1.2	Assay Summary: TOX21_ERa_LUC_VM7_Agonist_10nM_ICI182780 is a cell-based, single-readout assay that
uses VM7, a human breast tissue cell line, with measurements taken at 22 hours after chemical dosing in a 1536-
well plate. This is a secondary assay for specificity to TOX21_ERa_LUC_VM7_Agonist. See tox21-er-luc-bgl-4e2-
agonist-p4. TOX21_ERa_LUC_VM7_Agonist_10nM_ICI182780 is one of one assay component(s) measured or
calculated from the TOX21_ERa_LUC_VM7_Agonist_10nM_ICI182780 assay. It is designed to make
measurements of luciferase induction, a form of inducible reporter, as detected with bioluminescence signals
by CellTiter-Glo Luciferase-coupled ATP quantitation technology. Data from the assay component
TOX21_ERa_LUC_VM7_Agonist was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_ERa_LUC_VM7_Agonist, was analyzed in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of inducible reporter, gain-of-signal activity can be used
to understand changes in the reporter gene as they relate to the gene ESR1. Furthermore, this assay endpoint
can be referred to as a primary readout, because the performed assay has only produced 1 assay endpoint. To
generalize the intended target to other relatable targets, this assay endpoint is annotated to the nuclear
receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected VM7 cells are aliquoted into 1536-well microtiter plates
and incubated with test compounds for 24 hours prior to monitoring luminescence resulting from ER gene
expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_ERa_LUC_VM7_Agonist_10nM_ICI182780 was designed to measure changes to
bioluminescence signals produced from an enzymatic reaction involving the key substrate [One-Glo] in the
presence of an ER antagonist. Changes are indicative of transcriptional gene expression that may not be due to
direct regulation by the human estrogen receptor 1 [GeneSymbokESRl | GenelD:2099 |
Uniprot_SwissProt_Accession:P03372],

-------
The Tox21 VM7 estrogen receptor alpha agonism luciferase assay screened a library of diverse environmental
compounds to probe for xenoestrogenic ligand-binding and potential to inhibit estrogen-dependent
transcription, as monitored through luciferase reporter gene signal activity using an endogenous full-length
ERalpha cell line of human breast tissue (VM7) that was stably transfected with an estrogen-responsive
luciferase reporter gene plasmid (pGudLuc7ere). The assay is run in triplicate on 1536-well microplate and
bioluminescence was measured following 24 hour incubation of cells with test compounds and 30 min
incubation of test system with ONE-GloTM luciferase assay reagent. The bioluminescent signal was monitored
using a Promega ViewLux plate reader to measure agonistic activity, this assay is performed with small amounts
of an ER alpha antagonist (ICI182780) added to each well and each compound is evaluated against a known ER
alpha agonist (beta-estradiol, E2) as a positive control (100 percent inhibition). Test compounds were assayed
for cytotoxicity by measuring protease activity with Promega CellTiter-Fluor with tetraoctylammonium bromide
as a positive control for cell death. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Estrogen receptor (ER), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. Endocrine disrupting chemicals
(EDCs) and their interactions with steroid hormone receptors like ER causes disruption of normal endocrine
function. Therefore, it is important to understand the effect of environmental chemicals on the ER signaling
pathway. To identify ER agonists, VM7-Luc-4E2 cell line (provided by Dr. Michael Denison from University of
California) has been used to screen the Tox21 library of diverse environmental compounds. VM7-Luc-4E2 cell
line endogenously expresses full-length ER-alpha and is stably transfected with a plasmid containing four
estrogen responsive elements (ERE) upstream of a luciferase reporter gene. The ERalpha_LUC_VM7 assays are
qHTS format assays which measure the ability of a chemical to interact with estrogen receptor alpha (ERalpha)
by monitoring modulation of fluorescence reporter gene signals. This assay utilized a human breast (VM7-Luc-
4E2) cell line which expresses endogenous full-length ERalpha and a luciferase reporter gene (ER-luc) to quantify
xenoestrogenic activity. This assay is intended for use as a part of an integrated testing strategy, to screen a
large structurally diverse chemical library for compounds with the potential to interact with estrogen receptor
alpha mediated pathways and potentially affect endocrine systems in exposed populations. There is strong
evidence that estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse Outcome Pathway
(AOP) leading to reproductive dysfunction in oviparous vertebrates, and there is some evidence that estrogen
receptor activation is the MIE for putative AOPs leading to reduced survival due to renal failure and leading to
skewed sex ratios due to altered sexual differentiation in males (all AOPs currently under development).
Chemical-activity profiles derived from this assay can inform prioritization decisions for compound selection in
more resource intensive in vivo studies to further investigate the involvement of ER agonism in pathways leading
to hazardous outcomes in biological systems.

2.3	Experimental System: adherent VM7 cell line used. Michigan Cancer Foundation-7 (MCF-7) cells are a breast
cancer cell line originating from a 69-year old woman in 1970 (Lee et al. 2015). This is an immortalized cell line
which endogenously expresses receptors for estrogen (ESR1 and ESR2) and progesterone (Geisinger et al. 1989)
as well as growth factors EGF and IGF-1 (Baldwin et al. 1998), and so provides an alternative to breast cell lines
for estrogen-sensitive proliferation assays. vM7-Luc-4E2 cells are vMCF7 cells which are stably transfected with
plasmid containing four estrogen responsive elements upstream of a luciferase reporter gene (Rogers and
Denison 2000) to measure transactivation activity occurring along estrogen signaling pathways. This was
previously the BGl-Luc estrogen receptor transactivation test method, but BGlLuc4E2 cells are being renamed
VM7Luc4E2 cells because recent DNA analysis (STR) revealed that the original cell line used to generate the

-------
BGlLuc4E2 cells were not human ovarian carcinoma (BG-1) cells but a variant of human breast cancer (MCF7)
cells.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Maintain cells below 85-90% confluence in culture medium. For
assay purpose, the cells should be cultured in assay medium with 10% charcoal stripped FBS for 5 days with
alternate day medium changed to fresh medium. Especially while in assay culture, the cells should not reach
more than 85% confluence as they would become harder to detach if they reach over confluence. Cell Thawing
Method: Thaw a frozen vial of cells in 9ml of pre-warmed medium and seed them in T175 flask at 2 million cells.
Cell Proliferation Method: Trypsinize cells from the flask and centrifuge. Add culture medium to the pellet and
passage at 3-4 million per T-225 flask. Assay Protocol: Harvest cells from the 5-day culture in assay medium and
resuspend cells in assay medium. Dispense 4000 cells/4uL/well into 1536-well tissue treated white/solid bottom
plates using an 8 tip dispenser (Multidrop). Incubate the assay plates for 22hrs at 37C and 5% C02. First luL of
lO.OnM (final concentration) ICI-182,780 (ER-Antagonist) or assay buffer was added using two separate tips of
a dispenser (BioRAPTR). Then transfer 23nL of compounds from the library collection and positive control to the
assay plates by using a Pintool station. Incubate the assay plates for 22hrs at 37C and 5% C02. After 21hrs of
incubation, lul of CellTiter-Fluor(TM) Cell Viability Assay reagent was added using a single tip of a dispenser
(BioRAPTR). Incubate the assay plates at 37C and 5% C02 for lhr. Measure fluorescence signal by ViewLux plate
reader (Exposure time = lsec). Then add 4ul of ONE-Glo(TM) Luciferase reagent using a single tip of a dispenser
(BioRAPTR). Incubate the plates at room temperature for 30min.

Baseline median absolute deviation for the assay (bmad): 0.534
Response cutoff threshold used to determine hit calls: 20

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Agonism of the estrogen receptor (ERa) signaling pathway is measured by bioluminescence activity
via an estrogen-responsive firefly luciferase reporter gene. Increased luciferase activity can be used to identify
the compounds that induce xenoestrogenic ligand-binding and ERalpha agonism. The cytotoxicity of the
compounds screened was tested in parallel by measuring the cell viability using CellTiter-Fluor assay in the same
wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

17b-Estradiol

I arget (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

NA

-------
Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Increased luminescence (gain-of-signal) was measured relative to 17beta-estradiol (positive
control) signal, using DMSO (neutral control) as a baseline for luciferase induction. Response was reported as a
percent of positive control activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active

-------
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: Oihitc 0.9	NA hit count: hitc^O

219	9254	194

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	616

gain-loss (gnls) model:	798

power(pow) model:	667

linear-polynomial (polyl) model:	2251

quadratic-polynomial(poly2) model:	778

exponential-2 (exp2) model:	235

exponential-3 (exp3) model:	49

exponential-4 (exp4) model:	3499

exponential-5 (exp5) model:	774

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

-------
3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	0.928

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	lnf%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	100

Positive control well median absolute deviation, by plate: pmad	5.139

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	18.961

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

-------
NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 774.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

-------
6.	Bibliography: Judson RS, Magpantay FM, Chickarmane V, Haskell C, Tania N, Taylor J, Xia M, Huang R, Rotroff
DM, Filer DL, Houck KA, Martin MT, Sipes N, Richard AM, Mansouri K, Setzer RW, Knudsen TB, Crofton KM,
Thomas RS. Integrated Model of Chemical Perturbations of a Biological Pathway Using 18 In Vitro High-
Throughput Screening Assays for the Estrogen Receptor. Toxicol Sci. 2015 Nov;148(l):137-54. doi:
10.1093/toxsci/kfvl68. Epub 2015 Aug 13. PMID: 26272952; PMCID: PMC4635633., Huang R, Sakamuru S,
Martin MT, Reif DM, Judson RS, Houck KA, Casey W, Hsieh JH, Shockley KR, Ceger P, Fostel J, Witt KL, Tong W,
Rotroff DM, Zhao T, Shinn P, Simeonov A, Dix DJ, Austin CP, Kavlock RJ, Tice RR, Xia M. Profiling of the Tox21
10K compound library for agonists and antagonists of the estrogen receptor alpha signaling pathway. Sci Rep.
2014 Jul 11;4:5664. doi: 10.1038/srep05664. PMID: 25012808; PMCID: PMC4092345.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2212

TOX21_ERa_LUC_VM7_Agonist_10nM_ICI182780_viability

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 VM7 Estrogen Receptor-alpha (ESR1) Agonism (lOnM ICI182780)
Luciferase Assay

1.2	Assay Summary: TOX21_ERa_LUC_VM7_Agonist_10nM_ICI182780 is a cell-based, single-readout assay that
uses VM7, a human breast tissue cell line, with measurements taken at 22 hours after chemical dosing in a 1536-
well plate. This is a secondary assay for specificity to TOX21_ERa_LUC_VM7_Agonist. See tox21-er-luc-bgl-4e2-
agonist-p4. TOX21_ERa_LUC_VM7_Agonist_10nM_ICI182780_viability is an assay readout measuring cellular
ATP content and detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
TOX21_ERa_LUC_VM7_Agonist_10nM_ICI182780_viability used a type of viability reporter where loss-of-signal
activity can be used to understand changes in the cell viability. Furthermore, this assay endpoint can be referred
to as a secondary readout, because this assay has produced multiple assay endpoints where this one serves a
viability function. To generalize the intended target to other relatable targets, this assay endpoint is annotated
to the cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected VM7 cells are aliquoted into 1536-well microtiter plates
and incubated with test compounds for 24 hours prior to monitoring luminescence resulting from ER gene
expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 VM7 estrogen receptor alpha agonism luciferase assay screened a library of diverse environmental
compounds to probe for xenoestrogenic ligand-binding and potential to inhibit estrogen-dependent
transcription, as monitored through luciferase reporter gene signal activity using an endogenous full-length
ERalpha cell line of human breast tissue (VM7) that was stably transfected with an estrogen-responsive
luciferase reporter gene plasmid (pGudLuc7ere). The assay is run in triplicate on 1536-well microplate and

-------
bioluminescence was measured following 24 hour incubation of cells with test compounds and 30 min
incubation of test system with ONE-GloTM luciferase assay reagent. The bioluminescent signal was monitored
using a Promega ViewLux plate reader to measure agonistic activity, this assay is performed with small amounts
of an ER alpha antagonist (ICI182780) added to each well and each compound is evaluated against a known ER
alpha agonist (beta-estradiol, E2) as a positive control (100 percent inhibition). Test compounds were assayed
for cytotoxicity by measuring protease activity with Promega CellTiter-Fluor with tetraoctylammonium bromide
as a positive control for cell death. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor"
assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Estrogen receptor (ER), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. Endocrine disrupting chemicals
(EDCs) and their interactions with steroid hormone receptors like ER causes disruption of normal endocrine
function. Therefore, it is important to understand the effect of environmental chemicals on the ER signaling
pathway. To identify ER agonists, VM7-Luc-4E2 cell line (provided by Dr. Michael Denison from University of
California) has been used to screen the Tox21 library of diverse environmental compounds. VM7-Luc-4E2 cell
line endogenously expresses full-length ER-alpha and is stably transfected with a plasmid containing four
estrogen responsive elements (ERE) upstream of a luciferase reporter gene. The ERalpha_LUC_VM7 assays are
qHTS format assays which measure the ability of a chemical to interact with estrogen receptor alpha (ERalpha)
by monitoring modulation of fluorescence reporter gene signals. This assay utilized a human breast (VM7-Luc-
4E2) cell line which expresses endogenous full-length ERalpha and a luciferase reporter gene (ER-luc) to quantify
xenoestrogenic activity. This assay is intended for use as a part of an integrated testing strategy, to screen a
large structurally diverse chemical library for compounds with the potential to interact with estrogen receptor
alpha mediated pathways and potentially affect endocrine systems in exposed populations. There is strong
evidence that estrogen receptor agonism is the Molecular Initiating Event (MIE) in an Adverse Outcome Pathway
(AOP) leading to reproductive dysfunction in oviparous vertebrates, and there is some evidence that estrogen
receptor activation is the MIE for putative AOPs leading to reduced survival due to renal failure and leading to
skewed sex ratios due to altered sexual differentiation in males (all AOPs currently under development).
Chemical-activity profiles derived from this assay can inform prioritization decisions for compound selection in
more resource intensive in vivo studies to further investigate the involvement of ER agonism in pathways leading
to hazardous outcomes in biological systems.

2.3	Experimental System: adherent VM7 cell line used. Michigan Cancer Foundation-7 (MCF-7) cells are a breast
cancer cell line originating from a 69-year old woman in 1970 (Lee et al. 2015). This is an immortalized cell line
which endogenously expresses receptors for estrogen (ESR1 and ESR2) and progesterone (Geisinger et al. 1989)
as well as growth factors EGF and IGF-1 (Baldwin et al. 1998), and so provides an alternative to breast cell lines
for estrogen-sensitive proliferation assays. vM7-Luc-4E2 cells are vMCF7 cells which are stably transfected with
plasmid containing four estrogen responsive elements upstream of a luciferase reporter gene (Rogers and
Denison 2000) to measure transactivation activity occurring along estrogen signaling pathways. This was
previously the BGl-Luc estrogen receptor transactivation test method, but BGlLuc4E2 cells are being renamed
VM7Luc4E2 cells because recent DNA analysis (STR) revealed that the original cell line used to generate the
BGlLuc4E2 cells were not human ovarian carcinoma (BG-1) cells but a variant of human breast cancer (MCF7)
cells.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

-------
2.5 Exposure Regime: Quality Control Precautions: Maintain cells below 85-90% confluence in culture medium. For
assay purpose, the cells should be cultured in assay medium with 10% charcoal stripped FBS for 5 days with
alternate day medium changed to fresh medium. Especially while in assay culture, the cells should not reach
more than 85% confluence as they would become harder to detach if they reach over confluence. Cell Thawing
Method: Thaw a frozen vial of cells in 9ml of pre-warmed medium and seed them in T175 flask at 2 million cells.
Cell Proliferation Method: Trypsinize cells from the flask and centrifuge. Add culture medium to the pellet and
passage at 3-4 million per T-225 flask. Assay Protocol: Harvest cells from the 5-day culture in assay medium and
resuspend cells in assay medium. Dispense 4000 cells/4uL/well into 1536-well tissue treated white/solid bottom
plates using an 8 tip dispenser (Multidrop). Incubate the assay plates for 22hrs at 37C and 5% C02. First luL of
lO.OnM (final concentration) ICI-182,780 (ER-Antagonist) or assay buffer was added using two separate tips of
a dispenser (BioRAPTR). Then transfer 23nL of compounds from the library collection and positive control to the
assay plates by using a Pintool station. Incubate the assay plates for 22hrs at 37C and 5% C02. After 21hrs of
incubation, lul of CellTiter-Fluor(TM) Cell Viability Assay reagent was added using a single tip of a dispenser
(BioRAPTR). Incubate the assay plates at 37C and 5% C02 for lhr. Measure fluorescence signal by ViewLux plate
reader (Exposure time = lsec). Then add 4ul of ONE-Glo(TM) Luciferase reagent using a single tip of a dispenser
(BioRAPTR). Incubate the plates at room temperature for 30min.

Baseline median absolute deviation for the assay (bmad): 1.88
Response cutoff threshold used to determine hit calls: 20

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Agonism of the estrogen receptor (ERa) signaling pathway is measured by bioluminescence activity
via an estrogen-responsive firefly luciferase reporter gene. Increased luciferase activity can be used to identify
the compounds that induce xenoestrogenic ligand-binding and ERalpha agonism. The cytotoxicity of the
compounds screened was tested in parallel by measuring the cell viability using CellTiter-Fluor assay in the same
wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 nM
Neutral vehicle control:

DMSO

3.

Additionally, this assay was annotated to the intended target family of cell cycle.
Data Interpretation

-------
The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:

-------
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0

Neutral control median absolute deviation, by plate: nmad	3.447

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	lnf%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	-1.397

Positive control well median absolute deviation, by plate: pmad	3.496

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.264

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

-------
NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 582.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

-------
6.	Bibliography: Judson RS, Magpantay FM, Chickarmane V, Haskell C, Tania N, Taylor J, Xia M, Huang R, Rotroff
DM, Filer DL, Houck KA, Martin MT, Sipes N, Richard AM, Mansouri K, Setzer RW, Knudsen TB, Crofton KM,
Thomas RS. Integrated Model of Chemical Perturbations of a Biological Pathway Using 18 In Vitro High-
Throughput Screening Assays for the Estrogen Receptor. Toxicol Sci. 2015 Nov;148(l):137-54. doi:
10.1093/toxsci/kfvl68. Epub 2015 Aug 13. PMID: 26272952; PMCID: PMC4635633., Huang R, Sakamuru S,
Martin MT, Reif DM, Judson RS, Houck KA, Casey W, Hsieh JH, Shockley KR, Ceger P, Fostel J, Witt KL, Tong W,
Rotroff DM, Zhao T, Shinn P, Simeonov A, Dix DJ, Austin CP, Kavlock RJ, Tice RR, Xia M. Profiling of the Tox21
10K compound library for agonists and antagonists of the estrogen receptor alpha signaling pathway. Sci Rep.
2014 Jul 11;4:5664. doi: 10.1038/srep05664. PMID: 25012808; PMCID: PMC4092345.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2214

T0X21_PR_B LA_Fol lowu p_Agon ist_ch 1

1. General Information

1.1	Assay Title: Tox21 HEK293 Progesterone Receptor (PR) Agonism Beta-lactamase Follow-Up Assay, Channel 1
Readout of Uncleaved Substrate

1.2	Assay Summary: TOX21_PR_BLA_Followup is a secondary assay for specificity for the TOX21_PR_BLA_Agonist
assay. To differentiate true PR agonists from cytotoxic substances, this follow up assay is multiplexed with a cell
viability assay. The compounds were also tested for auto fluorescence that may interfere with the biological
target readout resulting in potential false positives and/or negatives. To identify the compounds that stimulate
PR signaling, a PR-UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T (Invitrogen, Carlsbad, CA, USA) containing
a beta-lactamase reporter gene under the control of an UAS response element was used to screen Tox21 10K
compound library. The cytotoxicity of the Tox21 compound library against the PR-bla cell line was tested in
parallel by measuring the cell viability using CellTiter-Glo assay in the same wells.
TOX21_PR_BLA_Followup_Agonist_chl is an assay readout measuring reporter gene via receptor activity and
designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter gene.
The signal is derived from the uncleaved reporter gene substrate and is used to calculate the ratio of cleaved
(ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay component
TOX21_PR_BLA_Followup_Agonist_chl was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_PR_BLA_Followup_Agonist_chl, was analyzed in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be
used to understand changes in the reporter gene as they relate to the gene NR3C3. Furthermore, this assay
endpoint can be referred to as a secondary readout, because this assay has produced multiple assay endpoints
where this one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Progesterone receptor (PR), a nuclear hormone receptor, plays an important role in development,
metabolic homeostasis and reproduction. It is activated by the steroid hormone progesterone. Endocrine
disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like PR causes disruption of
normal endocrine function.

The Tox21 progesterone receptor (PR) agonism beta-lactamase follow-up assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to induce PR-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the assay
and following 16-hour incubation of cells with test compounds a membrane-permeable FRET-based substrate
CCF4-AM is introduced and incubated for an additional hour. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Following CCF4 incubation and detection, luL of CellTiter-Glo reagent is added to each
well and incubated for 30 minutes before cytotoxicity readout is measured on a ViewLux microtiter plate reader.
Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent PR-UAS-bla-HEK293T cell line used. GeneBLAzer PR-UAS-bla GripTite cells
contain the ligand-binding domain of the human progesterone receptor (PR) fused to the DNA-binding domain
of GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Assay Protocol: PR-UAS-bla HEK 293T cells were dispensed at 3,000 cells/5 uL of assay
medium containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a
Multidrop Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a
37C and 5% C02 for 6hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred
to the assay plate by a Pintool station (Kalypsys, San Diego, CA). The plates were incubated at 37C and 5% C02
for 16hr. Then 1 uL of LiveBLAzer B/G FRET substrate was added using a BioRAPTR FRD (Flying Reagent
Dispenser, Aurora Discovery, San Diego, CA). After the plates were incubated at room temperature for 2 hours,
fluorescence intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

0.00162327108940092 nM
Key positive control:

promegestone (R5020)

Baseline median absolute deviation for the assay (bmad): 10.352

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

95.852534562212 nM
Neutral vehicle control:

DMSO

-------
Response cutoff threshold used to determine hit calls: 31.056
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Progesterone receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
RU486 was used as a positive PR antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 230

Number of chemicals tested: 230

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-2.846

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

0

10.553
lnf%

-32.89
4.689

NA

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 14.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2215

TOX21_PR_BLA_Followup_Antagonist_chl

1. General Information

1.1	Assay Title: Tox21 HEK293 Progesterone Receptor (PR) Antagonism Beta-lactamase Follow-Up Assay, Channel 1
Readout of Uncleaved Substrate

1.2	Assay Summary: TOX21_PR_BLA_Followup is a secondary assay for specificity for the TOX21_PR_BLA_Agonist
assay. To differentiate true PR agonists from cytotoxic substances, this follow up assay is multiplexed with a cell
viability assay. The compounds were also tested for auto fluorescence that may interfere with the biological
target readout resulting in potential false positives and/or negatives. To identify the compounds that stimulate
PR signaling, a PR-UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T (Invitrogen, Carlsbad, CA, USA) containing
a beta-lactamase reporter gene under the control of an UAS response element was used to screen Tox21 10K
compound library. The cytotoxicity of the Tox21 compound library against the PR-bla cell line was tested in
parallel by measuring the cell viability using CellTiter-Glo assay in the same wells.
TOX21_PR_BLA_Followup_Antagonist_chl is an assay readout measuring reporter gene via receptor activity
and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter
gene. The signal is derived from the uncleaved reporter gene substrate and is used to calculate the ratio of
cleaved (ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay
component TOX21_PR_BLA_Followup_Antagonist_chl was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_PR_BLA_Followup_Antagonist_chl, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of inducible reporter, increased activity can
be used to understand changes in the reporter gene as they relate to the gene NR3C3. Furthermore, this assay
endpoint can be referred to as a secondary readout, because this assay has produced multiple assay endpoints
where this one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Progesterone receptor (PR), a nuclear hormone receptor, plays an important role in development,
metabolic homeostasis and reproduction. It is activated by the steroid hormone progesterone. Endocrine
disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like PR causes disruption of
normal endocrine function.

The Tox21 progesterone receptor (PR) antagonism beta-lactamase follow-up assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to suppress PR-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the assay
and following 16-hour incubation of cells with test compounds a membrane-permeable FRET-based substrate
CCF4-AM is introduced and incubated for an additional hour. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Following CCF4 incubation and detection, luL of CellTiter-Glo reagent is added to each
well and incubated for 30 minutes before cytotoxicity readout is measured on a ViewLux microtiter plate reader.
Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent PR-UAS-bla-HEK293T cell line used. GeneBLAzer PR-UAS-bla GripTite cells
contain the ligand-binding domain of the human progesterone receptor (PR) fused to the DNA-binding domain
of GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Assay Protocol: PR-UAS-bla HEK293T cells were dispensed at 3,000 cells/4 uL of assay medium
containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a Multidrop
Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a 37C and
5% C02 for 6 hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred to the
assay plate by a Pintool station (Kalypsys, San Diego, CA), followed by addition of 1 uL (InM as final
concentration) of Promegestone (R5020) in assay medium using a BioRAPTR FRD (Flying Reagent Dispenser,
Aurora Discovery, San Diego, CA). The plates were incubated at 37C and 5% C02 for 16 hr. Then 1 uL of
LiveBLAzer B/G FRET substrate was added using a FRD. After the plates were incubated at room temperature
for 2 hours, fluorescence intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

0.00162327108940092 nM
Key positive control:

RU486

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

95.852534562212 nM
Neutral vehicle control:

DMSO

-------
Baseline median absolute deviation for the assay (bmad): 9.562
Response cutoff threshold used to determine hit calls: 28.686
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Progesterone receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 1.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
RU486 was used as a positive PR antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 230

Number of chemicals tested: 230

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
35

Inactive hit count: 0
-------
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	1.571

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

0

9.665
lnf%

38.16
22.804

NA

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 22.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2216

T0X21_PR_B LA_Fol lowu p_Agon ist_ch2

1. General Information

1.1	Assay Title: Tox21 HEK293 Progesterone Receptor (PR) Agonism Beta-lactamase Follow-Up Assay, Channel 2
Readout of Cleaved Substrate

1.2	Assay Summary: TOX21_PR_BLA_Followup is a secondary assay for specificity for the TOX21_PR_BLA_Agonist
assay. To differentiate true PR agonists from cytotoxic substances, this follow up assay is multiplexed with a cell
viability assay. The compounds were also tested for auto fluorescence that may interfere with the biological
target readout resulting in potential false positives and/or negatives. To identify the compounds that stimulate
PR signaling, a PR-UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T (Invitrogen, Carlsbad, CA, USA) containing
a beta-lactamase reporter gene under the control of an UAS response element was used to screen Tox21 10K
compound library. The cytotoxicity of the Tox21 compound library against the PR-bla cell line was tested in
parallel by measuring the cell viability using CellTiter-Glo assay in the same wells.
TOX21_PR_BLA_Followup_Agonist_ch2 is an assay readout measuring reporter gene via receptor activity and
designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter gene.
The signal is derived from the cleaved reporter gene substrate and is used to calculate the ratio of cleaved (ch2)
to uncleaved (chl) substrate used as the measure of target activity. Data from the assay component
TOX21_PR_BLA_Followup_Agonist_ch2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_PR_BLA_Followup_Agonist_ch2, was analyzed in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity can be
used to understand changes in the reporter gene as they relate to the gene NR3C3. Furthermore, this assay
endpoint can be referred to as a secondary readout, because this assay has produced multiple assay endpoints
where this one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Progesterone receptor (PR), a nuclear hormone receptor, plays an important role in development,
metabolic homeostasis and reproduction. It is activated by the steroid hormone progesterone. Endocrine
disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like PR causes disruption of
normal endocrine function.

The Tox21 progesterone receptor (PR) agonism beta-lactamase follow-up assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to induce PR-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the assay
and following 16-hour incubation of cells with test compounds a membrane-permeable FRET-based substrate
CCF4-AM is introduced and incubated for an additional hour. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Following CCF4 incubation and detection, luL of CellTiter-Glo reagent is added to each
well and incubated for 30 minutes before cytotoxicity readout is measured on a ViewLux microtiter plate reader.
Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent PR-UAS-bla-HEK293T cell line used. GeneBLAzer PR-UAS-bla GripTite cells
contain the ligand-binding domain of the human progesterone receptor (PR) fused to the DNA-binding domain
of GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Assay Protocol: PR-UAS-bla HEK 293T cells were dispensed at 3,000 cells/5 uL of assay
medium containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a
Multidrop Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a
37C and 5% C02 for 6hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred
to the assay plate by a Pintool station (Kalypsys, San Diego, CA). The plates were incubated at 37C and 5% C02
for 16hr. Then 1 uL of LiveBLAzer B/G FRET substrate was added using a BioRAPTR FRD (Flying Reagent
Dispenser, Aurora Discovery, San Diego, CA). After the plates were incubated at room temperature for 2 hours,
fluorescence intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

0.00162327108940092 nM
Key positive control:

promegestone (R5020)

Baseline median absolute deviation for the assay (bmad): 3.613

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

95.852534562212 nM
Neutral vehicle control:

DMSO

-------
Response cutoff threshold used to determine hit calls: 10.839
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Progesterone receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
RU486 was used as a positive PR antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 230

Number of chemicals tested: 230

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
61

Inactive hit count: 0
-------
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	6.072

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

0
3.18
lnf%

100
15.961

NA

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 15.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2217

TOX21_PR_BLA_Followup_Antagonist_ch2

1. General Information

1.1	Assay Title: Tox21 HEK293 Progesterone Receptor (PR) Antagonism Beta-lactamase Follow-Up Assay, Channel 2
Readout of Cleaved Substrate

1.2	Assay Summary: TOX21_PR_BLA_Followup is a secondary assay for specificity for the TOX21_PR_BLA_Agonist
assay. To differentiate true PR agonists from cytotoxic substances, this follow up assay is multiplexed with a cell
viability assay. The compounds were also tested for auto fluorescence that may interfere with the biological
target readout resulting in potential false positives and/or negatives. To identify the compounds that stimulate
PR signaling, a PR-UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T (Invitrogen, Carlsbad, CA, USA) containing
a beta-lactamase reporter gene under the control of an UAS response element was used to screen Tox21 10K
compound library. The cytotoxicity of the Tox21 compound library against the PR-bla cell line was tested in
parallel by measuring the cell viability using CellTiter-Glo assay in the same wells.
TOX21_PR_BLA_Followup_Antagonist_ch2 is an assay readout measuring reporter gene via receptor activity
and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter
gene. The signal is derived from the cleaved reporter gene substrate and is used to calculate the ratio of cleaved
(ch2) to uncleaved (chl) substrate used as the measure of target activity. Data from the assay component
TOX21_PR_BLA_Followup_Antagonist_ch2 was analyzed into 1 assay endpoint. This assay endpoint,
TOX21_PR_BLA_Followup_Antagonist_ch2, was analyzed in the positive analysis fitting direction relative to
DMSO as the negative control and baseline of activity. Using a type of inducible reporter, loss-of-signal activity
can be used to understand changes in the reporter gene as they relate to the gene NR3C3. Furthermore, this
assay endpoint can be referred to as a secondary readout, because this assay has produced multiple assay
endpoints where this one serves as artifact detection function.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Progesterone receptor (PR), a nuclear hormone receptor, plays an important role in development,
metabolic homeostasis and reproduction. It is activated by the steroid hormone progesterone. Endocrine
disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like PR causes disruption of
normal endocrine function.

The Tox21 progesterone receptor (PR) antagonism beta-lactamase follow-up assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to suppress PR-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the assay
and following 16-hour incubation of cells with test compounds a membrane-permeable FRET-based substrate
CCF4-AM is introduced and incubated for an additional hour. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Following CCF4 incubation and detection, luL of CellTiter-Glo reagent is added to each
well and incubated for 30 minutes before cytotoxicity readout is measured on a ViewLux microtiter plate reader.
Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: These channel 1 (chl) and channel 2 (ch2) endpoints are secondary readouts and may not
be informative on their own. They should not be considered bioactivity endpoints but rather supportive
evidence for the ratio results and used in a qualitative sense to understand autofluorescence.

2.3	Experimental System: adherent PR-UAS-bla-HEK293T cell line used. GeneBLAzer PR-UAS-bla GripTite cells
contain the ligand-binding domain of the human progesterone receptor (PR) fused to the DNA-binding domain
of GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Assay Protocol: PR-UAS-bla HEK293T cells were dispensed at 3,000 cells/4 uL of assay medium
containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a Multidrop
Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a 37C and
5% C02 for 6 hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred to the
assay plate by a Pintool station (Kalypsys, San Diego, CA), followed by addition of 1 uL (InM as final
concentration) of Promegestone (R5020) in assay medium using a BioRAPTR FRD (Flying Reagent Dispenser,
Aurora Discovery, San Diego, CA). The plates were incubated at 37C and 5% C02 for 16 hr. Then 1 uL of
LiveBLAzer B/G FRET substrate was added using a FRD. After the plates were incubated at room temperature
for 2 hours, fluorescence intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT).

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

0.00162327108940092 nM
Key positive control:

RU486

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

95.852534562212 nM
Neutral vehicle control:

DMSO

-------
Baseline median absolute deviation for the assay (bmad): 27.271
Response cutoff threshold used to determine hit calls: 81.813
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Progesterone receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of channel 2.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
RU486 was used as a positive PR antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

-------
The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 230

Number of chemicals tested: 230

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
7

Inactive hit count: 0
-------
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-3.364

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

0

29.089
lnf%

-100
3.806

NA

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 16.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2218

T0X2 l_PR_BLA_Fol lowu p_Agonist_ratio

1. General Information

1.1	Assay Title: Tox21 HEK293 Progesterone Receptor (PR) Agonism Beta-lactamase Follow-Up Assay, Ratio

1.2	Assay Summary: TOX21_PR_BLA_Followup is a secondary assay for specificity for the TOX21_PR_BLA_Agonist
assay. To differentiate true PR agonists from cytotoxic substances, this follow up assay is multiplexed with a cell
viability assay. The compounds were also tested for auto fluorescence that may interfere with the biological
target readout resulting in potential false positives and/or negatives. To identify the compounds that stimulate
PR signaling, a PR-UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T (Invitrogen, Carlsbad, CA, USA) containing
a beta-lactamase reporter gene under the control of an UAS response element was used to screen Tox21 10K
compound library. The cytotoxicity of the Tox21 compound library against the PR-bla cell line was tested in
parallel by measuring the cell viability using CellTiter-Glo assay in the same wells.
TOX21_PR_BLA_Followup_Agonist_ratio is an assay readout measuring reporter gene via receptor activity and
designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter gene.
The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) reporter gene substrate used as the
measure of target activity. Data from the assay component TOX21_PR_BLA_Followup_Agonist_ratio was
analyzed into 1 assay endpoint. This assay endpoint, TOX21_PR_BLA_Followup_Agonist_ratio, was analyzed in
the positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a
type of inducible reporter, measures of receptor for gain-of-signal activity can be used to understand the
reporter gene at the pathway-level as they relate to the gene NR3C3. Furthermore, this assay endpoint can be
referred to as a primary readout, because this assay has produced multiple assay endpoints where this one
serves a reporter gene function. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the nuclear receptor intended target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Progesterone receptor (PR), a nuclear hormone receptor, plays an important role in development,
metabolic homeostasis and reproduction. It is activated by the steroid hormone progesterone. Endocrine
disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like PR causes disruption of
normal endocrine function.

The Tox21 progesterone receptor (PR) agonism beta-lactamase follow-up assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to induce PR-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the assay
and following 16-hour incubation of cells with test compounds a membrane-permeable FRET-based substrate
CCF4-AM is introduced and incubated for an additional hour. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Following CCF4 incubation and detection, luL of CellTiter-Glo reagent is added to each
well and incubated for 30 minutes before cytotoxicity readout is measured on a ViewLux microtiter plate reader.
Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Progesterone receptor (PR), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. It is activated by the steroid hormone
progesterone. Endocrine disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like
PR causes disruption of normal endocrine function. To identify the compounds that activate PR signaling, a PR-
UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T (Invitrogen, Carlsbad, CA, USA) containing a beta-lactamase
reporter gene under the control of an Upstream Activator Sequence (UAS) was used to screen Tox21 10K
compound library.

2.3	Experimental System: adherent PR-UAS-bla-HEK293T cell line used. GeneBLAzer PR-UAS-bla GripTite cells
contain the ligand-binding domain of the human progesterone receptor (PR) fused to the DNA-binding domain
of GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Assay Protocol: PR-UAS-bla HEK 293T cells were dispensed at 3,000 cells/5 uL of assay
medium containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a
Multidrop Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a
37C and 5% C02 for 6hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred

-------
to the assay plate by a Pintool station (Kalypsys, San Diego, CA). The plates were incubated at 37C and 5% C02
for 16hr. Then 1 uL of LiveBLAzer B/G FRET substrate was added using a BioRAPTR FRD (Flying Reagent
Dispenser, Aurora Discovery, San Diego, CA). After the plates were incubated at room temperature for 2 hours,
fluorescence intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT).

Baseline median absolute deviation for the assay (bmad): 1.464

Response cutoff threshold used to determine hit calls: 20

Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Progesterone receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

0.00162327108940092 nM
Key positive control:

promegestone (R5020)

I arget (nominal) number of replicates:

3

Standard maximum concentration tested:

95.852534562212 nM
Neutral vehicle control:

DMSO

NA

-------
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
RU486 was used as a positive PR antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);

-------
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 230	Number of chemicals tested: 230

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	17.951

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

0

1.745
lnf%

100
5.298

NA

-------
{(mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 20.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is

-------
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2219

T0X2 l_PR_BLA_Fol lowu p_Antagonist_ratio

1. General Information

1.1	Assay Title: Tox21 HEK293 Progesterone Receptor (PR) Antagonism Beta-lactamase Follow-Up Assay, Ratio

1.2	Assay Summary: TOX21_PR_BLA_Followup is a secondary assay for specificity for the TOX21_PR_BLA_Agonist
assay. To differentiate true PR agonists from cytotoxic substances, this follow up assay is multiplexed with a cell
viability assay. The compounds were also tested for auto fluorescence that may interfere with the biological
target readout resulting in potential false positives and/or negatives. To identify the compounds that stimulate
PR signaling, a PR-UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T (Invitrogen, Carlsbad, CA, USA) containing
a beta-lactamase reporter gene under the control of an UAS response element was used to screen Tox2110K
compound library. The cytotoxicity of the Tox21 compound library against the PR-bla cell line was tested in
parallel by measuring the cell viability using CellTiter-Glo assay in the same wells.
TOX21_PR_BLA_Followup_Antagonist_ratio is an assay readout measuring reporter gene via receptor activity
and designed using inducible reporter (beta lactamase induction) detected with GAL4 b-lactamase reporter
gene. The signal is derived from the ratio of cleaved (ch2) to uncleaved (chl) reporter gene substrate used as
the measure of target activity. Data from the assay component TOX21_PR_BLA_Followup_Antagonist_ratio was
analyzed into 1 assay endpoint. This assay endpoint, TOX21_PR_BLA_Followup_Antagonist_ratio, was analyzed
in the positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using
a type of inducible reporter, measures of receptor for gain-of-signal activity can be used to understand the
reporter gene at the pathway-level as they relate to the gene NR3C3. Furthermore, this assay endpoint can be
referred to as a primary readout, because this assay has produced multiple assay endpoints where this ratio
serves a reporter gene function to understand target activity. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the nuclear receptor intended target family, where the
subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2. Test Method Description

-------
2.1	Purpose: Progesterone receptor (PR), a nuclear hormone receptor, plays an important role in development,
metabolic homeostasis and reproduction. It is activated by the steroid hormone progesterone. Endocrine
disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like PR causes disruption of
normal endocrine function.

The Tox21 progesterone receptor (PR) antagonism beta-lactamase follow-up assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to suppress PR-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the assay
and following 16-hour incubation of cells with test compounds a membrane-permeable FRET-based substrate
CCF4-AM is introduced and incubated for an additional hour. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Following CCF4 incubation and detection, luL of CellTiter-Glo reagent is added to each
well and incubated for 30 minutes before cytotoxicity readout is measured on a ViewLux microtiter plate reader.
Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Progesterone receptor (PR), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. It is activated by the steroid hormone
progesterone. Endocrine disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like
PR causes disruption of normal endocrine function. To identify the compounds that inhibit PR signaling, a PR-
UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T (Invitrogen, Carlsbad, CA, USA) containing a beta-lactamase
reporter gene under the control of an Upstream Activator Sequence (UAS) was used to screen Tox21 10K
compound library.

2.3	Experimental System: adherent PR-UAS-bla-HEK293T cell line used. GeneBLAzer PR-UAS-bla GripTite cells
contain the ligand-binding domain of the human progesterone receptor (PR) fused to the DNA-binding domain
of GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Assay Protocol: PR-UAS-bla HEK293T cells were dispensed at 3,000 cells/4 uLof assay medium
containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a Multidrop
Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a 37C and
5% C02 for 6 hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred to the

-------
assay plate by a Pintool station (Kalypsys, San Diego, CA), followed by addition of 1 uL (InM as final
concentration) of Promegestone (R5020) in assay medium using a BioRAPTR FRD (Flying Reagent Dispenser,
Aurora Discovery, San Diego, CA). The plates were incubated at 37C and 5% C02 for 16 hr. Then 1 uL of
LiveBLAzer B/G FRET substrate was added using a FRD. After the plates were incubated at room temperature
for 2 hours, fluorescence intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT).

Baseline median absolute deviation for the assay (bmad): 8.038
Response cutoff threshold used to determine hit calls: 48.228
Detection technology used: GAL4 b-lactamase reporter gene (Fluorescence)

2.6	Response: Progesterone receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Beta lactamase expression is quantified by measuring the fluorescence ratio of cleaved product
(blue 460 nm emission values) to uncleaved reporter gene substrate (green 530 nm emission values). In the
absence of beta-lactamase, excitation of a donor molecule at 405 nm results in energy transfer by FRET to
fluorescein (acceptor molecule), causing it to emit light in the green region of the spectrum (emission peak at
520 nm). However, exposure to (5-lactamase promotes hydrolysis of the CCF2-AM (5-lactam ring and separates
the 3-fluorescein from the remainder of the substrate. Fluorescence intensity at 405 nm excitation and 460 and
530 nm emissions was measured by a PerkinElmer Envision plate reader. Raw plate reads for each titration point

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

0.00162327108940092 nM
Key positive control:

RU486

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

95.852534562212 nM
Neutral vehicle control:

DMSO

NA

-------
were first normalized relative to the positive control compound (-100% for antagonist mode and 100% for
agonist mode) and DMSO-only wells (0%) as follows: % Activity = ((Vcompound - Vdmso)/(Vpos - Vdmso))x 100
(in which Vcompound denotes the compound well values, Vpos denotes the median value of the positive control
wells, and Vdmso denotes the median values of the DMSO-only wells) and then corrected by applying an NCATS
in-house pattern correction algorithm sing compound-free control plates (i.e., DMSO-only plates) at the
beginning and end of the compound plate stack. Response was reported as a percent of positive control activity.
RU486 was used as a positive PR antagonist control.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag

-------
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 230	Number of chemicals tested: 230

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: Oihitc 0.9	NA hit count: hitc^O

132	94	4

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	61

gain-loss (gnls) model:	34

power(pow) model:	24

linear-polynomial (polyl) model:	32

quadratic-polynomial(poly2) model:	27

exponential-2 (exp2) model:	3

exponential-3 (exp3) model:	2

exponential-4 (exp4) model:	14

exponential-5 (exp5) model:	33

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3 Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In

-------
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-10.39

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

0

8.719
lnf%

-100
5.725

NA

-------
Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 33.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/expjoring-toxcast-data-dqvynloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is

-------
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2220

TOX21_PR_B LA_Fol lowu p_Agon ist_via bi I ity

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HEK293 Progesterone Receptor (PR) Agonism Beta-lactamase
Follow-Up Assay

1.2	Assay Summary: TOX21_PR_BLA_Followup is a secondary assay for specificity for the TOX21_PR_BLA_Agonist
assay. To differentiate true PR agonists from cytotoxic substances, this follow up assay is multiplexed with a cell
viability assay. The compounds were also tested for auto fluorescence that may interfere with the biological
target readout resulting in potential false positives and/or negatives. To identify the compounds that stimulate
PR signaling, a PR-UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T (Invitrogen, Carlsbad, CA, USA) containing
a beta-lactamase reporter gene under the control of an UAS response element was used to screen Tox21 10K
compound library. The cytotoxicity of the Tox21 compound library against the PR-bla cell line was tested in
parallel by measuring the cell viability using CellTiter-Glo assay in the same wells.
TOX21_PR_BLA_Followup_Agonist_viability is a component of the TOX21_PR_BLA_Followup_Agonist assay.
This component measures cell viability using the CellTiter-Glo Luciferase-coupled ATP quantitation.
TOX21_PR_BLA_Followup_Agonist_viability used a type of viability reporter where loss-of-signal activity can be
used to understand changes in the cell viability. Furthermore, this assay endpoint can be referred to as a
secondary readout, because this assay has produced multiple assay endpoints where this one serves a viability
function. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

-------
The Tox21 progesterone receptor (PR) agonism beta-lactamase follow-up assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to induce PR-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the assay
and following 16-hour incubation of cells with test compounds a membrane-permeable FRET-based substrate
CCF4-AM is introduced and incubated for an additional hour. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Following CCF4 incubation and detection, luL of CellTiter-Glo reagent is added to each
well and incubated for 30 minutes before cytotoxicity readout is measured on a ViewLux microtiter plate reader.
Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Progesterone receptor (PR), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. It is activated by the steroid hormone
progesterone. Endocrine disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like
PR causes disruption of normal endocrine function. To identify the compounds that activate PR signaling, a PR-
UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T (Invitrogen, Carlsbad, CA, USA) containing a beta-lactamase
reporter gene under the control of an Upstream Activator Sequence (UAS) was used to screen Tox21 10K
compound library.

2.3	Experimental System: adherent PR-UAS-bla-HEK293T cell line used. GeneBLAzer PR-UAS-bla GripTite cells
contain the ligand-binding domain of the human progesterone receptor (PR) fused to the DNA-binding domain
of GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Assay Protocol: PR-UAS-bla HEK293T cells were dispensed at 3,000 cells/5 uLof assay medium
containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a Multidrop
Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a 37C and
5% C02 for 6 hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred to the
assay plate by a Pintool station (Kalypsys, San Diego, CA). The plates were incubated at 37C and 5% C02 for 16
hr. Then 1 uL of LiveBLAzer B/G FRET substrate was added using a Flying Reagent Dispenser (FRD, Aurora
Discovery, San Diego, CA), the plates were incubated at room temperature for 2 hours, and fluorescence
intensity was measured by an Envision plate reader (PerkinElmer, Shelton, CT). For cell viability readout that
measures cytotoxicity, 4 uL/well of CellTiter-Glo reagent was added into the assay plates using a FRD. After 30

-------
min incubation at room temperature, the luminescence intensity in the plates was measured using a ViewLux
(PerkinElmer) plate reader.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

0.00162327108940092 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

95.852534562212 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 7.862
Response cutoff threshold used to determine hit calls: 47.175

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Progesterone receptor agonism is monitored by FRET emission resulting from GAL4/-beta-lactamase
gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring the cell
viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

NA

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median

-------
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

Number of samples tested: 230

Active hit count: hitc>0.9
8

SAMPLE AND CHEMICAL COVERAGE

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

0

6.084
lnf%

NA
NA

NA

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 22.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2221

T0X2 l_PR_BLA_Fol lowu p_Antagon ist_via bi I ity

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HEK293 Progesterone Receptor (PR) Antagonism Beta-lactamase
Follow-Up Assay

1.2	Assay Summary: TOX21_PR_BLA_Followup is a secondary assay for specificity for the TOX21_PR_BLA_Agonist
assay. To differentiate true PR agonists from cytotoxic substances, this follow up assay is multiplexed with a cell
viability assay. The compounds were also tested for auto fluorescence that may interfere with the biological
target readout resulting in potential false positives and/or negatives. To identify the compounds that stimulate
PR signaling, a PR-UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T (Invitrogen, Carlsbad, CA, USA) containing
a beta-lactamase reporter gene under the control of an UAS response element was used to screen Tox21 10K
compound library. The cytotoxicity of the Tox21 compound library against the PR-bla cell line was tested in
parallel by measuring the cell viability using CellTiter-Glo assay in the same wells.
TOX21_PR_BLA_Followup_Antagonist_viability is a component of the TOX21_PR_BLA_Followup_Antagonist
assay. This component measures cell viability using the CellTiter-Glo Luciferase-coupled ATP quantitation.
TOX21_PR_BLA_Followup_Antagonist_viability used a type of viability reporter where loss-of-signal activity can
be used to understand changes in the cell viability. Furthermore, this assay endpoint can be referred to as a
secondary readout, because this assay has produced multiple assay endpoints where this one serves a viability
function. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

-------
The Tox21 progesterone receptor (PR) antagonism beta-lactamase follow-up assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to suppress PR-dependent
transcription, monitored through bla reporter gene signal activation using a mammalian one-hybrid GAL4
system. The assay is run in triplicate on 1536-well microplates. HEK293T cells are plated the day of the assay
and following 16-hour incubation of cells with test compounds a membrane-permeable FRET-based substrate
CCF4-AM is introduced and incubated for an additional hour. Once in the cell, cytoplasmic esterases trap the
negatively charged CCF4 substrate in the cytosol and bla expression is quantified by measuring the ratio of blue
(460nm, product) to green (530nm, substrate) fluorescence. Fluorescence signals are monitored using an
Envision plate reader. Following CCF4 incubation and detection, luL of CellTiter-Glo reagent is added to each
well and incubated for 30 minutes before cytotoxicity readout is measured on a ViewLux microtiter plate reader.
Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: GeneBLAzer Beta-lactamase Reporter Technology provides a highly accurate, sensitive,
and easy to use method of monitoring cellular responses to drug candidates or other stimuli. The core of the
GeneBLAzer Technology is a Fluorescence Resonance Energy Transfer (FRET) substrate that generates a
ratiometric reporter response with minimal experimental noise. In addition to the two-color (blue/green)
readout of stimulated and unstimulated cells, this ratiometric method reduces the absolute and relative errors
that can mask the underlying biological response of interest. Such errors include variations in cell number,
transfection efficiency, substrate concentration, excitation path length, fluorescence detectors, and volume
changes. The GeneBLAzer Beta-lactamase Reporter Technology has been proven effective in high-throughput
screening (HTS) campaigns for a range of target classes, including G-protein coupled receptors (GPCRs), nuclear
receptors, and kinase signaling pathways. Progesterone receptor (PR), a nuclear hormone receptor, plays an
important role in development, metabolic homeostasis and reproduction. It is activated by the steroid hormone
progesterone. Endocrine disrupting chemicals (EDCs) and their interactions with steroid hormone receptors like
PR causes disruption of normal endocrine function. To identify the compounds that inhibit PR signaling, a PR-
UAS-bla GripTite cell line i.e. PR-UAS-bla HEK293T (Invitrogen, Carlsbad, CA, USA) containing a beta-lactamase
reporter gene under the control of an Upstream Activator Sequence (UAS) was used to screen Tox21 10K
compound library.

2.3	Experimental System: adherent PR-UAS-bla-HEK293T cell line used. GeneBLAzer PR-UAS-bla GripTite cells
contain the ligand-binding domain of the human progesterone receptor (PR) fused to the DNA-binding domain
of GAL4 stably integrated into the cell line. These cells stably express a -beta-lactamase reporter gene under the
transcriptional control of an upstream activator sequence (UAS). When an agonist binds to the LBD of the GAL4-
ERalpha fusion protein, the protein binds to the UAS, resulting in expression of-beta-lactamase. The HEK-293
cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared adenovirus 5
DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated approximately 4.5
kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent cytogenetic
characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells are popular
for their ease of growth and transfection cells and are frequently used to produce exogenous proteins or viruses
for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Assay Protocol: PR-UAS-bla HEK293T cells were dispensed at 3,000 cells/4 uLof assay medium
containing 2% charcoal stripped FBS per well into black wall/clear-bottom 1536-well plates using a Multidrop
Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the assay plates were incubated at a 37C and
5% C02 for 4 hr, 23 nL of compounds dissolved in DMSO, positive controls or DMSO only was transferred to the
assay plate by a Pintool station (Kalypsys, San Diego, CA), followed by addition of 1 uL of R5020 in assay medium
using a Flying Reagent Dispenser (FRD, Aurora Discovery, San Diego, CA). The plates were incubated at 37C and
5% C02 for 16 hr. Then 1 uL of LiveBLAzer B/G FRET substrate was added using a Flying Reagent Dispenser, the
plates were incubated at room temperature for 2 hours, and fluorescence intensity was measured by an Envision
plate reader (PerkinElmer, Shelton, CT). For cell viability readout that measures cytotoxicity, 4 uL/well of

-------
CellTiter-Glo reagent was added into the assay plates using a FRD. After 30 min incubation at room temperature,
the luminescence intensity in the plates was measured using a ViewLux (PerkinElmer) plate reader.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

0.00162327108940092 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

95.852534562212 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 11.08
Response cutoff threshold used to determine hit calls: 66.481

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Progesterone receptor antagonism is monitored by FRET emission resulting from GAL4/-beta-
lactamase gene expression. The cytotoxicity of the compounds screened was measured in parallel by measuring
the cell viability using by CellTiter-Glo Luciferase-coupled ATP quantitation in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

NA

-------
Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median

-------
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 230	Number of chemicals tested: 230

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,

0

7.799
lnf%

NA
NA

NA

-------
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 9.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2222

T0X2 l_PR_LUC_Fol lowu p_Agonist

1.	General Information

1.1	Assay Title: Tox21 HEK293 Progesterone Receptor (PR) Agonism Luciferase Follow-Up Assay

1.2	Assay Summary: TOX21_PR_LUC_Followup is a secondary assay for specificity for the TOX21_PR_BLA_Agonist
assay. To differentiate true PR agonists from cytotoxic substances, this follow up assay is multiplexed with a cell
viability assay. TOX21_PR_LUC_Followup_Agonist is one of one assay component(s) measured or calculated
from the TOX21_PR_LUC_Followup assay. It is designed to make measurements of luciferase induction, a form
of inducible reporter, as detected with bioluminescence signals by CellTiter-Glo Luciferase-coupled ATP
quantitation technology Data from the assay component TOX21_PR_LUC_Followup_Agonist was analyzed into
1 assay endpoint. This assay endpoint, TOX21_PR_LUC_Followup_Agonist, was analyzed in the positive analysis
fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of inducible
reporter, gain-of-signal activity can be used to understand changes in the reporter gene as they relate to the
gene NR3C3. Furthermore, this assay endpoint can be referred to as a primary readout, because the performed
assay has only produced 1 assay endpoint. To generalize the intended target to other relatable targets, this
assay endpoint is annotated to the nuclear receptor target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_PR_LUC_Agonist assay was designed to measure changes to bioluminescence signals
produced from an enzymatic reaction involving the key substrate [One-Glo], Changes are indicative of changes
in transcriptional gene expression due to agonist activity regulated by the human Progesterone receptor (PR)

The Tox21 progesterone receptor (PR) agonism luciferase follow-up assay screened a library of diverse
environmental compounds to probe for xenobiotic progesterone activity and potential to induce progesterone-
dependent transcription, as monitored through luciferase reporter gene signal activation using an PR-luciferase
reporter gene construct. To differentiate true PR antagonists or agonist from cytotoxic substances, the assay is

-------
multiplexed with cell viability assay. The assay is run in triplicate on a 1536-well microplate and bioluminescence
was measured following 16 hour incubation of cells with test compounds and 30 min incubation of test system
with ONE-GloTM luciferase assay reagent. The bioluminescent signal was monitored using a Promega ViewLux
plate reader. Following the incubation period, the cell culture was screened for bioluminescent signals in agonist
mode using luciferase ATP detection technology. Each compound was tested in a concentration-response
format. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Progesterone receptor (PR), a nuclear hormone receptor,
plays an important role in development, metabolic homeostasis and reproduction. It is activated by the steroid
hormone progesterone. Endocrine disrupting chemicals (EDCs) and their interactions with steroid hormone
receptors like PR causes disruption of normal endocrine function.

2.3	Experimental System: adherent HEK293-PR-A cell line used. The HEK293-PR-A-luc cell line was derived from the
human embryonic kidney cell line, HEK-293, by stable transfection with a luciferase reporter gene construct.
The HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated
approximately 4.5 kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent
cytogenetic characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells
are popular for their ease of growth and transfection cells and are frequently used to produce exogenous
proteins or viruses for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Maintain cells below 85-90% confluence in culture medium. For
assay purpose, the cells should be cultured in assay medium with 10% charcoal stripped FBS for 5 days with
alternate day medium changed to fresh medium. Especially while in assay culture, the cells should not reach
more than 85% confluence as they would become harder to detach if they reach over confluence. Assay
Protocol: HEK293-PR-A-luc cells were dispensed at 3,000 cells/5 uL/well of culture medium into white wall/solid-
bottom 1536-well plates using a Multidrop Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the
assay plates were incubated at 37Cand 0% C02 for 5 h, 23 nL of compounds dissolved in DMSO, positive controls
or DMSO only was transferred to the assay plate by a Pintool station (Kalypsys, San Diego, CA). The plates were
incubated at 37C and 0% C02 for 16 h. Then 4 uL of ONE-Glo reagent (Promega, Madison, Wl) was added to
each plate using a BioRAPTR Flying Reagent Dispenser (FRD) workstation (Beckman Coulter, Indianapolis, IN,
USA) and luminescence was quantified on a ViewLux (PerkinElmer, Shelton, CT) plate reader after 30 min
incubation at room temperature.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

0.00162327108940092 nM
Key positive control:

promegestone (R5020)

Baseline median absolute deviation for the assay (bmad): 0.639
Response cutoff threshold used to determine hit calls: 20

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

95.852534562212 nM
Neutral vehicle control:

DMSO

-------
Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Agonism of the progesterone receptor (PR) signaling pathway is measured by bioluminescence
activity via an progesterone-responsive firefly luciferase reporter gene. Increased luciferase activity can be used
to identify the compounds that promote xenobiotic progesterone ligand-binding and PR agonism. The
cytotoxicity of the compounds screened was tested in parallel by measuring the cell viability using CellTiter-
Fluor assay in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Increased luminescence (gain-of-signal) was measured relative to promegestone (R5020)
(positive control) signal, using DMSO (neutral control) as a baseline for luciferase induction. Response was
reported as a percent of positive control activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

-------
Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 230	Number of chemicals tested: 230

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
67

163

0

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

40
64

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

17

17

quadratic-polynomialfpoly2) model:	13

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

3

3

51

22

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	9.55

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 22.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

0
0.68
lnf%

100
10.486

NA

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2223

TOX21_PR_LUC_Followup_Antagonist

1.	General Information

1.1	Assay Title: Tox21 HEK293 Progesterone Receptor (PR) Antagonism Luciferase Follow-Up Assay

1.2	Assay Summary: TOX21_PR_LUC_Followup is a secondary assay for specificity for the TOX21_PR_BLA_Agonist
assay. To differentiate true PR agonists from cytotoxic substances, this follow up assay is multiplexed with a cell
viability assay. TOX21_PR_LUC_Followup_Antagonist is one of one assay component(s) measured or calculated
from the TOX21_PR_LUC_Followup assay. It is designed to make measurements of luciferase induction, a form
of inducible reporter, as detected with bioluminescence signals by CellTiter-Glo Luciferase-coupled ATP
quantitation technology Data from the assay component TOX21_PR_LUC_Followup_Antagonist was analyzed
into 1 assay endpoint. This assay endpoint, TOX21_PR_LUC_Followup_Antagonist, was analyzed in the positive
analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of
inducible reporter, gain-of-signal activity can be used to understand changes in the reporter gene as they relate
to the gene NR3C3. Furthermore, this assay endpoint can be referred to as a primary readout, because the
performed assay has only produced 1 assay endpoint. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor target family, where the subfamily is steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: TOX21_PR_LUC_Antagonist assay was designed to measure changes to bioluminescence signals
produced from an enzymatic reaction involving the key substrate [One-Glo], Changes are indicative of changes
in transcriptional gene expression due to antagonist activity regulated by the human Progesterone receptor (PR)

The Tox21 progesterone receptor (PR) antagonism luciferase follow-up assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to suppress PR-dependent
transcription, monitored luciferase reporter gene signal activation using an PR-luciferase reporter gene
construct. To differentiate true PR antagonists or agonist from cytotoxic substances, the assay is multiplexed

-------
with cell viability assay. The assay is run in triplicate on a 1536-well microplate and bioluminescence was
measured following 16 hour incubation of cells with test compounds and 30 min incubation of test system with
ONE-GloTM luciferase assay reagent. The bioluminescent signal was monitored using a Promega ViewLux plate
reader. Following the incubation period, the cell culture was screened for bioluminescent signals in agonist
mode using luciferase ATP detection technology. Each compound was tested in a concentration-response
format. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Progesterone receptor (PR), a nuclear hormone receptor,
plays an important role in development, metabolic homeostasis and reproduction. It is activated by the steroid
hormone progesterone. Endocrine disrupting chemicals (EDCs) and their interactions with steroid hormone
receptors like PR causes disruption of normal endocrine function.

2.3	Experimental System: adherent HEK293-PR-A cell line used. The HEK293-PR-A-luc cell line was derived from the
human embryonic kidney cell line, HEK-293, by stable transfection with a luciferase reporter gene construct.
The HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated
approximately 4.5 kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent
cytogenetic characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells
are popular for their ease of growth and transfection cells and are frequently used to produce exogenous
proteins or viruses for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Maintain cells below 85-90% confluence in culture medium. For
assay purpose, the cells should be cultured in assay medium with 10% charcoal stripped FBS for 5 days with
alternate day medium changed to fresh medium. Especially while in assay culture, the cells should not reach
more than 85% confluence as they would become harder to detach if they reach over confluence. Assay
Protocol: HEK293-PR-A-luc cells were dispensed at 3,000 cells/5 uL/well of culture medium into white wall/solid-
bottom 1536-well plates using a Multidrop Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the
assay plates were incubated at 37Cand 0% C02 for 5 h, 23 nL of compounds dissolved in DMSO, positive controls
or DMSO only was transferred to the assay plate by a Pintool station (Kalypsys, San Diego, CA). The plates were
incubated at 37C and 0% C02 for 16 h. Then 4 uL of ONE-Glo reagent (Promega, Madison, Wl) was added to
each plate using a BioRAPTR Flying Reagent Dispenser (FRD) workstation (Beckman Coulter, Indianapolis, IN,
USA) and luminescence was quantified on a ViewLux (PerkinElmer, Shelton, CT) plate reader after 30 min
incubation at room temperature.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

0.00162327108940092 nM
Key positive control:

RU486

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

95.852534562212 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 8.261
Response cutoff threshold used to determine hit calls: 49.564

-------
Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6	Response: Antagonism of the progesterone receptor (PR) signaling pathway is measured by bioluminescence
activity via an progesterone-responsive firefly luciferase reporter gene. Decreased luciferase activity can be used
to identify the compounds that suppress xenobiotic progesterone ligand-binding and PR antagonism. The
cytotoxicity of the compounds screened was tested in parallel by measuring the cell viability using CellTiter-
Fluor assay in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Decreased luminescence (loss-of-signal) was measured relative to RU486 (positive control) signal,
using DMSO (neutral control) as a baseline for luciferase induction. Response was reported as a percent of
positive control activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

-------
Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 230	Number of chemicals tested: 230

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
99

117

14

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

43
20

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

26

30

quadratic-polynomialfpoly2) model: 30

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

2

7

39

33

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-13.202

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 33.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

0

7.568
lnf%

-100
0.605

NA

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2224

T0X2 l_PR_LUC_Fol lowu p_Agon ist_via bi I ity

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HEK293 Progesterone Receptor (PR) Agonism Luciferase Follow-
Up Assay

1.2	Assay Summary: TOX21_PR_LUC_Followup is a secondary assay for specificity for the TOX21_PR_BLA_Agonist
assay. To differentiate true PR agonists from cytotoxic substances, this follow up assay is multiplexed with a cell
viability assay. TOX21_PR_LUC_Followup_Agonist_viability is an assay readout measuring cellular ATP content
and	detected	with	CellTiter-Glo	Luciferase-coupled	ATP	quantitation.
TOX21_PR_LUC_Followup_Agonist_viability used a type of viability reporter where loss-of-signal activity can be
used to understand changes in the cell viability. Furthermore, this assay endpoint can be referred to as a
secondary readout, because this assay has produced multiple assay endpoints where this one serves a viability
function. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 progesterone receptor (PR) agonism luciferase follow-up assay screened a library of diverse
environmental compounds to probe for xenobiotic progesterone activity and potential to induce progesterone-
dependent transcription, as monitored through luciferase reporter gene signal activation using an PR-luciferase
reporter gene construct. To differentiate true PR antagonists or agonist from cytotoxic substances, the assay is
multiplexed with cell viability assay. The assay is run in triplicate on a 1536-well microplate and bioluminescence
was measured following 16 hour incubation of cells with test compounds and 30 min incubation of test system

-------
with ONE-GloTM luciferase assay reagent. The bioluminescent signal was monitored using a Promega ViewLux
plate reader. Following the incubation period, the cell culture was screened for bioluminescent signals in agonist
mode using luciferase ATP detection technology. Each compound was tested in a concentration-response
format. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Progesterone receptor (PR), a nuclear hormone receptor,
plays an important role in development, metabolic homeostasis and reproduction. It is activated by the steroid
hormone progesterone. Endocrine disrupting chemicals (EDCs) and their interactions with steroid hormone
receptors like PR causes disruption of normal endocrine function.

2.3	Experimental System: adherent HEK293-PR-A cell line used. The HEK293-PR-A-luc cell line was derived from the
human embryonic kidney cell line, HEK-293, by stable transfection with a luciferase reporter gene construct.
The HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated
approximately 4.5 kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent
cytogenetic characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells
are popular for their ease of growth and transfection cells and are frequently used to produce exogenous
proteins or viruses for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Maintain cells below 85-90% confluence in culture medium. For
assay purpose, the cells should be cultured in assay medium with 10% charcoal stripped FBS for 5 days with
alternate day medium changed to fresh medium. Especially while in assay culture, the cells should not reach
more than 85% confluence as they would become harder to detach if they reach over confluence. Assay
Protocol: HEK293-PR-A-luc cells were dispensed at 3,000 cells/5 uL/well of culture medium into white wall/solid-
bottom 1536-well plates using a Multidrop Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the
assay plates were incubated at 37Cand 0% C02 for 5 h, 23 nL of compounds dissolved in DMSO, positive controls
or DMSO only was transferred to the assay plate by a Pintool station (Kalypsys, San Diego, CA). The plates were
incubated at 37C and 0% C02 for 16 h. Then 4 uL of ONE-Glo reagent (Promega, Madison, Wl) was added to
each plate using a BioRAPTR Flying Reagent Dispenser (FRD) workstation (Beckman Coulter, Indianapolis, IN,
USA) and luminescence was quantified on a ViewLux (PerkinElmer, Shelton, CT) plate reader after 30 min
incubation at room temperature.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

0.00162327108940092 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

95.852534562212 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 5.712
Response cutoff threshold used to determine hit calls: 34.273

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

-------
2.6	Response: Agonism of the progesterone receptor (PR) signaling pathway is measured by bioluminescence
activity via an progesterone-responsive firefly luciferase reporter gene. Increased luciferase activity can be used
to identify the compounds that promote xenobiotic progesterone ligand-binding and PR agonism. The
cytotoxicity of the compounds screened was tested in parallel by measuring the cell viability using CellTiter-
Fluor assay in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 230	Number of chemicals tested: 230

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
2

Inactive hit count: 0
-------
WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

9

30

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

59

13

quadratic-polynomialfpoly2) model:	15

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

2

22

79

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

-------
4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 22.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity

0

10.344
lnf%

NA
NA

NA

-------
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2225

T0X21_PR_LU C_Fol lowu p_Antagon ist_via bi lity

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HEK293 Progesterone Receptor (PR) Antagonism Luciferase
Follow-Up Assay

1.2	Assay Summary: TOX21_PR_LUC_Followup is a secondary assay for specificity for the TOX21_PR_BLA_Agonist
assay. To differentiate true PR agonists from cytotoxic substances, this follow up assay is multiplexed with a cell
viability assay. TOX21_PR_LUC_Followup_Antagonist_viability is an assay readout measuring cellular ATP
content and detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
TOX21_PR_LUC_Followup_Antagonist_viability used a type of viability reporter where loss-of-signal activity can
be used to understand changes in the cell viability. Furthermore, this assay endpoint can be referred to as a
secondary readout, because this assay has produced multiple assay endpoints where this one serves a viability
function. To generalize the intended target to other relatable targets, this assay endpoint is annotated to the
cell cycle intended target family, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HEK293T cells are aliquoted into 1536-well microtiter
plates and incubated with test compounds for 16 hours prior to monitoring fluorescence emission resulting from
PR gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes to bioluminescence signals produced from an enzymatic reaction catalyzed by luciferase
between the key substrate [CellTiter-Glo] and the target cofactor [ATP] are correlated to the viability of the
system.

The Tox21 progesterone receptor (PR) antagonism luciferase follow-up assay screened a library of diverse
environmental compounds to probe for xenobiotic ligand-binding and potential to suppress PR-dependent
transcription, monitored luciferase reporter gene signal activation using an PR-luciferase reporter gene
construct. To differentiate true PR antagonists or agonist from cytotoxic substances, the assay is multiplexed
with cell viability assay. The assay is run in triplicate on a 1536-well microplate and bioluminescence was
measured following 16 hour incubation of cells with test compounds and 30 min incubation of test system with

-------
ONE-GloTM luciferase assay reagent. The bioluminescent signal was monitored using a Promega ViewLux plate
reader. Following the incubation period, the cell culture was screened for bioluminescent signals in agonist
mode using luciferase ATP detection technology. Each compound was tested in a concentration-response
format. Compound auto-fluorescence was monitored in various "TOX21_AutoFluor" assays run at interfering
wavelengths to allow for background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. Progesterone receptor (PR), a nuclear hormone receptor,
plays an important role in development, metabolic homeostasis and reproduction. It is activated by the steroid
hormone progesterone. Endocrine disrupting chemicals (EDCs) and their interactions with steroid hormone
receptors like PR causes disruption of normal endocrine function.

2.3	Experimental System: adherent HEK293-PR-A cell line used. The HEK293-PR-A-luc cell line was derived from the
human embryonic kidney cell line, HEK-293, by stable transfection with a luciferase reporter gene construct.
The HEK-293 cell line is a human embryonic kidney cell line (of unknown parentage) transformed with sheared
adenovirus 5 DNA by Frank Graham in 1973 (Graham et al. 1977). The transformation incorporated
approximately 4.5 kilobases from the viral genome into human chromosome 19 of the HEK cells, and subsequent
cytogenetic characterization established that the 293 line is pseudotriploid (Bylund et al. 2004). HEK293 cells
are popular for their ease of growth and transfection cells and are frequently used to produce exogenous
proteins or viruses for pharmaceutical and biomedical research purposes.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Quality Control Precautions: Maintain cells below 85-90% confluence in culture medium. For
assay purpose, the cells should be cultured in assay medium with 10% charcoal stripped FBS for 5 days with
alternate day medium changed to fresh medium. Especially while in assay culture, the cells should not reach
more than 85% confluence as they would become harder to detach if they reach over confluence. Assay
Protocol: HEK293-PR-A-luc cells were dispensed at 3,000 cells/5 uL/well of culture medium into white wall/solid-
bottom 1536-well plates using a Multidrop Combi (ThermoFisher Scientific, Waltham, MA) dispenser. After the
assay plates were incubated at 37Cand 0% C02 for 5 h, 23 nL of compounds dissolved in DMSO, positive controls
or DMSO only was transferred to the assay plate by a Pintool station (Kalypsys, San Diego, CA). The plates were
incubated at 37C and 0% C02 for 16 h. Then 4 uL of ONE-Glo reagent (Promega, Madison, Wl) was added to
each plate using a BioRAPTR Flying Reagent Dispenser (FRD) workstation (Beckman Coulter, Indianapolis, IN,
USA) and luminescence was quantified on a ViewLux (PerkinElmer, Shelton, CT) plate reader after 30 min
incubation at room temperature.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
11

Standard minimum concentration tested:

0.00162327108940092 nM
Key positive control:

tetraoctylammonium bromide

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

95.852534562212 nM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 5.987
Response cutoff threshold used to determine hit calls: 35.921

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

-------
2.6	Response: Antagonism of the progesterone receptor (PR) signaling pathway is measured by bioluminescence
activity via an progesterone-responsive firefly luciferase reporter gene. Decreased luciferase activity can be used
to identify the compounds that suppress xenobiotic progesterone ligand-binding and PR antagonism. The
cytotoxicity of the compounds screened was tested in parallel by measuring the cell viability using CellTiter-
Fluor assay in the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 230	Number of chemicals tested: 230

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

10
24

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

9

81

quadratic-polynomialfpoly2) model: 23

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

2

9

71

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

-------
4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed)/sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 9.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity

0

4.918
lnf%

NA
NA

NA

-------
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: NA

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2362

T0X21_PX R_ LU C_Ago n i st_v lability

1.	General Information

1.1	Assay Title: Viability Assessment in the Tox21 HepG2 Pregnane X Receptor (hPXR) Agonism Luciferase Assay

1.2	Assay Summary: TOX21 PXR LUC Agonist is a cell-based, single-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing on a microplate: 1536-well plate. To screen
Tox21 libraries for compounds that activate hPXR-mediated CYP3A4 gene expression, PXR-Luc HepG2 cells were
used. Cytotoxicity of the compounds was tested in parallel by measuring the cell viability using CellTiter-Fluor
assay system in the same wells. See tox21-pxr-pl. TOX21_PXR_LUC_Agonist_viability is a component of the
TOX21_PXR_LUC_Agonist assay. This component measures cell viability using the CellTiter-Fluor assay system.
The assay component endpoint TOX21_PXR_LUC_Agonist_viability was analyzed with bidirectional fitting
relation to DMSO as the negative control and baseline of activity. Using a type of viability reporter, loss-of-signal
activity can be used to understand changes in viability. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to cell cycle, where the subfamily is cytotoxicity.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HepG2 were dispensed at 2000 cells/5 uL/well into white
wall/solid bottom 1536-well plates using a Multidrop Combi (Thermo Fisher Scientific Inc., Waltham, MA)
dispenser. After the assay plates were incubated at 37C for 5 hours, 23 nL of compound or DMSO vehicle was
transferred to the assay plates by a pintool station (Kalypsys, San Diego, CA). The plates were incubated at 37C
for 24 hours prior to monitoring fluorescence emission resulting from PXR-mediated cytochrome P450 3A4
(CYP3A4) gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: The human pregnane X receptor (hPXR) regulates the expression of several drug metabolizing
enzymes and induction of these proteins is a major mechanism for developing drug resistance in cancer. One
such key enzyme catalyzing the drug metabolism is cytochrome P450 3A4 (CYP3A4). Changes in CYP3A4
expression affect drug metabolism, thereby reducing the therapeutic efficacy and altering toxicological
response to a drug and which finally causes adverse drug interaction.

-------
The Tox21 pregnane x receptor (hPXR) agonism luciferase assay screened a library of diverse environmental
compounds to probe for xenobiotic ligand-binding and potential to activate hPXR-mediated CYP3A4 gene
expression, monitored through luciferase reporter gene signal activation. HepG2 (human hepatocellular
carcinoma) cells were stably transfected with an pregnane x receptor responsive firefly luciferase reporter gene
plasmid. Increased luciferase activity can be used to identify the compounds that induce hPXR-mediated CYP3A4
gene expression. To differentiate true hPXR agonist from cytotoxic substances, the assay is multiplexed with cell
viability assay. The assay is run in triplicate on 1536-well microplate and bioluminescence was measured
following 24-hour incubation of cells with test compounds and 30 min incubation of test system with ONE-
GloTM luciferase assay reagent. The bioluminescent signal was monitored using a Promega ViewLux plate
reader. Following the incubation period, the cell culture was screened for bioluminescent signals in agonist
mode using luciferase detection technology. Compound auto-fluorescence was monitored in various
"TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. The human pregnane X receptor (hPXR) regulates the
expression of several drug metabolizing enzymes and induction of these proteins is a major mechanism for
developing drug resistance in cancer. One such key enzyme catalyzing the drug metabolism is cytochrome P450
3A4 (CYP3A4). Changes in CYP3A4 expression affect drug metabolism, thereby reducing the therapeutic efficacy
and altering toxicological response to a drug and which finally causes adverse drug interaction.

2.3	Experimental System: adherent HepG2 cell line used. A cell based PXR-Luc HepG2 cell assay (cell line provided
by Dr. Taosheng Chen Department of Chemical Biology and Therapeutics, St. Jude Children's Research Hospital),
was used to assess the activation of PXR. HepG2 (human hepatocellular carcinoma) cells were stably transfected
with a CYP3A4-luc promoter construct and hPXR expression plasmid.

2.4	Metabolic Competence: The parental HepG2 cell line has been shown by others to retain the potential for Phase
I and Phase II metabolic responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A,
2E1, and 3A4/5 with CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human
hepatocytes although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have
even been observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2
cells include SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7).
In addition, HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53
protein (Boehme et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant
response element (ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a
number of ATP-binding cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound
proteins also regulated in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Quality Control Precautions: Maintain cells below 85-90% confluence in culture medium. Cell
Thawing Method: Thaw a vial of cells in 9ml of pre-warmed thaw medium and then centrifuge. Re-suspend the
pellet with the thaw medium and seed at 2 million cells per T-75 flask. Cell Proliferation Method: Trypsinize
cells from the culturing flask and centrifuge and then re-suspend cells in culture medium. Passage cells at 2-3
million per T-225 flask. Assay Protocol: Trypsinize cells from the culturing flask and centrifuge and then re-
suspend cells in assay medium at a density of 0.4 X 10A6 cells/mL. Dispense 2000 cells/5uL/well into 1536-well
tissue treated white/solid bottom plates using a 8 tip dispenser (Multidrop). Incubate the plates for 5hr at 37C
and 5% C02. Transfer 23nL of compounds from the library collection (0.59nM to 92uM) and positive control
through Pintool. Incubate the plates for 24hr at 37C and 5% C02. After 23hrs of incubation at 37C, add lul of
CellTiter-Fluor using single tip dispense (BioRAPTR). Incubate the plates for lhr at 37C and 5% C02. Read

-------
fluorescence (exposure time = 3sec) intensity using ViewLux plate reader. Then add 4ul of ONE-Glo(TM)
Luciferase reagent using a single tip dispense (BioRAPTR). Incubate the plates at room temperature for 30min.
Read fluorescence (exposure time = 45sec) intensity using ViewLux plate reader.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
15

Standard minimum concentration tested:

0.0117795391705069 nM
Key positive control:

46 uM rifampicin

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

920.276497695853 uM
Neutral vehicle control:

DMSO

Baseline median absolute deviation for the assay (bmad): 2.432
Response cutoff threshold used to determine hit calls: 20
Detection technology used: Cell Titer Fluor assay (Fluorescence)

2.6	Response: Agonism of the pregnane x receptor (hPXR) signaling pathway is measured by bioluminescence
activity via an pregnane-responsive firefly luciferase reporter gene. Increased luciferase activity can be used to
identify the compounds that promote xenobiotic pregnane ligand-binding and hPXR agonism. The cytotoxicity
of the compounds screened was tested in parallel by measuring the cell viability using CellTiter-Fluor assay in
the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

Cytotoxicity Burst: Assays used to defne the cytotoxicity burst region

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cellular ATP content was detected with CellTiter-Glo Luciferase-coupled ATP quantitation.
Decreased luminescence (loss-of-signal) was measured relative to tetraoctylammonium bromide (positive
control) signal, using DMSO (neutral control) as a baseline for cell viability. Response was reported as a percent
activity.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by

-------
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 27: ow_bidirectional_loss (Multiply winning model hitcall
(hitc) by -1 for models fit in the positive analysis direction. Typically used for endpoints where only
negative responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest

-------
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: 0
-------
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-2.967

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

0

5.546
lnf%

-31.246
8.611

NA

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 639.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Lynch C, Sakamuru S, Huang R, Niebler J, Ferguson SS, Xia M. Characterization of human pregnane
X receptor activators identified from a screening of the Tox21 compound library. Biochem Pharmacol. 2021
Feb;184:114368. doi: 10.1016/j.bcp.2020.114368. Epub 2020 Dec 14. PMID: 33333074., Lynch C, Mackowiak B,
Huang R, Li L, Heyward S, Sakamuru S, Wang H, Xia M. Identification of Modulators That Activate the Constitutive
Androstane Receptor From the Tox21 10K Compound Library. Toxicol Sci. 2019 Jan l;167(l):282-292. doi:
10.1093/toxsci/kfy242. PMID: 30247703; PMCID: PMC6657574.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
-------
Assay Endpoint ID: 2363

T0X21_PXR_LU C_Agon ist

1.	General Information

1.1	Assay Title: Tox21 HepG2 Pregnane X Receptor (hPXR) Agonism Luciferase Assay

1.2	Assay Summary: TOX21 PXR LUC Agonist is a cell-based, single-readout assay that uses HepG2, a human liver
cell line, with measurements taken at 24 hours after chemical dosing on a microplate: 1536-well plate. To screen
Tox21 libraries for compounds that activate hPXR-mediated CYP3A4 gene expression, PXR-Luc HepG2 cells were
used. Cytotoxicity of the compounds was tested in parallel by measuring the cell viability using CellTiter-Fluor
assay system in the same wells. See tox21-pxr-pl. TOX21_PXR_LUC_Agonist is one of one assay component(s)
measured or calculated from the TOX21_PXR_LUC_Agonist assay. It is designed to make measurements of
luciferase induction, a form of inducible reporter, as detected with bioluminescence signals by CellTiter-Glo
Luciferase-coupled ATP quantitation technology. Data from the assay component TOX21_PXR_LUC_Agonist was
analyzed into 1 assay endpoint. This assay endpoint, TOX21_PXR_LUC_Agonist, was analyzed in the positive
analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of
inducible reporter, gain-of-signal activity can be used to understand changes in the reporter gene as they relate
to gene NR1I2. Furthermore, this assay endpoint can be referred to as a primary readout, because the
performed assay has only produced 1 assay endpoint. To generalize the intended target to other relatable
targets, this assay endpoint is annotated to the nuclear receptor target family, where the subfamily is non-
steroidal.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Tox21 is an interagency agreement between the NIH, NTP, FDA and EPA. NIH Chemical Genomics
Center (NCGC) is the primary screening facility running ultra high-throughput screening assays across a large
interagency-developed chemical library.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: This assay is not proprietary. The Tox21 qHTS robotic platform has a 1536-well per run
capacity and is capable of fully-automated, hands-free execution (liquid dispensing and aspiration, plate
centrifugation and incubation) and signal recording (plate readout). The GeneBLAzer System is publically
available through Invitrogen.

1.9	Assay Throughput: 1536-well plate. Stably transfected HepG2 were dispensed at 2000 cells/5 uL/well into white
wall/solid bottom 1536-well plates using a Multidrop Combi (Thermo Fisher Scientific Inc., Waltham, MA)
dispenser. After the assay plates were incubated at 37C for 5 hours, 23 nL of compound or DMSO vehicle was
transferred to the assay plates by a pintool station (Kalypsys, San Diego, CA). The plates were incubated at 37C
for 24 hours prior to monitoring fluorescence emission resulting from PXR-mediated cytochrome P450 3A4
(CYP3A4) gene expression.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: The human pregnane X receptor (hPXR) regulates the expression of several drug metabolizing
enzymes and induction of these proteins is a major mechanism for developing drug resistance in cancer. One

-------
such key enzyme catalyzing the drug metabolism is cytochrome P450 3A4 (CYP3A4). Changes in CYP3A4
expression affect drug metabolism, thereby reducing the therapeutic efficacy and altering toxicological
response to a drug and which finally causes adverse drug interaction.

The Tox21 pregnane x receptor (hPXR) agonism luciferase assay screened a library of diverse environmental
compounds to probe for xenobiotic ligand-binding and potential to activate hPXR-mediated CYP3A4 gene
expression, monitored through luciferase reporter gene signal activation. HepG2 (human hepatocellular
carcinoma) cells were stably transfected with an pregnane x receptor responsive firefly luciferase reporter gene
plasmid. Increased luciferase activity can be used to identify the compounds that induce hPXR-mediated CYP3A4
gene expression. To differentiate true hPXR agonist from cytotoxic substances, the assay is multiplexed with cell
viability assay. The assay is run in triplicate on 1536-well microplate and bioluminescence was measured
following 24-hour incubation of cells with test compounds and 30 min incubation of test system with ONE-
GloTM luciferase assay reagent. The bioluminescent signal was monitored using a Promega ViewLux plate
reader. Following the incubation period, the cell culture was screened for bioluminescent signals in agonist
mode using luciferase detection technology. Compound auto-fluorescence was monitored in various
"TOX21_AutoFluor" assays run at interfering wavelengths to allow for background artifact detection.

2.2	Scientific Principles: Luciferase reporter-gene assays are a commonly used bioluminescence assay. The
construct includes a promoter region of a gene of interest followed by a luciferase gene. When this is introduced
into a cell, luciferase is expressed in quantities that are proportional to the promoter activity. The luciferase
(and hence the promoter activity) can then be quantified by the measurement of the luminescence produced
when the enzyme substrate is added. In this way, the transcriptional activity of the gene of interest (i.e., its
expression) can be measured in response to the effects of different modulators of the relevant signaling
pathways. The luciferase reaction can also be used in combination with constitutively active promoters, to
investigate cytotoxicity or transfection efficiency. The human pregnane X receptor (hPXR) regulates the
expression of several drug metabolizing enzymes and induction of these proteins is a major mechanism for
developing drug resistance in cancer. One such key enzyme catalyzing the drug metabolism is cytochrome P450
3A4 (CYP3A4). Changes in CYP3A4 expression affect drug metabolism, thereby reducing the therapeutic efficacy
and altering toxicological response to a drug and which finally causes adverse drug interaction.

2.3	Experimental System: adherent HepG2 cell line used. A cell based PXR-Luc HepG2 cell assay (cell line provided
by Dr. Taosheng Chen Department of Chemical Biology and Therapeutics, St. Jude Children's Research Hospital),
was used to assess the activation of PXR. HepG2 (human hepatocellular carcinoma) cells were stably transfected
with a CYP3A4-luc promoter construct and hPXR expression plasmid.

2.4	Metabolic Competence: The parental HepG2 cell line has been shown by others to retain the potential for Phase
I and Phase II metabolic responses to xenobiotics, e.g., expression of CYP1A1/2, 2A6, 2B6, 2C8/9, 2C19, 2D6/3A,
2E1, and 3A4/5 with CYP1A2, CYP2C9, CYP2D6, CYP2E1 and CYP3A activities reported at levels similar to human
hepatocytes although variable depending on source and culture conditions; some enzymes (e.g., CYP2W1) have
even been observed at higher rates than in primary hepatocytes. Phase II enzyme activities identified in HepG2
cells include SULTS (1A1,1A2,1E1 and 2A1), GSTs (mGST-1, GST ul), NAT1, EPHX1 and UGTs (1A1,1A6 and 2B7).
In addition, HepG2 cells can potentially express xenobiotic regulation activities via functionally active p53
protein (Boehme et al. 2010) and Nrf2, a transcription factor which regulates genes containing antioxidant
response element (ARE) sequences in their promoters; HepG2 cells also possess the capacity to express a
number of ATP-binding cassette (ABC) xenobiotic export pumps (e.g., ABCC1, C2, C3 and G2 membrane-bound
proteins also regulated in part by Nrf2 TF DNA-binding).

2.5	Exposure Regime: Quality Control Precautions: Maintain cells below 85-90% confluence in culture medium. Cell
Thawing Method: Thaw a vial of cells in 9ml of pre-warmed thaw medium and then centrifuge. Re-suspend the
pellet with the thaw medium and seed at 2 million cells per T-75 flask. Cell Proliferation Method: Trypsinize
cells from the culturing flask and centrifuge and then re-suspend cells in culture medium. Passage cells at 2-3
million per T-225 flask. Assay Protocol: Trypsinize cells from the culturing flask and centrifuge and then re-
suspend cells in assay medium at a density of 0.4 X 10A6 cells/mL. Dispense 2000 cells/5uL/well into 1536-well

-------
tissue treated white/solid bottom plates using a 8 tip dispenser (Multidrop). Incubate the plates for 5hr at 37C
and 5% C02. Transfer 23nL of compounds from the library collection (0.59nM to 92uM) and positive control
through Pintool. Incubate the plates for 24hr at 37C and 5% C02. After 23hrs of incubation at 37C, add lul of
CellTiter-Fluor using single tip dispense (BioRAPTR). Incubate the plates for lhr at 37C and 5% C02. Read
fluorescence (exposure time = 3sec) intensity using ViewLux plate reader. Then add 4ul of ONE-Glo(TM)
Luciferase reagent using a single tip dispense (BioRAPTR). Incubate the plates at room temperature for 30min.
Read fluorescence (exposure time = 45sec) intensity using ViewLux plate reader.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:	Target (nominal) number of replicates:

15	3

Standard minimum concentration tested:	Standard maximum concentration tested:

0.0117795391705069 nM	920.276497695853 nM

Key positive control:	Neutral vehicle control:

46 uM rifampicin	DMSO

Baseline median absolute deviation for the assay (bmad): 1.806
Response cutoff threshold used to determine hit calls: 20

Detection technology used: CellTiter-Glo Luciferase-coupled ATP quantitation (Luminescence)

2.6 Response: Agonism of the pregnane x receptor (hPXR) signaling pathway is measured by bioluminescence
activity via an pregnane-responsive firefly luciferase reporter gene. Increased luciferase activity can be used to
identify the compounds that promote xenobiotic pregnane ligand-binding and hPXR agonism. The cytotoxicity
of the compounds screened was tested in parallel by measuring the cell viability using CellTiter-Fluor assay in
the same wells.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of nuclear receptor.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Increased luminescence (gain-of-signal) was measured relative to 46 uM rifampicin (positive
control) signal, using DMSO (neutral control) as a baseline for luciferase induction. Response was reported as a
percent of positive control activity.

-------
Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

1: none (Set the corrected response value (cval) as the normalized response value (resp); cval = resp. No
additional mc3 methods needed for endpoint-specific normalization.)

Level 4: Baseline and required tcplFit2 parameters defined by:

1: bmad.aeid.lowconc.twells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (rep) for test compound wells (wilt = t) with
concentration index (cndx) equal to 1 or 2. Calculate one standard deviation of the normalized response
for test compound wells (wilt = t) with a concentration index (cndx) of 1 or 2; onesd = sqrt(sum((resp -
mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

2: pc20 (Add a cutoff value of 20. Typically for percent of control data.), 6: bmad6 (Add a cutoff value of
6 multiplied by the baseline median absolute deviation (bmad). By default, bmad is calculated using test
compound wells (wilt = t) for the endpoint.), 28: ow_bidirectional_gain (Multiply winning model hitcall
(hitc) by -1 for models fit in the negative analysis direction. Typically used for endpoints where only
positive responses are biologically relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical

-------
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 9667	Number of chemicals tested: 7871

ACTIVITY HIT CALLS

Active hit count: hitc>0.9	Inactive hit count: Oihitc 0.9	NA hit count: hitc^O

2759	5767	1141

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:	450

gain-loss (gnls) model:	284

power(pow) model:	1040

linear-polynomial (polyl) model:	3278

quadratic-polynomial(poly2) model:	1009

exponential-2 (exp2) model:	578

exponential-3 (exp3) model:	245

exponential-4 (exp4) model:	2228

exponential-5 (exp5) model:	555

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3 Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.

-------
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	7.689

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

0
4.08
lnf%

100
12.034

NA

-------
4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 555.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Lynch C, Sakamuru S, Huang R, Niebler J, Ferguson SS, Xia M. Characterization of human pregnane
X receptor activators identified from a screening of the Tox21 compound library. Biochem Pharmacol. 2021
Feb;184:114368. doi: 10.1016/j.bcp.2020.114368. Epub 2020 Dec 14. PMID: 33333074., Lynch C, Mackowiak B,
Huang R, Li L, Heyward S, Sakamuru S, Wang H, Xia M. Identification of Modulators That Activate the Constitutive
Androstane Receptor From the Tox21 10K Compound Library. Toxicol Sci. 2019 Jan l;167(l):282-292. doi:
10.1093/toxsci/kfy242. PMID: 30247703; PMCID: PMC6657574.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
-------
Assay Endpoint ID: 2545

U KN5_HCS_SBAD2_neu rite_outgrowth

1.	General Information

1.1	Assay Title: University of Konstanz (UKN) Leist Lab Neurite Outgrowth Assay in human iPSC-derived immature
Dorsal Root Ganglia Neurons (UKN5)

1.2	Assay Summary: UKN5_HCS_SBAD2 is a cell-based, multiplexed-readout assay screening for neurite outgrowth
and cell viability that uses SBAD2 (peripheral neurons differentiated from iPSC), a human peripheral nervous
system cell line, with measurements taken at 3 hours after chemical dosing in a microplate: 96-well plate. UKN5,
also referred to as PeriTox, is an assay that uses human iPSC line SBAD2 as a model of peripheral neurons.
Following 24 hr chemical exposures in multi-well plates, neurite area is evaluated as a marker of neurite
outgrowth using high-content imaging of cells stained with calcein-AM. Cell viability is assessed using stain
Hoechst H-33342. UKN5_HCS_SBAD2_neurite_outgrowth is an assay component measured from the
UKN5_HCS_SBAD2 assay. It is designed to make measurements of neurite outgrowth, a form of morphology
reporter, as detected with Fluorescence intensity signals by HCS Fluorescent Imaging. The assay endpoint
UKN5_HCS_SBAD2_neurite_outgrowth was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of morphology reporter, gain or loss-of-signal activity can be used
to understand developmental effects. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the neurodevelopment intended target family, where the subfamily is neurite
outgrowth.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Leist research group at the University of Konstanz conducts high content imaging assays for
neurotoxicity.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human induced pluripotent stem cell line iPSC EPITHELIAL-1 (Cat# IPSC0028) is
purchased from Sigma-Aldrich, Taufkirchen, Germany as a frozen suspension of single cells.

1.9	Assay Throughput: 96-well plate. The assay uses a 96 well-plate with 3 compounds per plate, 6 different
concentrations of each compound per plate, 3 technical replicates per plate representing one biological
replicate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the neurite area are indicative of neurodevelopment.

The Neurite Outgrowth Assay (UKN5) is designed to investigate changes in neurite outgrowth (NOG) in response
to chemical exposure in immature dorsal root ganglia (iDRG) neurons using a high-content screening (HCS)
technology. Neurite outgrowth is one of several key processes of neurodevelopment. The assay includes a
parallel viability assessment to measure changes in cell viability by staining of Hoechst H-33342.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental

-------
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together. This assay assesses disturbances in the
development of the peripheral nervous system.

2.3	Experimental System: adherent SBAD2 (peripheral neurons differentiated from iPSC) cell line used. The human
induced pluripotent stem cell line iPSC EPITHELIAL-1 (Cat IPSC0028) is purchased from Sigma-Aldrich,
Taufkirchen, Germany as a frozen suspension of single cells. iPSC EPITHELIAL-1 cells are produced via
reprogramming of epithelial cells from a Caucasian female (24 years) using OSKM retrovirus. Pluripotency was
certified by gene and protein expression of pluripotency markers. The maintenance culture is cultured in
colonies under feeder-free conditions on Laminin-521 coating in Essential 8 (E8) medium. The cells are split
weekly.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: The previously differentiated immature peripheral neurons are thawed and seeded on
Matrigel coated plates (1:40 diluted) in 75 ul medium composed of 75% N2-S medium and 25% KSR medium,
supplemented with CHIR99021 (1.5 uM), SU5402 (5 uM) and DAPT (y-Secretase inhibitor IX, 5 uM) at a density
of 100.000 cells/cm2. One hour after seeding, treatment compounds are added to the cells in 25 ul of similar to
culture medium in which cells were seeded. 23 h after toxicant application, cells are live-stained with H-33342
and calcein-AM and incubated for 60 min. After 24 h of treatment (including staining), the cells are imaged using
a high-content microscope (Cellomics VTI Array Scan).

Baseline median absolute deviation for the assay (bmad): 5.958
Response cutoff threshold used to determine hit calls: 23.831
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content screening to describe neurite outgrowth in human iPSC-derived
immature dorsal root ganglia (iDRG) neurons, via staining with Hoechst-33342 and calcein-AM. The cells are
imaged using a high-content microscope (Cellomics VTI Array Scan). Changes in neurite area are measured.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

41.1522633744856 uM
Key positive control:

Narciclasine

Target (nominal) number of replicates:

15

Standard maximum concentration tested:

10000 uM
Neutral vehicle control:

DMSO

-------
DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: An automated microplate reading microscope (Array-Scanll HCS Reader, Cellomics, PA) equipped
with a Hamamatsu ORCA-ER camera (resolution 1024 x 1024; run at 2 x 2 binning) was used for image
acquisition. Ten fields per well were imaged. Images were recorded in 2 channels using a 20x objective and
excitation/emission wavelengths of 365 ± 50/535 ± 45 to detect H-33342 in channel 1 and 474 ± 40/535 ± 45 to
detect calcein in channel 2. In both channels, a fixed exposure time and an intensity histogram-derived threshold
were used for object identification. Neurite pixels were identified using the following image analysis algorithm:
nuclei were identified as objects in channel 1 according to their size, area, shape, and intensity which were
predefined on untreated cells using a machine-based learning algorithm, and manual selection of nuclei to be
classified as intact. The nuclear outlines were expanded by 3.2 um in each direction, to define a virtual cell soma
area (VCSA) based on the following procedure: All calcein-positive pixels of the field (beyond a given intensity
threshold) were defined as viable cellular structures (VCSs). The threshold was dynamically determined for each
field after flat field and background correction and intensity normalization to 512 gray values and was set to
12% of the maximal brightness (channel 63 of 512). The VCS defines the sum of all somata and neurites without
their assignment to individual cells. In an automatic calculation, the VCSAs, defined in the H-33342 channel,
were used as filter in the calcein channel and subtracted from the VCS. The remaining pixels (VCS - VCSA) in the
calcein channel were defined as neurite area.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

-------
2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

25: bmad4 (Add a cutoff value of 4 multiplied the baseline median absolute deviation (bmad). By default,
bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 71	Number of chemicals tested: 71

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
29

Inactive hit count: 0
-------
quadratic-polynomialfpoly2) model:	11

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

4

0

3

12

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

313126.39

Neutral control median absolute deviation, by plate: nmad

16626.412

-------
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.7%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	151649.19

Positive control well median absolute deviation, by plate: pmad	9236.116

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-5.406

((pmed - nmed) / sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 4.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are

-------
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Krug AK, Balmer NV, Matt F, Schonenberger F, Merhof D, Leist M. Evaluation of a human neurite
growth assay as specific screen for developmental neurotoxicants. Arch Toxicol. 2013 Dec;87(12):2215-31. doi:
10.1007/s00204-013-1072-y. Epub 2013 May 14. PMID: 23670202., Hoelting L, Klima S, Karreman C, Grinberg
M, Meisig J, Henry M, Rotshteyn T, Rahnenfiihrer J, Bliithgen N, Sachinidis A, Waldmann T, Leist M. Stem Cell-
Derived Immature Human Dorsal Root Ganglia Neurons to Identify Peripheral Neurotoxicants. Stem Cells Transl
Med. 2016 Apr;5(4):476-87. doi: 10.5966/sctm.2015-0108. Epub 2016 Mar 1. PMID: 26933043; PMCID:
PMC4798731., Krebs A, van Vugt-Lussenburg BMA, Waldmann T, Albrecht W, Boei J, Ter Braak B, Brajnik M,
Braunbeck T, Brecklinghaus T, Busquet F, Dinnyes A, Dokler J, Dolde X, Exner TE, Fisher C, Fluri D, Forsby A,
Hengstler JG, Holzer AK, Janstova Z, Jennings P, Kisitu J, Kobolak J, Kumar M, Limonciel A, Lundqvist J, Mihalik B,
Moritz W, Pallocca G, Ulloa APC, Pastor M, Rovida C, Sarkans U, Schimming JP, Schmidt BZ, Stober R, Strassfeld
T, van de Water B, Wilmes A, van der Burg B, Verfaillie CM, von Hellfeld R, Vrieling H, Vrijenhoek NG, Leist M.
The EU-ToxRisk method documentation, data processing and chemical testing pipeline for the regulatory use of
new approach methods. Arch Toxicol. 2020 Jul;94(7):2435-2461. doi: 10.1007/s00204-020-02802-6. Epub 2020
Jul 6. PMID: 32632539; PMCID: PMC7367925.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2547

U KN5_HCS_SBAD2_cell_viability

1.	General Information

1.1	Assay Title: Viability Assessment in the University of Konstanz (UKN) Leist Lab Neurite Outgrowth Assay in
human iPSC-derived immature Dorsal Root Ganglia Neurons (UKN5)

1.2	Assay Summary: UKN5_HCS_SBAD2 is a cell-based, multiplexed-readout assay screening for neurite outgrowth
and cell viability that uses SBAD2 (peripheral neurons differentiated from iPSC), a human peripheral nervous
system cell line, with measurements taken at 3 hours after chemical dosing in a microplate: 96-well plate. UKN5,
also referred to as PeriTox, is an assay that uses human iPSC line SBAD2 as a model of peripheral neurons.
Following 24 hr chemical exposures in multi-well plates, neurite area is evaluated as a marker of neurite
outgrowth using high-content imaging of cells stained with calcein-AM. Cell viability is assessed using stain
Hoechst H-33342. UKN5_HCS_SBAD2_cell_viability is an assay component measured from the
UKN5_HCS_SBAD2 assay. It is designed to make measurements of enzyme activity, a form of viability reporter,
as detected with Fluorescence intensity signals by HCS Fluorescent Imaging. The assay endpoint
UKN5_HCS_SBAD2_cell_viability was analyzed with bidirectional fitting relative to DMSO as the negative control
and baseline of activity. Using a type of viability reporter, gain or loss-of-signal activity can be used to understand
viability effects. To generalize the intended target to other relatable targets, this assay endpoint is annotated to
the cell cycle intended target family, where the subfamily is viability.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Leist research group at the University of Konstanz conducts high content imaging assays for
neurotoxicity.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human induced pluripotent stem cell line iPSC EPITHELIAL-1 (Cat# IPSC0028) is
purchased from Sigma-Aldrich, Taufkirchen, Germany as a frozen suspension of single cells.

1.9	Assay Throughput: 96-well plate. The assay uses a 96 well-plate with 3 compounds per plate, 6 different
concentrations of each compound per plate, 3 technical replicates per plate representing one biological
replicate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the number of Hoechst labelled nuclei are indicative of viability.

The Neurite Outgrowth Assay (UKN5) is designed to investigate changes in neurite outgrowth (NOG) in response
to chemical exposure in immature dorsal root ganglia (iDRG) neurons using a high-content screening (HCS)
technology. Neurite outgrowth is one of several key processes of neurodevelopment. The assay includes a
parallel viability assessment to measure changes in cell viability by staining of Hoechst H-33342.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these

-------
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together. This assay assesses disturbances in the
development of the peripheral nervous system.

2.3	Experimental System: adherent SBAD2 (peripheral neurons differentiated from iPSC) cell line used. The human
induced pluripotent stem cell line iPSC EPITHELIAL-1 (Cat IPSC0028) is purchased from Sigma-Aldrich,
Taufkirchen, Germany as a frozen suspension of single cells. iPSC EPITHELIAL-1 cells are produced via
reprogramming of epithelial cells from a Caucasian female (24 years) using OSKM retrovirus. Pluripotency was
certified by gene and protein expression of pluripotency markers. The maintenance culture is cultured in
colonies under feeder-free conditions on Laminin-521 coating in Essential 8 (E8) medium. The cells are split
weekly.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: The previously differentiated immature peripheral neurons are thawed and seeded on
Matrigel coated plates (1:40 diluted) in 75 ul medium composed of 75% N2-S medium and 25% KSR medium,
supplemented with CHIR99021 (1.5 uM), SU5402 (5 uM) and DAPT (y-Secretase inhibitor IX, 5 uM) at a density
of 100.000 cells/cm2. One hour after seeding, treatment compounds are added to the cells in 25 ul of similar to
culture medium in which cells were seeded. 23 h after toxicant application, cells are live-stained with H-33342
and calcein-AM and incubated for 60 min. After 24 h of treatment (including staining), the cells are imaged using
a high-content microscope (Cellomics VTI Array Scan).

Baseline median absolute deviation for the assay (bmad): 3.44
Response cutoff threshold used to determine hit calls: 13.759
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content screening to describe neurite outgrowth in human iPSC-derived
immature dorsal root ganglia (iDRG) neurons, via staining with Hoechst-33342 and calcein-AM. The cells are
imaged using a high-content microscope (Cellomics VTI Array Scan). Changes in viable cells were measured.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

41.1522633744856 uM
Key positive control:

NA

Target (nominal) number of replicates:

15

Standard maximum concentration tested:

10000 uM
Neutral vehicle control:

DMSO

-------
Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For a quantitative assessment of viable cells, the same images used to assess neurite area were
analysed using another image analysis algorithm: nuclei were identified in channel 1 as objects according to
their size, area, shape, and intensity. Nuclei of apoptotic cells with increased fluorescence were excluded. A
VCSA was defined around each nucleus by expanding it by 0.3 um into each direction. Calcein-AM staining,
labelling live cells, was detected in channel 2. The algorithm quantified the calcein intensity in the VCSA areas.
Cells having an average calcein signal intensity in the VCSAs below a predefined threshold were classified by the
program as "not viable". Valid nuclei with a positive calcein signal in their cognate VCSA were counted as viable
cells. A positive calcein signal was based on measurements of the average intensity (normal cells: 1300 ± 115,
threshold: < 50) and the total integrated intensity (normal cells: 186,000 ± 23,600, threshold < 1000) of cells.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

25: bmad4 (Add a cutoff value of 4 multiplied the baseline median absolute deviation (bmad). By default,
bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

-------
5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 71	Number of chemicals tested: 71

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
26

Inaclive hit count: 0
-------
For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0.723

Neutral control median absolute deviation, by plate: nmad

0.022

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

3.14%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

0.619

0.02

Z Prime Factor for median positive and neutral control across all plates:

NA

-------
(1 - ((3 * (pmad + nmad)) / abs(pmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.649

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 4.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

• Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

-------
•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Krug AK, Balmer NV, Matt F, Schonenberger F, Merhof D, Leist M. Evaluation of a human neurite
growth assay as specific screen for developmental neurotoxicants. Arch Toxicol. 2013 Dec;87(12):2215-31. doi:
10.1007/s00204-013-1072-y. Epub 2013 May 14. PMID: 23670202., Hoelting L, Klima S, Karreman C, Grinberg
M, Meisig J, Henry M, Rotshteyn T, Rahnenfiihrer J, Bliithgen N, Sachinidis A, Waldmann T, Leist M. Stem Cell-
Derived Immature Human Dorsal Root Ganglia Neurons to Identify Peripheral Neurotoxicants. Stem Cells Transl
Med. 2016 Apr;5(4):476-87. doi: 10.5966/sctm.2015-0108. Epub 2016 Mar 1. PMID: 26933043; PMCID:
PMC4798731., Krebs A, van Vugt-Lussenburg BMA, Waldmann T, Albrecht W, Boei J, Ter Braak B, Brajnik M,
Braunbeck T, Brecklinghaus T, Busquet F, Dinnyes A, Dokler J, Dolde X, Exner TE, Fisher C, Fluri D, Forsby A,
Hengstler JG, Holzer AK, Janstova Z, Jennings P, Kisitu J, Kobolak J, Kumar M, Limonciel A, Lundqvist J, Mihalik B,
Moritz W, Pallocca G, Ulloa APC, Pastor M, Rovida C, Sarkans U, Schimming JP, Schmidt BZ, Stober R, Strassfeld
T, van de Water B, Wilmes A, van der Burg B, Verfaillie CM, von Hellfeld R, Vrieling H, Vrijenhoek NG, Leist M.
The EU-ToxRisk method documentation, data processing and chemical testing pipeline for the regulatory use of
new approach methods. Arch Toxicol. 2020 Jul;94(7):2435-2461. doi: 10.1007/s00204-020-02802-6. Epub 2020
Jul 6. PMID: 32632539; PMCID: PMC7367925.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2697

U KN2_HCS_I M R90_neu ra l_migration

1.	General Information

1.1	Assay Title: University of Konstanz (UKN) Leist Lab Neural Crest Cell Migration Assay in human induced IMR90
Pluripotent Stem Cells (UKN2)

1.2	Assay Summary: UKN2_HCS_IMR90 is a cell-based, multiplexed-readout assay screening for neural crest cell
migration and cell viability that uses IMR90 (neural crest cells differentiated from iPSC), a human embryo cell
line, with measurements taken at 3 hours after chemical dosing in a microplate: 96-well plate. UKN2, also known
as cMINC, is an assay that uses IMR90 cells (neural crest cells differentiated from iPSC) as a model of neural
crest cell migration. 24 hr after cell seeding, cell migration was initiated and proceeded for 24 hr. Then, chemical
was applied, and after 24 hr of exposure migration and viability were assessed using high-content imaging of
cells stained with calcein-AM and Hoechst H-33342. UKN2_HCS_IMR90_neural_migration is an assay
component measured from the UKN2_HCS_IMR90 assay. It is designed to make measurements of neural crest
cell migration, a form of morphology reporter, as detected with Fluorescence intensity signals by HCS
Fluorescent Imaging. The assay endpoint UKN2_HCS_IMR90_neural_migration was analyzed with bidirectional
fitting relative to DMSO as the negative control and baseline of activity. Using a type of morphology reporter,
gain or loss-of-signal activity can be used to understand developmental effects. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the neurodevelopment intended target
family, where the subfamily is neural crest cell migration.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Leist research group at the University of Konstanz conducts high content imaging assays for
neurotoxicity.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human induced pluripotent stem cell (human iPSC) line IMR90_clone_#4 has been
bought from WiCell, Wisconsin in 2012 and a master stock has been frozen.

1.9	Assay Throughput: 96-well plate. The assay uses a 96 well-plate with 2 compounds per plate, 6 different
concentrations of each compound per plate, 4 technical replicates per plate representing one biological
replicate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in the number of migrated cells are indicative of neurodevelopment.

The Neural Crest Cell Migration Assay (UKN2) is designed to investigate changes in neural crest cell (NCC)
migration in response to chemical exposure in human induced pluripotent stem cells (iPSC, i.e. IMR90) using a
high-content screening (HCS) technology. Migration of NCCs is an essential process during fetal development.
Impaired NCC migration triggered genetically or by toxicants can lead to malformations and disorders e.g.
Hirschsprung's disease or Treacher-Collins syndrome. The assay includes a parallel viability assessment to
measure changes in cell viability.

-------
2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is migration of neural crest cells (NCCs). Impaired NCC migration triggered genetically or by toxicants
can lead to malformations and disorders e.g. Hirschsprung's disease or Treacher-Collins syndrome.

2.3	Experimental System: adherent IMR90 (neural crest cells differentiated from iPSC) cell line used.
Undifferentiated human iPSC cells (IMR90, WiCell) are maintained as monoculture on Laminin-521 coating in
essential 8 (E8) medium. The cells grow in colonies, and are split weakly. The cells show self-renewal and
pluripotency characteristics (regular testing). The cells can be differentiated into several different cell types.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: The wells are coated with PLO/laminin/fibronectin. Then, silicone stoppers are placed into
the wells of a 96-well plate. Cells are seeded around the stoppers and allowed to attach for 24 hours. Day 0: The
silicone stoppers are removed and the medium is replaced with pre-warmed, fresh N2-S medium containing the
cytokines EGF and FGF. Cells are allowed to migrate into the cell free area. Day 1: 25 ul of the 5x concentrated
toxicants are added to the wells. The cells plus toxicants are incubated for 24 h. Day 2: Cells are stained with
calcein-AM and Hoechst (H-33342) for 30 min before imaging with a high content imaging microscope.
Quantification of migration and viability is done by high content imaging analysis.

Baseline median absolute deviation for the assay (bmad): 11.106
Response cutoff threshold used to determine hit calls: 44.422
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content screening to describe neural crest cell (NCC) migration, via staining
with Hoechst-33342 and calcein-AM. The cells are imaged using a high-content microscope (Cellomics). For
migration quantification, a software tool has been developed (http://invitrotox.uni-konstanz.de/) to estimate
the number of cells that migrated into the region of interest (ROI).

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/docurnent/EPA-HQ-QPP-2020-0263-0006

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

100 nM
Key positive control:

Cytochalasin D

Target (nominal) number of replicates:

8

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO

-------
Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: NCCs are plated around silicone stoppers in a culture dish and are allowed to migrate into the
cell free area upon removal of the stopper. The number of migrated cells into the cell free zone is quantified 24
h after toxicant treatment. Migration inhibition of NCCs after treatment with toxicants is measured relative to
control conditions (solvent control cells). For the quantification the cells are stained with calcein-AM and H-
33342 for 30 min at 37 C. The center of the well (migration zone) is imaged in four tiles with a 5x objective.
Afterwards the four images are stitched together to obtain one image. For migration quantification, a software
tool has been developed (http://invitrotox.uni-konstanz.de/). With the help of this software the previously cell-
free area can be estimated and the number of H-33342 and calcein double-positive cells in the region of interest
(ROI) can be counted. The diameter of the ROI was chosen so that 150 to 300 cells were in the ROI in untreated
conditions. An Excel table containing the number of viable cells in the ROI of all wells of the plate is generated
by the software.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

25: bmad4 (Add a cutoff value of 4 multiplied the baseline median absolute deviation (bmad). By default,
bmad is calculated using test compound wells (wilt = t) for the endpoint.)

-------
Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 181	Number of chemicals tested: 167

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
40

Inactive hit count: 0
-------
For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

293

Neutral control median absolute deviation, by plate: nmad

31.135

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

10.48%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

22.239

138.5

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

NA

-------
Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-2.803

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	605.25

Negative control well median absolute deviation value, by plate: mmad	24.463

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	1.56

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 12.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

• Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

-------
•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Nyffeler J, Dolde X, Krebs A, Pinto-Gil K, Pastor M, Behl M, Waldmann T, Leist M. Combination of
multiple neural crest migration assays to identify environmental toxicants from a proof-of-concept chemical
library. Arch Toxicol. 2017 Nov;91(ll):3613-3632. doi: 10.1007/s00204-017-1977-y. Epub 2017 May 5. PMID:
28477266.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2699

UKN2_HCS_IMR90_cell_viability

1.	General Information

1.1	Assay Title: Viability Assessment in the University of Konstanz (UKN) Leist Lab Neural Crest Cell Migration Assay
in human induced IMR90 Pluripotent Stem Cells (UKN2)

1.2	Assay Summary: UKN2_HCS_IMR90 is a cell-based, multiplexed-readout assay screening for neural crest cell
migration and cell viability that uses IMR90 (neural crest cells differentiated from iPSC), a human embryo cell
line, with measurements taken at 3 hours after chemical dosing in a microplate: 96-well plate. UKN2, also known
as cMINC, is an assay that uses IMR90 cells (neural crest cells differentiated from iPSC) as a model of neural
crest cell migration. 24 hr after cell seeding, cell migration was initiated and proceeded for 24 hr. Then, chemical
was applied, and after 24 hr of exposure migration and viability were assessed using high-content imaging of
cells stained with calcein-AM and Hoechst H-33342. UKN2_HCS_IMR90_cell_viability is an assay component
measured from the UKN2_HCS_IMR90 assay. It is designed to make measurements of enzyme activity, a form
of viability reporter, as detected with Fluorescence intensity signals by HCS Fluorescent Imaging. The assay
endpoint UKN2_HCS_IMR90_cell_viability was analyzed with bidirectional fitting relative to DMSO as the
negative control and baseline of activity. Using a type of viability reporter, gain or loss-of-signal activity can be
used to understand viability effects. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the cell cycle intended target family, where the subfamily is viability.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Leist research group at the University of Konstanz conducts high content imaging assays for
neurotoxicity.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The human induced pluripotent stem cell (human iPSC) line IMR90_clone_#4 has been
bought from WiCell, Wisconsin in 2012 and a master stock has been frozen.

1.9	Assay Throughput: 96-well plate. The assay uses a 96 well-plate with 2 compounds per plate, 6 different
concentrations of each compound per plate, 4 technical replicates per plate representing one biological
replicate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in the number of Hoechst labelled nuclei are indicative of viability.

The Neural Crest Cell Migration Assay (UKN2) is designed to investigate changes in neural crest cell (NCC)
migration in response to chemical exposure in human induced pluripotent stem cells (iPSC, i.e. IMR90) using a
high-content screening (HCS) technology. Migration of NCCs is an essential process during fetal development.
Impaired NCC migration triggered genetically or by toxicants can lead to malformations and disorders e.g.
Hirschsprung's disease or Treacher-Collins syndrome. The assay includes a parallel viability assessment to
measure changes in cell viability.

-------
2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is migration of neural crest cells (NCCs). Impaired NCC migration triggered genetically or by toxicants
can lead to malformations and disorders e.g. Hirschsprung's disease or Treacher-Collins syndrome.

2.3	Experimental System: adherent IMR90 (neural crest cells differentiated from iPSC) cell line used.
Undifferentiated human iPSC cells (IMR90, WiCell) are maintained as monoculture on Laminin-521 coating in
essential 8 (E8) medium. The cells grow in colonies, and are split weakly. The cells show self-renewal and
pluripotency characteristics (regular testing). The cells can be differentiated into several different cell types.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: The wells are coated with PLO/laminin/fibronectin. Then, silicone stoppers are placed into
the wells of a 96-well plate. Cells are seeded around the stoppers and allowed to attach for 24 hours. Day 0: The
silicone stoppers are removed and the medium is replaced with pre-warmed, fresh N2-S medium containing the
cytokines EGF and FGF. Cells are allowed to migrate into the cell free area. Day 1: 25 ul of the 5x concentrated
toxicants are added to the wells. The cells plus toxicants are incubated for 24 h. Day 2: Cells are stained with
calcein-AM and Hoechst (H-33342) for 30 min before imaging with a high content imaging microscope.
Quantification of migration and viability is done by high content imaging analysis.

Baseline median absolute deviation for the assay (bmad): 4.04
Response cutoff threshold used to determine hit calls: 16.162
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content screening to describe neural crest cell (NCC) migration, via staining
with Hoechst-33342 and calcein-AM. The cells are imaged using a high-content microscope (Cellomics). For
migration quantification, a software tool has been developed (http://invitrotox.uni-konstanz.de/) to estimate
the number of cells that migrated into the region of interest (ROI).

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/docurnent/EPA-HQ-QPP-2020-0263-0006

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
4

Standard minimum concentration tested:

100 nM
Key positive control:

NA

Target (nominal) number of replicates:

8

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO

-------
Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Cell viability is measured after 48 h outside the migration zone in the same well. The cells are
stained with calcein-AM and H-33342 and four fields outside the migration zone are imaged with a lOx objective.
Viability is defined as the number of H-33342 and calcein double-positive cells, viable cells are determined by
an automated algorithm. An excel file is generated with the number of viable cells in each well. Migration and
Viability are normalized to untreated controls.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

25: bmad4 (Add a cutoff value of 4 multiplied the baseline median absolute deviation (bmad). By default,
bmad is calculated using test compound wells (wilt = t) for the endpoint.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with

-------
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 181	Number of chemicals tested: 167

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
47

Inactive hit count: 0
-------
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed
Neutral control median absolute deviation, by plate: nmad
Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

Z Prime Factor for median positive and neutral control across all plates:

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-0.785

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

1801.5
62.269
3.83%

1531
69.682

NA

-------
Positive control signal-to-background: (pmed/nmed)

NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	2019.75

Negative control well median absolute deviation value, by plate: mmad	40.401

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	0.141

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 7.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

-------
• Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Nyffeler J, Dolde X, Krebs A, Pinto-Gil K, Pastor M, Behl M, Waldmann T, Leist M. Combination of
multiple neural crest migration assays to identify environmental toxicants from a proof-of-concept chemical
library. Arch Toxicol. 2017 Nov;91(ll):3613-3632. doi: 10.1007/s00204-017-1977-y. Epub 2017 May 5. PMID:
28477266.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2701

U KN4_HCS_LU H M ES_neu rite_outgrowth

1.	General Information

1.1	Assay Title: University of Konstanz (UKN) Leist Lab Neurite Outgrowth Assay in human LUHMES cells (UKN4)

1.2	Assay Summary: UKN4_HCS_LUHMES is a cell-based, multiplexed-readout assay screening for neurite
outgrowth and cell viability that uses LUHMES, a human central nervous system cell line, with measurements
taken at 3 hours after chemical dosing in a microplate: 96-well plate. UKN4, also referred to as NeuriTox, is an
assay that uses LUHMES neuronal precursors as a model of dopaminergic neuronal differentiation. Following a
2-day differentiation period, cells are exposed to chemicals for 24 hr and neurite area is evaluated as a marker
of neurite outgrowth using high-content imaging of cells stained with calcein AM. Cell viability is assessed using
stain Hoechst H-33342. UKN4_HCS_LUHMES_neurite_outgrowth is an assay component measured from the
UKN4_HCS_LHUMES assay. It is designed to make measurements of neurite outgrowth, a form of morphology
reporter, as detected with Fluorescence intensity signals by HCS Fluorescent Imaging. The assay endpoint
UKN4_HCS_LUHMES_neurite_outgrowth was analyzed with bidirectional fitting relative to DMSO as the
negative control and baseline of activity. Using a type of morphology reporter, gain or loss-of-signal activity can
be used to understand developmental effects. To generalize the intended target to other relatable targets, this
assay endpoint is annotated to the neurodevelopment intended target family, where the subfamily is neurite
outgrowth.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Leist research group at the University of Konstanz conducts high content imaging assays for
neurotoxicity.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The LUHMES cells have been brought from Lund (Sweden) to Konstanz in 2006 (Lotharius
et al., 2005) and a master stock has been frozen. The cells used for the test method are continuously generated
by cell culture.

1.9	Assay Throughput: 96-well plate. The assay uses a 96 well-plate with 5 compounds per plate, 10 different
concentrations of each compound per plate, 3 technical replicates per plate representing one biological
replicate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1 Purpose: Changes in the neurite area are indicative of neurodevelopment.

The Neurite Outgrowth Assay (UKN4) is designed to investigate changes in neurite outgrowth (NOG) in response
to chemical exposure in human neurons (LUHMES cells) using a high-content screening (HCS) technology.
Neurite outgrowth is one of several key processes of neurodevelopment. The assay includes a parallel viability
assessment to measure changes in cell viability by staining of Hoechst H-33342.

2.2

Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental

-------
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together. This assay assesses disturbance in the
development of neurons in the central nervous system with a dopaminergic phenotype.

2.3	Experimental System: adherent LUHMES cell line used. LUHMES cells originate from the ventral mesencephalon
of an 8 week old human, female fetus. They exhibit the same characteristics as MESC2.10 cells. LUHMES cells
can be differentiated into morphologically and biochemically mature dopamine-like neurons following exposure
to tetracycline, GDNF (glial cell line-derived neurotrophic factor), and db-cAMP for 6 days. LUHMES ATCC CRL-
2927; LUHMES cells used at the University of Konstanz differ from LUHMES cells deposited and distributed by
ATCC.

2.4	Metabolic Competence: Dopamine transporter is expressed and used for MPP+ (l-methyl-4-phenylpyridinium)
transport.

2.5	Exposure Regime: Proliferating LUHMES cells are seeded in proliferation medium for 24 hours. Day 0: Medium
is changed from proliferation medium to differentiation medium. Day 2: Cell number reaches about 30-40 Mio
per T175 flask. Cells are trypsinized and replated onto 96-well plates (30000 cells/well in 90 ul) in differentiation
medium without cAMP and GDNF. At about 30 min - 2 hours after seeding, when cells have attached, the
compounds are added (10 ul of each dilution; total volume 100 ul). Day 3: 23.5 h after toxicant treatment, cells
are live-stained with H-33342 and calcein-AM and incubated for 30 min. After 24 h of treatment (including
staining), the cells are imaged using a high-content microscope (Cellomics).

Baseline median absolute deviation for the assay (bmad): 8.469
Response cutoff threshold used to determine hit calls: 25
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content screening to describe neurite outgrowth in human neurons
(LUHMES cells), via staining with Hoechst-33342 and calcein-AM. The cells are imaged using a high-content
microscope (Cellomics VTI Array Scan). Changes in neurite area are measured.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.508052634252909 nM
Key positive control:

Narciclasine

Target (nominal) number of replicates:

36

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO

-------
DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: An automated microplate reading microscope (Array-Scanll HCS Reader, Cellomics, PA) equipped
with a Hamamatsu ORCA-ER camera (resolution 1024 x 1024; run at 2 x 2 binning) was used for image
acquisition. Ten fields per well were imaged. Images were recorded in 2 channels using a 20x objective and
excitation/emission wavelengths of 365 ± 50/535 ± 45 to detect H-33342 in channel 1 and 474 ± 40/535 ± 45 to
detect calcein in channel 2. In both channels, a fixed exposure time and an intensity histogram-derived threshold
were used for object identification. Neurite pixels were identified using the following image analysis algorithm:
nuclei were identified as objects in channel 1 according to their size, area, shape, and intensity which were
predefined on untreated cells using a machine-based learning algorithm, and manual selection of nuclei to be
classified as intact. The nuclear outlines were expanded by 3.2 um in each direction, to define a virtual cell soma
area (VCSA) based on the following procedure: The average width of the cytoplasm ring (distance nucleus - cell
membrane) of LUHMES cells was experimentally determined to be 2.3 um. Size irregularities were not always
due to growing neurites, as determined by combined F-actin/tubulin beta-Ill staining. To avoid scoring of false
positive neurite areas, the exclusion ring (VCSA) was made bigger than the average cell size. Then, we used two
control compounds (U0126 and bisindolylmaleimid I) to vary the expanded outlines from 0.6 to 4 um. We found
3.2 um to be optimal both to detect neurite growth over time and to identify reduced neurite growth with high
sensitivity. All calcein-positive pixels of the field (beyond a given intensity threshold) were defined as viable
cellular structures (VCSs). The threshold was dynamically determined for each field after flat field and
background correction and intensity normalization to 512 gray values and was set to 12% of the maximal
brightness (channel 63 of 512). The VCS defines the sum of all somata and neurites without their assignment to
individual cells. In an automatic calculation, the VCSAs, defined in the H-33342 channel, were used as filter in
the calcein channel and subtracted from the VCS. The remaining pixels (VCS - VCSA) in the calcein channel were
defined as neurite area.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive

-------
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

18: pc25 (Add a cutoff value of 25. Typically for percent of control data.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 75	Number of chemicals tested: 75

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
42

Inactive hit count: 0
-------
gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

6

6

9

quadratic-polynomialfpoly2) model:	11

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

2

1

15

12

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

-------
Neutral control well median response value, by plate: nmed

67242.41

Neutral control median absolute deviation, by plate: nmad	5508.022

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	8.65%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	32505.16

Positive control well median absolute deviation, by plate: pmad	3452.531

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-4.035

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 12.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Krug AK, Balmer NV, Matt F, Schonenberger F, Merhof D, Leist M. Evaluation of a human neurite
growth assay as specific screen for developmental neurotoxicants. Arch Toxicol. 2013 Dec;87(12):2215-31. doi:
10.1007/s00204-013-1072-y. Epub 2013 May 14. PMID: 23670202., Krebs A, van Vugt-Lussenburg BMA,
Waldmann T, Albrecht W, Boei J, Ter Braak B, Brajnik M, Braunbeck T, Brecklinghaus T, Busquet F, Dinnyes A,
Dokler J, Dolde X, Exner TE, Fisher C, Fluri D, Forsby A, Hengstler JG, Holzer AK, Janstova Z, Jennings P, Kisitu J,
Kobolak J, Kumar M, Limonciel A, Lundqvist J, Mihalik B, Moritz W, Pallocca G, Ulloa APC, Pastor M, Rovida C,
Sarkans U, Schimming JP, Schmidt BZ, Stober R, Strassfeld T, van de Water B, Wilmes A, van der Burg B, Verfaillie
CM, von Hellfeld R, Vrieling H, Vrijenhoek NG, Leist M. The EU-ToxRisk method documentation, data processing
and chemical testing pipeline for the regulatory use of new approach methods. Arch Toxicol. 2020
Jul;94(7):2435-2461. doi: 10.1007/s00204-020-02802-6. Epub 2020 Jul 6. PMID: 32632539; PMCID:
PMC7367925., Leist, M. DB-ALM Protocol No. 200: UNK4 Assay to Test Compound Derived Impairment in
Neurite Outgrowth in Differentiating Human Dopaminergic Neurons, http://cidportal.jrc.ec.europa.eu/ftp/jrc-
opendata/EURL-ECVAM/datasets/DBALM/LATEST/online/DBALM_docs/200_P_U KN4.pdf.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 2703

UKN4_HCS_LUHMES_cell_viability

1.	General Information

1.1	Assay Title: Viability Assessment in the University of Konstanz (UKN) Leist Lab Neurite Outgrowth Assay in
human LUHMES cells (UKN4)

1.2	Assay Summary: UKN4_HCS_LUHMES is a cell-based, multiplexed-readout assay screening for neurite
outgrowth and cell viability that uses LUHMES, a human central nervous system cell line, with measurements
taken at 3 hours after chemical dosing in a microplate: 96-well plate. UKN4, also referred to as NeuriTox, is an
assay that uses LUHMES neuronal precursors as a model of dopaminergic neuronal differentiation. Following a
2-day differentiation period, cells are exposed to chemicals for 24 hr and neurite area is evaluated as a marker
of neurite outgrowth using high-content imaging of cells stained with calcein AM. Cell viability is assessed using
stain Hoechst H-33342. UKN4_HCS_LUHMES_cell_viability is an assay component measured from the
UKN4_HCS_LHUMES assay. It is designed to make measurements of enzyme activity, a form of viability reporter,
as detected with Fluorescence intensity signals by HCS Fluorescent Imaging. The assay endpoint
UKN4_HCS_LUHMES_cell_viability was analyzed with bidirectional fitting relative to DMSO as the negative
control and baseline of activity. Using a type of viability reporter, gain or loss-of-signal activity can be used to
understand viability effects. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cell cycle intended target family, where the subfamily is viability.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: The Leist research group at the University of Konstanz conducts high content imaging assays for
neurotoxicity.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: The LUHMES cells have been brought from Lund (Sweden) to Konstanz in 2006 (Lotharius
et al., 2005) and a master stock has been frozen. The cells used for the test method are continuously generated
by cell culture.

1.9	Assay Throughput: 96-well plate. The assay uses a 96 well-plate with 5 compounds per plate, 10 different
concentrations of each compound per plate, 3 technical replicates per plate representing one biological
replicate.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in the number of Hoechst labelled nuclei are indicative of viability.

The Neurite Outgrowth Assay (UKN4) is designed to investigate changes in neurite outgrowth (NOG) in response
to chemical exposure in human neurons (LUHMES cells) using a high-content screening (HCS) technology.
Neurite outgrowth is one of several key processes of neurodevelopment. The assay includes a parallel viability
assessment to measure changes in cell viability by staining of Hoechst H-33342.

2.2	Scientific Principles: During the development of the nervous systems, many processes occur to give rise to a
functional and healthy neural network and hence nervous system. These important neurodevelopmental

-------
processes may be disrupted by potential toxicants, resulting in developmental neurotoxicity. Among these
processes is neurite outgrowth - the physical outward growth of neurites (eventually axons and dendrites) of
individual neurons that allows them to make connections with other neurons and ultimately gives rise to the
physical network of cells that connect the nervous system together. This assay assesses disturbance in the
development of neurons in the central nervous system with a dopaminergic phenotype.

2.3	Experimental System: adherent LUHMES cell line used. LUHMES cells originate from the ventral mesencephalon
of an 8 week old human, female fetus. They exhibit the same characteristics as MESC2.10 cells. LUHMES cells
can be differentiated into morphologically and biochemically mature dopamine-like neurons following exposure
to tetracycline, GDNF (glial cell line-derived neurotrophic factor), and db-cAMP for 6 days. LUHMES ATCC CRL-
2927; LUHMES cells used at the University of Konstanz differ from LUHMES cells deposited and distributed by
ATCC.

2.4	Metabolic Competence: Dopamine transporter is expressed and used for MPP+ (l-methyl-4-phenylpyridinium)
transport.

2.5	Exposure Regime: Proliferating LUHMES cells are seeded in proliferation medium for 24 hours. Day 0: Medium
is changed from proliferation medium to differentiation medium. Day 2: Cell number reaches about 30-40 Mio
per T175 flask. Cells are trypsinized and replated onto 96-well plates (30000 cells/well in 90 ul) in differentiation
medium without cAMP and GDNF. At about 30 min - 2 hours after seeding, when cells have attached, the
compounds are added (10 ul of each dilution; total volume 100 ul). Day 3: 23.5 h after toxicant treatment, cells
are live-stained with H-33342 and calcein-AM and incubated for 30 min. After 24 h of treatment (including
staining), the cells are imaged using a high-content microscope (Cellomics).

Baseline median absolute deviation for the assay (bmad): 0.708
Response cutoff threshold used to determine hit calls: 25
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: This assay utilizes a high-content screening to describe neurite outgrowth in human iPSC-derived
immature dorsal root ganglia (iDRG) neurons, via staining with Hoechst-33342 and calcein-AM. The cells are
imaged using a high-content microscope (Cellomics VTI Array Scan). Changes in viable cells were measured.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
10

Standard minimum concentration tested:

0.508052634252909 nM
Key positive control:

NA

Target (nominal) number of replicates:

36

Standard maximum concentration tested:

10000 nM
Neutral vehicle control:

DMSO

-------
DNT-IVB: Assays included in the developmental neurotoxicity in vitro battery, see
https://www.regulations.gov/document/EPA-HQ-OPP-2020-0263-0006

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: For a quantitative assessment of viable cells, the same images used to assess neurite area were
analysed using another image analysis algorithm: nuclei were identified in channel 1 as objects according to
their size, area, shape, and intensity. Nuclei of apoptotic cells with increased fluorescence were excluded. A
VCSA was defined around each nucleus by expanding it by 0.3 um into each direction. Calcein-AM staining,
labelling live cells, was detected in channel 2. The algorithm quantified the calcein intensity in the VCSA areas.
Cells having an average calcein signal intensity in the VCSAs below a predefined threshold were classified by the
program as "not viable". Valid nuclei with a positive calcein signal in their cognate VCSA were counted as viable
cells. A positive calcein signal was based on measurements of the average intensity (normal cells: 1300 ± 115,
threshold: < 50) and the total integrated intensity (normal cells: 186,000 ± 23,600, threshold < 1000) of cells.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 32: pval.zero (Set the positive control value (pval) to 0;
pval = 0.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

18: pc25 (Add a cutoff value of 25. Typically for percent of control data.)

-------
Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 75	Number of chemicals tested: 75

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
26

Inactive hit count: Oihitc 0.9
49

WINING MODEL SELECTION

NA hit count: hitc^O
0

Number of sample-assay endpoints with winning hill model:

11
1

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

3

24

quadratic-polynomialfpoly2) model: 9

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:

0

8

13

-------
exponentials (exp5) model:

6

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

0.963

Neutral control median absolute deviation, by plate: nmad

0.007

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

0.72%

POSITIVE CONTROL (well type = "p")
Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

0.941

0.008

-------
Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - rimed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	-1.582

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 6.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.

5.1

Potential Regulatory Applications

Context of Use: Examples of end use scenarios could include, but are not limited to:

-------
•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Krug AK, Balmer NV, Matt F, Schonenberger F, Merhof D, Leist M. Evaluation of a human neurite
growth assay as specific screen for developmental neurotoxicants. Arch Toxicol. 2013 Dec;87(12):2215-31. doi:
10.1007/s00204-013-1072-y. Epub 2013 May 14. PMID: 23670202., Krebs A, van Vugt-Lussenburg BMA,
Waldmann T, Albrecht W, Boei J, Ter Braak B, Brajnik M, Braunbeck T, Brecklinghaus T, Busquet F, Dinnyes A,
Dokler J, Dolde X, Exner TE, Fisher C, Fluri D, Forsby A, Hengstler JG, Holzer AK, Janstova Z, Jennings P, Kisitu J,
Kobolak J, Kumar M, Limonciel A, Lundqvist J, Mihalik B, Moritz W, Pallocca G, Ulloa APC, Pastor M, Rovida C,
Sarkans U, Schimming JP, Schmidt BZ, Stober R, Strassfeld T, van de Water B, Wilmes A, van der Burg B, Verfaillie
CM, von Hellfeld R, Vrieling H, Vrijenhoek NG, Leist M. The EU-ToxRisk method documentation, data processing
and chemical testing pipeline for the regulatory use of new approach methods. Arch Toxicol. 2020
Jul;94(7):2435-2461. doi: 10.1007/s00204-020-02802-6. Epub 2020 Jul 6. PMID: 32632539; PMCID:
PMC7367925., Leist, M. DB-ALM Protocol No. 200: UNK4 Assay to Test Compound Derived Impairment in
Neurite Outgrowth in Differentiating Human Dopaminergic Neurons, http://cidportal.jrc.ec.europa.eu/ftp/jrc-
opendata/EURL-ECVAM/datasets/DBALM/LATEST/online/DBALM_docs/200_P_U KN4.pdf.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3019

VALA_TUBHUV_Agonist_CellCount

1.	General Information

1.1	Assay Title: Viability Assessment in the VALA Sciences human Umbilical Vein Endothelial Tubule Formation
(TUBHUV) Agonism Assay

1.2	Assay Summary: VALA_TUBHUV_Agonist is a cell-based, multiplexed assay that uses human umbilical vein
endothelial cells (HUVEC). Measurements were taken 72 hours after chemical dosing in a 96-well plate.
VALA_TUBHUV_Agonist_CellCount is the one of two assay components measured from the
VALA_TUBHUV_Agonist assay. It is designed to make measurements of viability, a form of viability reporter, as
detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the assay
component VALA_TUBHUV_Agonist_CellCount was analyzed at the endpoint,
VALA_TUBHUV_Agonist_CellCount, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of viability reporter, loss-of-signal activity can be used to
understand viability. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cell cycle intended target family, where the subfamily is cell count.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Vala Sciences, a Contract Research Organization (CRO), provides cell-based assay services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Endothelial cell culture, treatments, and assay endpoint measurements were conducted
independently by VALA sciences (VALA).

1.9	Assay Throughput: 384-well plate. VALA systems offer medium-to-high throughput chemical screening
amenability in a 96 well format for both neurogenic- or angiogenic-specific endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in fluorescence intensity related to the number of DAPI labelled nuceli is indicative of the
viability of the system.

Chemical-induced perturbations to cellular key events across angiogenic and neurogenic outcomes in vitro can
inform on cell-based prioritization of neurovascular unit (NVU) hazard potential.

2.2	Scientific Principles: The developing neurovascular unit (NVU) gives rise to the preliminary interaction of the
fetal central nervous system and circulatory system and ultimately the microvasculature forming a blood-brain-
barrier (BBB). As the NVU comprises multiple cell types interacting at various stages of development, a
methodology combining high-throughput results across pertinent cell-based assays is needed to investigate
potential chemical-induced disruption to the development of this complex cell system. With each in vitro model,
different characteristics of the developing NVU can be recapitulated to identify the biological complexity
necessary for NVU development, i.e. vascular remodeling, barrier integrity measurement, transporter kinetics
for influx/efflux of xenobiotics, cell-cell interactions for tissue development, etc. ArunA and VALA assay sources
comprise several neurogenic and angiogenic endpoints, contribute to the development of the neurovascular

-------
toxicity signature. The migration of endothelial cells is a critical part of angiogenesis. The scratch assay is widely
utilized in studies of angiogenesis, to quantify the migration of endothelial cells, and serves to complement
information obtained with the angiogenesis "tube formation" assays. These assays are relevant to embryology,
and also to toxicity/pathology. In vitro tubule formation was monitored in human endothelial cells co-cultured
with primary human dermal fibroblast (HDF) cells through VALA Sciences. Endothelial cells isolated from human
umbilical cord (HUVECs) assemble into branched networks, which represent the initiation of blood vessel
formation, when cultured onto a fibroblast or gel layer (Sarkanen et al, 2010).

2.3	Experimental System: adherent HUVEC cell line used. The VALA tubule formation assay used cryopreserved
human endothelial (HUVEC) cells (CAI catalog 200-05f) co-cultured with primary human dermal fibroblast (HDF)
cells through VALA Sciences.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: This HUVEC-HDF assay was designed to detect chemical suppression of tubulogenesis, i.e.
antagonism. To measure tubulogenesis antagonism, HUVEC cells were co-cultured with HDF cells at a 1:3 ratio
for 3-days in 10 ng/ml VEGF (i.e., favorable conditions with growth factors), then changed to assay medium
containing 10 ng/ml VEGF and ToxCast compounds for an additional 3-day period. Neutral control for tubule
formation was 0.1% DMSO vehicle while the positive control for tubulogenesis inhibition was suramin (100 uM).
Endpoints were cell titer and tubule length. Cells were then fixed and visualized by nuclear stain and CD31-
immunofluorescence. Cell counts were output for HUVEC and HDFs with mean total tubule length area served
as the primary output feature. Other tubule related features (tubule count, tubule length, tubule area, and node
count) were processed as well.

Baseline median absolute deviation for the assay (bmad): 0.07
Response cutoff threshold used to determine hit calls: 0.211
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: Cell motility is fundamental to a variety of biological processes, ranging from development of neural-
crest-derived structures in embryology, wound healing, angiogenesis, and cell migration in the development,
growth, and metastasis of tumors. To quantify the effects of test compounds on cell motility in a format enabling
high throughput format, Vala Sciences Inc has developed a protocol for "scratch assays" compatible with cells
plated in 384 well dishes. In these experiments, cells are seeded, and allowed to form confluent monolayers.
The monolayer is then "scratched" in a reproducible manner, so that cells are removed from the center of the
well, creating a semi-rectangular empty space. The cells are then further incubated, for 7 to 48 hrs, in the
presence of test compounds. During this time period, cells from the margins of the wells migrate into the center.
The number of cells can then be quantified via methods of high content analysis (identification of nuclei, cell
shape, and expression of relevant biomarkers). If migration is inhibited, fewer cells enter the wound area.
Additionally, using these same endothelial cells, angiogenesis ('tubule' formation) assays recapitulate
microvascular vessels and large veins ("macro vessels"), which form capillary-like structures in vitro. Cells from
macro- and micro- vascular endothelial cells often differ in their tubule formation rates and characteristics,
which emphasizes the need to test compounds against cells derived from both cell types. Since cell motility is
an important component of the ability of cells to form new blood vessels, the cell migration assays will provide
information which complements the information obtained from the tubule assays in the context of
angiogenesis.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0129 uM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 uM
Neutral vehicle control:

DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Endpoints were cell titer and tubule length. Cells were then fixed and visualized by nuclear stain
and CD31-immunofluorescence. Cell counts were output for HUVEC and HDFs with mean total tubule length
area served as the primary output feature. Other tubule related features (tubule count, tubule length, tubule
area, and node count) were processed as well. Plate-level raw data, provided by each assay source, were
received by EPA from each contractor and analyzed using the ToxCast Pipeline (tcpl). Normalized response
values for each assay endpoint were calculated as resp = log2(rval/bval) where rval, bval, and pval correspond
to the raw value, the plate level DMSO control median, and the plate level positive/negative control median,
respectively.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay
plate ID (apid), of the corrected values (cval) for neutral control wells (wilt = n).)

-------
Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Numberof samples tested: 48	Number of chemicals tested: 46

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
9

28

11

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

3
2

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

6

8

quadratic-polynomialfpoly2) model: 5

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

20

2

1

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	6563.5

Neutral control median absolute deviation, by plate: nmad	256.49

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	3.69%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3020

VALA_TU BH UV_Agonist_Tu bu leLength

1.	General Information

1.1	Assay Title: VALA Sciences human Umbilical Vein Endothelial Tubule Formation (TUBHUV) Agonism Assay

1.2	Assay Summary: VALA_TUBHUV_Agonist is a cell-based, multiplexed assay that uses human umbilical vein
endothelial cells (HUVEC). Measurements were taken 72 hours after chemical dosing in a 96-well plate.
VALA_TUBHUV_Agonist_TubuleLength is the one of two assay components measured from the
VALA_TUBHUV_Agonist assay. It is designed to make measurements of tubulogenesis, a form of morphology
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. This
VALA_TUBHUV1 assay was designed to detect chemical activation of tubulogenesis, i.e. agonism. Data from the
assay component VALA_TUBHUV_Agonist_TubuleLength was analyzed at the endpoint,
VALA_TUBHUV_Agonist_TubuleLength, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of morphology reporter, gain-of-signal activity can be used to
understand developmental effects. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the neurodevelopment intended target family, where the subfamily is tubulogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Vala Sciences, a Contract Research Organization (CRO), provides cell-based assay services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Endothelial cell culture, treatments, and assay endpoint measurements were conducted
independently by VALA sciences (VALA).

1.9	Assay Throughput: 384-well plate. VALA systems offer medium-to-high throughput chemical screening
amenability in a 96 well format for both neurogenic- or angiogenic-specific endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Vein growth supports angiogenesis, which is critical for development, maturation, and
reproductive function. Dysregulation of angiogenesis can lead to birth defects, macular degeneration, impaired
wound healing, and tumor growth. This assay measures compound effects on vein formation in basal medium,
which lacks growth factors that support blood vessel growth. Tube formation agonists will increase vessel
growth in these conditions.

Chemical-induced perturbations to cellular key events across angiogenic and neurogenic outcomes in vitro can
inform on cell-based prioritization of neurovascular unit (NVU) hazard potential.

2.2	Scientific Principles: The developing neurovascular unit (NVU) gives rise to the preliminary interaction of the
fetal central nervous system and circulatory system and ultimately the microvasculature forming a blood-brain-
barrier (BBB). As the NVU comprises multiple cell types interacting at various stages of development, a
methodology combining high-throughput results across pertinent cell-based assays is needed to investigate
potential chemical-induced disruption to the development of this complex cell system. With each in vitro model,
different characteristics of the developing NVU can be recapitulated to identify the biological complexity

-------
necessary for NVU development, i.e. vascular remodeling, barrier integrity measurement, transporter kinetics
for influx/efflux of xenobiotics, cell-cell interactions for tissue development, etc. ArunA and VALA assay sources
comprise several neurogenic and angiogenic endpoints, contribute to the development of the neurovascular
toxicity signature. The migration of endothelial cells is a critical part of angiogenesis. The scratch assay is widely
utilized in studies of angiogenesis, to quantify the migration of endothelial cells, and serves to complement
information obtained with the angiogenesis "tube formation" assays. These assays are relevant to embryology,
and also to toxicity/pathology. In vitro tubule formation was monitored in human endothelial cells co-cultured
with primary human dermal fibroblast (HDF) cells through VALA Sciences. Endothelial cells isolated from human
umbilical cord (HUVECs) assemble into branched networks, which represent the initiation of blood vessel
formation, when cultured onto a fibroblast or gel layer (Sarkanen et a I, 2010).

2.3	Experimental System: adherent HUVEC cell line used. The VALA tubule formation assay used cryopreserved
human endothelial (HUVEC) cells (CAI catalog 200-05f) co-cultured with primary human dermal fibroblast (HDF)
cells through VALA Sciences.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: This HUVEC-HDF assay was designed to detect chemical suppression of tubulogenesis, i.e.
antagonism. To measure tubulogenesis antagonism, HUVEC cells were co-cultured with HDF cells at a 1:3 ratio
for 3-days in 10 ng/ml VEGF (i.e., favorable conditions with growth factors), then changed to assay medium
containing 10 ng/ml VEGF and ToxCast compounds for an additional 3-day period. Neutral control for tubule
formation was 0.1% DMSO vehicle while the positive control for tubulogenesis inhibition was suramin (100 uM).
Endpoints were cell titer and tubule length. Cells were then fixed and visualized by nuclear stain and CD31-
immunofluorescence. Cell counts were output for HUVEC and HDFs with mean total tubule length area served
as the primary output feature. Other tubule related features (tubule count, tubule length, tubule area, and node
count) were processed as well.

Baseline median absolute deviation for the assay (bmad): 5.37
Response cutoff threshold used to determine hit calls: 16.109
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: Cell motility is fundamental to a variety of biological processes, ranging from development of neural-
crest-derived structures in embryology, wound healing, angiogenesis, and cell migration in the development,
growth, and metastasis of tumors. To quantify the effects of test compounds on cell motility in a format enabling
high throughput format, Vala Sciences Inc has developed a protocol for "scratch assays" compatible with cells
plated in 384 well dishes. In these experiments, cells are seeded, and allowed to form confluent monolayers.
The monolayer is then "scratched" in a reproducible manner, so that cells are removed from the center of the
well, creating a semi-rectangular empty space. The cells are then further incubated, for 7 to 48 hrs, in the
presence of test compounds. During this time period, cells from the margins of the wells migrate into the center.
The number of cells can then be quantified via methods of high content analysis (identification of nuclei, cell
shape, and expression of relevant biomarkers). If migration is inhibited, fewer cells enter the wound area.
Additionally, using these same endothelial cells, angiogenesis ('tubule' formation) assays recapitulate
microvascular vessels and large veins ("macro vessels"), which form capillary-like structures in vitro. Cells from
macro- and micro- vascular endothelial cells often differ in their tubule formation rates and characteristics,
which emphasizes the need to test compounds against cells derived from both cell types. Since cell motility is

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0129 uM
Key positive control:
suramin

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 uM
Neutral vehicle control:

DMSO

-------
an important component of the ability of cells to form new blood vessels, the cell migration assays will provide
information which complements the information obtained from the tubule assays in the context of
angiogenesis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Endpoints were cell titer and tubule length. Cells were then fixed and visualized by nuclear stain
and CD31-immunofluorescence. Cell counts were output for HUVEC and HDFs with mean total tubule length
area served as the primary output feature. Other tubule related features (tubule count, tubule length, tubule
area, and node count) were processed as well. Plate-level raw data, provided by each assay source, were
received by EPA from each contractor and analyzed using the ToxCast Pipeline (tcpl). Normalized response
values for each assay endpoint were calculated as resp = log2(rval/bval) where rval, bval, and pval correspond
to the raw value, the plate level DMSO control median, and the plate level positive/negative control median,
respectively.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

-------
4: pval.apid.medpcbyconc.min (Calculate the positive control value (pval) as the plate-wise minimum, by
assay plate ID (apid), of the medians of corrected value (cval) of gain-of-signal single- or multiple-
concentration positive control wells (wilt = p or c) by apid, well type, and concentration.), 5: resp.pc
(Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference between
the corrected (cval) and baseline (bval) values divided the difference between the positive control (pval)
and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 48

Number of chemicals tested: 46

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
10

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	4938

Neutral control median absolute deviation, by plate: nmad	1283.932

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	28.31%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 3.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3021

VALA_TU B H U V_Antago n ist_Cel I Co u nt

1.	General Information

1.1	Assay Title: Viability Assessment in the VALA Sciences human Umbilical Vein Endothelial Tubule Formation
(TUBHUV) Antagonism Assay

1.2	Assay Summary: VALA_TUBHUV_Antagonist is a cell-based, multiplexed assay that uses human umbilical vein
endothelial cells (HUVEC). Measurements were taken 72 hours after chemical dosing in a 96-well plate.
VALA_TUBHUV_Antagonist_CellCount is the one of two assay components measured from the
VALA_TUBHUV_Antagonist assay. It is designed to make measurements of viability, a form of viability reporter,
as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the assay
component VALA_TUBHUV_Antagonist_CellCount was analyzed at the endpoint,
VALA_TUBHUV_Antagonist_CellCount, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of viability reporter, loss-of-signal activity can be used to
understand viability. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cell cycle intended target family, where the subfamily is cell count.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Vala Sciences, a Contract Research Organization (CRO), provides cell-based assay services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Endothelial cell culture, treatments, and assay endpoint measurements were conducted
independently by VALA sciences (VALA).

1.9	Assay Throughput: 384-well plate. VALA systems offer medium-to-high throughput chemical screening
amenability in a 96 well format for both neurogenic- or angiogenic-specific endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in fluorescence intensity related to the number of DAPI labelled nuceli is indicative of the
viability of the system.

Chemical-induced perturbations to cellular key events across angiogenic and neurogenic outcomes in vitro can
inform on cell-based prioritization of neurovascular unit (NVU) hazard potential.

2.2	Scientific Principles: The developing neurovascular unit (NVU) gives rise to the preliminary interaction of the
fetal central nervous system and circulatory system and ultimately the microvasculature forming a blood-brain-
barrier (BBB). As the NVU comprises multiple cell types interacting at various stages of development, a
methodology combining high-throughput results across pertinent cell-based assays is needed to investigate
potential chemical-induced disruption to the development of this complex cell system. With each in vitro model,
different characteristics of the developing NVU can be recapitulated to identify the biological complexity
necessary for NVU development, i.e. vascular remodeling, barrier integrity measurement, transporter kinetics
for influx/efflux of xenobiotics, cell-cell interactions for tissue development, etc. ArunA and VALA assay sources
comprise several neurogenic and angiogenic endpoints, contribute to the development of the neurovascular

-------
toxicity signature. The migration of endothelial cells is a critical part of angiogenesis. The scratch assay is widely
utilized in studies of angiogenesis, to quantify the migration of endothelial cells, and serves to complement
information obtained with the angiogenesis "tube formation" assays. These assays are relevant to embryology,
and also to toxicity/pathology. In vitro tubule formation was monitored in human endothelial cells co-cultured
with primary human dermal fibroblast (HDF) cells through VALA Sciences. Endothelial cells isolated from human
umbilical cord (HUVECs) assemble into branched networks, which represent the initiation of blood vessel
formation, when cultured onto a fibroblast or gel layer (Sarkanen et al, 2010).

2.3	Experimental System: adherent HUVEC cell line used. The VALA tubule formation assay used cryopreserved
human endothelial (HUVEC) cells (CAI catalog 200-05f) co-cultured with primary human dermal fibroblast (HDF)
cells through VALA Sciences.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: This HUVEC-HDF assay was designed to detect chemical suppression of tubulogenesis, i.e.
antagonism. To measure tubulogenesis antagonism, HUVEC cells were co-cultured with HDF cells at a 1:3 ratio
for 3-days in 10 ng/ml VEGF (i.e., favorable conditions with growth factors), then changed to assay medium
containing 10 ng/ml VEGF and ToxCast compounds for an additional 3-day period. Neutral control for tubule
formation was 0.1% DMSO vehicle while the positive control for tubulogenesis inhibition was suramin (100 uM).
Endpoints were cell titer and tubule length. Cells were then fixed and visualized by nuclear stain and CD31-
immunofluorescence. Cell counts were output for HUVEC and HDFs with mean total tubule length area served
as the primary output feature. Other tubule related features (tubule count, tubule length, tubule area, and node
count) were processed as well.

Baseline median absolute deviation for the assay (bmad): 0.048
Response cutoff threshold used to determine hit calls: 0.143
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: Cell motility is fundamental to a variety of biological processes, ranging from development of neural-
crest-derived structures in embryology, wound healing, angiogenesis, and cell migration in the development,
growth, and metastasis of tumors. To quantify the effects of test compounds on cell motility in a format enabling
high throughput format, Vala Sciences Inc has developed a protocol for "scratch assays" compatible with cells
plated in 384 well dishes. In these experiments, cells are seeded, and allowed to form confluent monolayers.
The monolayer is then "scratched" in a reproducible manner, so that cells are removed from the center of the
well, creating a semi-rectangular empty space. The cells are then further incubated, for 7 to 48 hrs, in the
presence of test compounds. During this time period, cells from the margins of the wells migrate into the center.
The number of cells can then be quantified via methods of high content analysis (identification of nuclei, cell
shape, and expression of relevant biomarkers). If migration is inhibited, fewer cells enter the wound area.
Additionally, using these same endothelial cells, angiogenesis ('tubule' formation) assays recapitulate
microvascular vessels and large veins ("macro vessels"), which form capillary-like structures in vitro. Cells from
macro- and micro- vascular endothelial cells often differ in their tubule formation rates and characteristics,
which emphasizes the need to test compounds against cells derived from both cell types. Since cell motility is
an important component of the ability of cells to form new blood vessels, the cell migration assays will provide
information which complements the information obtained from the tubule assays in the context of
angiogenesis.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0129 uM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 uM
Neutral vehicle control:

DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Endpoints were cell titer and tubule length. Cells were then fixed and visualized by nuclear stain
and CD31-immunofluorescence. Cell counts were output for HUVEC and HDFs with mean total tubule length
area served as the primary output feature. Other tubule related features (tubule count, tubule length, tubule
area, and node count) were processed as well. Plate-level raw data, provided by each assay source, were
received by EPA from each contractor and analyzed using the ToxCast Pipeline (tcpl). Normalized response
values for each assay endpoint were calculated as resp = log2(rval/bval) where rval, bval, and pval correspond
to the raw value, the plate level DMSO control median, and the plate level positive/negative control median,
respectively.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay
plate ID (apid), of the corrected values (cval) for neutral control wells (wilt = n).)

-------
Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Numberof samples tested: 48	Number of chemicals tested: 46

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
2

20

26

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

1
9

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

7

1

quadratic-polynomialfpoly2) model: 7

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

22

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	7268

Neutral control median absolute deviation, by plate: nmad	243.146

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	3.35%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3022

VALA_TU BHUV_Antagonist_Tu bu leLength

1.	General Information

1.1	Assay Title: VALA Sciences human Umbilical Vein Endothelial Tubule Formation (TUBHUV) Antagonism Assay

1.2	Assay Summary: VALA_TUBHUV_Antagonist is a cell-based, multiplexed assay that uses human umbilical vein
endothelial cells (HUVEC). Measurements were taken 72 hours after chemical dosing in a 96-well plate.
VALA_TUBHUV_Antagonist_TubuleLength is the one of two assay components measured from the
VALA_TUBHUV_Antagonist assay. It is designed to make measurements of tubulogenesis, a form of morphology
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. This
VALA_TUBHUV_Antagonist assay was designed to detect chemical suppression of tubulogenesis, i.e.
antagonism. Data from the assay component VALA_TUBHUV_Antagonist_TubuleLength was analyzed at the
endpoint, VALA_TUBHUV_Antagonist_TubuleLength, in the positive analysis fitting direction relative to DMSO
as the negative control and baseline of activity. Using a type of morphology reporter, loss-of-signal activity can
be used to understand developmental effects. To generalize the intended target to other relatable targets, this
assay endpoint is annotated to the neurodevelopment intended target family, where the subfamily is
tubulogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Vala Sciences, a Contract Research Organization (CRO), provides cell-based assay services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Endothelial cell culture, treatments, and assay endpoint measurements were conducted
independently by VALA sciences (VALA).

1.9	Assay Throughput: 384-well plate. VALA systems offer medium-to-high throughput chemical screening
amenability in a 96 well format for both neurogenic- or angiogenic-specific endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Vein growth supports angiogenesis, which is critical for development, maturation, and
reproductive function. Dysregulation of angiogenesis can lead to birth defects, macular degeneration, impaired
wound healing, and tumor growth. This assay measures compound effects on vein formation in complete
medium, which contains growth factors that support vein endothelial cell proliferation. Tube formation
antagonists will decrease vessel growth in these conditions.

Chemical-induced perturbations to cellular key events across angiogenic and neurogenic outcomes in vitro can
inform on cell-based prioritization of neurovascular unit (NVU) hazard potential.

2.2	Scientific Principles: The developing neurovascular unit (NVU) gives rise to the preliminary interaction of the
fetal central nervous system and circulatory system and ultimately the microvasculature forming a blood-brain-
barrier (BBB). As the NVU comprises multiple cell types interacting at various stages of development, a
methodology combining high-throughput results across pertinent cell-based assays is needed to investigate
potential chemical-induced disruption to the development of this complex cell system. With each in vitro model,

-------
different characteristics of the developing NVU can be recapitulated to identify the biological complexity
necessary for NVU development, i.e. vascular remodeling, barrier integrity measurement, transporter kinetics
for influx/efflux of xenobiotics, cell-cell interactions for tissue development, etc. ArunA and VALA assay sources
comprise several neurogenic and angiogenic endpoints, contribute to the development of the neurovascular
toxicity signature. The migration of endothelial cells is a critical part of angiogenesis. The scratch assay is widely
utilized in studies of angiogenesis, to quantify the migration of endothelial cells, and serves to complement
information obtained with the angiogenesis "tube formation" assays. These assays are relevant to embryology,
and also to toxicity/pathology. In vitro tubule formation was monitored in human endothelial cells co-cultured
with primary human dermal fibroblast (HDF) cells through VALA Sciences. Endothelial cells isolated from human
umbilical cord (HUVECs) assemble into branched networks, which represent the initiation of blood vessel
formation, when cultured onto a fibroblast or gel layer (Sarkanen et al, 2010).

2.3	Experimental System: adherent HUVEC cell line used. The VALA tubule formation assay used cryopreserved
human endothelial (HUVEC) cells (CAI catalog 200-05f) co-cultured with primary human dermal fibroblast (HDF)
cells through VALA Sciences.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: This HUVEC-HDF assay was designed to detect chemical suppression of tubulogenesis, i.e.
antagonism. To measure tubulogenesis antagonism, HUVEC cells were co-cultured with HDF cells at a 1:3 ratio
for 3-days in 10 ng/ml VEGF (i.e., favorable conditions with growth factors), then changed to assay medium
containing 10 ng/ml VEGF and ToxCast compounds for an additional 3-day period. Neutral control for tubule
formation was 0.1% DMSO vehicle while the positive control for tubulogenesis inhibition was suramin (100 uM).
Endpoints were cell titer and tubule length. Cells were then fixed and visualized by nuclear stain and CD31-
immunofluorescence. Cell counts were output for HUVEC and HDFs with mean total tubule length area served
as the primary output feature. Other tubule related features (tubule count, tubule length, tubule area, and node
count) were processed as well.

Baseline median absolute deviation for the assay (bmad): 7.197
Response cutoff threshold used to determine hit calls: 21.592
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: Cell motility is fundamental to a variety of biological processes, ranging from development of neural-
crest-derived structures in embryology, wound healing, angiogenesis, and cell migration in the development,
growth, and metastasis of tumors. To quantify the effects of test compounds on cell motility in a format enabling
high throughput format, Vala Sciences Inc has developed a protocol for "scratch assays" compatible with cells
plated in 384 well dishes. In these experiments, cells are seeded, and allowed to form confluent monolayers.
The monolayer is then "scratched" in a reproducible manner, so that cells are removed from the center of the
well, creating a semi-rectangular empty space. The cells are then further incubated, for 7 to 48 hrs, in the
presence of test compounds. During this time period, cells from the margins of the wells migrate into the center.
The number of cells can then be quantified via methods of high content analysis (identification of nuclei, cell
shape, and expression of relevant biomarkers). If migration is inhibited, fewer cells enter the wound area.
Additionally, using these same endothelial cells, angiogenesis ('tubule' formation) assays recapitulate
microvascular vessels and large veins ("macro vessels"), which form capillary-like structures in vitro. Cells from
macro- and micro- vascular endothelial cells often differ in their tubule formation rates and characteristics,

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0129 uM
Key positive control:
suramin

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 uM
Neutral vehicle control:

DMSO

-------
which emphasizes the need to test compounds against cells derived from both cell types. Since cell motility is
an important component of the ability of cells to form new blood vessels, the cell migration assays will provide
information which complements the information obtained from the tubule assays in the context of
angiogenesis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Endpoints were cell titer and tubule length. Cells were then fixed and visualized by nuclear stain
and CD31-immunofluorescence. Cell counts were output for HUVEC and HDFs with mean total tubule length
area served as the primary output feature. Other tubule related features (tubule count, tubule length, tubule
area, and node count) were processed as well. Plate-level raw data, provided by each assay source, were
received by EPA from each contractor and analyzed using the ToxCast Pipeline (tcpl). Normalized response
values for each assay endpoint were calculated as resp = log2(rval/bval) where rval, bval, and pval correspond
to the raw value, the plate level DMSO control median, and the plate level positive/negative control median,
respectively.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

-------
5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 6:
resp.multnegl (Multiply the normalized response value (resp) by -1; -l*resp.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).), 15: pval.apid.medncbyconc.min (Calculate the positive
control value (pval) as the plate-wise minimum, by assay plate ID (apid), of the medians of the corrected
values (cval) for gain-of-signal single- or multiple-concentration negative control wells (wilt = m or o) by
apid, well type, and concentration.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

-------
SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 48

Number of chemicals tested: 46

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
8

Inactive hit count: 0
-------
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	66352

Neutral control median absolute deviation, by plate: nmad	3860.69

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	5.83%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) / sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

-------
ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 0.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3023

VALA_TUBIPS_Agonist_CellCount

1.	General Information

1.1	Assay Title: Viability Assessment in the VALA Sciences human iPS-derived Endothelial Tubule Formation (TUBIPS)
Agonism Assay

1.2	Assay Summary: VALA_TUBIPS_Agonist is a cell-based, multiplexed assay that uses human iPS-derived
endothelial cells (iPSC). Measurements were taken 72 hours after chemical dosing in a 96-well plate.
VALA_TUBIPS_Agonist_CellCount is the one of two assay components measured from the
VALA_TUBIPS_Agonist assay. It is designed to make measurements of viability, a form of viability reporter, as
detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the assay
component VALA_TUBIPS_Agonist_CellCount was analyzed at the endpoint, VALA_TUBIPS_Agonist_CellCount,
in the positive analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using
a type of viability reporter, loss-of-signal activity can be used to understand viability. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where
the subfamily is cell count.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Vala Sciences, a Contract Research Organization (CRO), provides cell-based assay services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Endothelial cell culture, treatments, and assay endpoint measurements were conducted
independently by VALA sciences (VALA).

1.9	Assay Throughput: 384-well plate. VALA systems offer medium-to-high throughput chemical screening
amenability in a 96 well format for both neurogenic- or angiogenic-specific endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in fluorescence intensity related to the number of DAPI labelled nuceli is indicative of the
viability of the system.

Chemical-induced perturbations to cellular key events across angiogenic and neurogenic outcomes in vitro can
inform on cell-based prioritization of neurovascular unit (NVU) hazard potential.

2.2	Scientific Principles: The developing neurovascular unit (NVU) gives rise to the preliminary interaction of the
fetal central nervous system and circulatory system and ultimately the microvasculature forming a blood-brain-
barrier (BBB). As the NVU comprises multiple cell types interacting at various stages of development, a
methodology combining high-throughput results across pertinent cell-based assays is needed to investigate
potential chemical-induced disruption to the development of this complex cell system. With each in vitro model,
different characteristics of the developing NVU can be recapitulated to identify the biological complexity
necessary for NVU development, i.e. vascular remodeling, barrier integrity measurement, transporter kinetics
for influx/efflux of xenobiotics, cell-cell interactions for tissue development, etc. ArunA and VALA assay sources
comprise several neurogenic and angiogenic endpoints, contribute to the development of the neurovascular

-------
toxicity signature. The migration of endothelial cells is a critical part of angiogenesis. The scratch assay is widely
utilized in studies of angiogenesis, to quantify the migration of endothelial cells, and serves to complement
information obtained with the angiogenesis "tube formation" assays. These assays are relevant to embryology,
and also to toxicity/pathology. In vitro tubule formation was monitored in human endothelial cells co-cultured
with primary human dermal fibroblast (HDF) cells through VALA Sciences. Endothelial cells isolated from human
umbilical cord (HUVECs) assemble into branched networks, which represent the initiation of blood vessel
formation, when cultured onto a fibroblast or gel layer (Sarkanen et al, 2010).

2.3	Experimental System: adherent iPSC cell line used. The VALA tubule formation assay used cryopreserved human
endothelial (HUVEC) cells (CAI catalog 200-05f) co-cultured with primary human dermal fibroblast (HDF) cells
through VALA Sciences.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: This HUVEC-HDF assay was designed to detect chemical suppression of tubulogenesis, i.e.
antagonism. To measure tubulogenesis antagonism, HUVEC cells were co-cultured with HDF cells at a 1:3 ratio
for 3-days in 10 ng/ml VEGF (i.e., favorable conditions with growth factors), then changed to assay medium
containing 10 ng/ml VEGF and ToxCast compounds for an additional 3-day period. Neutral control for tubule
formation was 0.1% DMSO vehicle while the positive control for tubulogenesis inhibition was suramin (100 uM).
Endpoints were cell titer and tubule length. Cells were then fixed and visualized by nuclear stain and CD31-
immunofluorescence. Cell counts were output for HUVEC and HDFs with mean total tubule length area served
as the primary output feature. Other tubule related features (tubule count, tubule length, tubule area, and node
count) were processed as well.

Baseline median absolute deviation for the assay (bmad): 0.121
Response cutoff threshold used to determine hit calls: 0.363
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: Cell motility is fundamental to a variety of biological processes, ranging from development of neural-
crest-derived structures in embryology, wound healing, angiogenesis, and cell migration in the development,
growth, and metastasis of tumors. To quantify the effects of test compounds on cell motility in a format enabling
high throughput format, Vala Sciences Inc has developed a protocol for "scratch assays" compatible with cells
plated in 384 well dishes. In these experiments, cells are seeded, and allowed to form confluent monolayers.
The monolayer is then "scratched" in a reproducible manner, so that cells are removed from the center of the
well, creating a semi-rectangular empty space. The cells are then further incubated, for 7 to 48 hrs, in the
presence of test compounds. During this time period, cells from the margins of the wells migrate into the center.
The number of cells can then be quantified via methods of high content analysis (identification of nuclei, cell
shape, and expression of relevant biomarkers). If migration is inhibited, fewer cells enter the wound area.
Additionally, using these same endothelial cells, angiogenesis ('tubule' formation) assays recapitulate
microvascular vessels and large veins ("macro vessels"), which form capillary-like structures in vitro. Cells from
macro- and micro- vascular endothelial cells often differ in their tubule formation rates and characteristics,
which emphasizes the need to test compounds against cells derived from both cell types. Since cell motility is
an important component of the ability of cells to form new blood vessels, the cell migration assays will provide
information which complements the information obtained from the tubule assays in the context of
angiogenesis.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0129 uM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 uM
Neutral vehicle control:

DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Endpoints were cell titer and tubule length. Cells were then fixed and visualized by nuclear stain
and CD31-immunofluorescence. Cell counts were output for HUVEC and HDFs with mean total tubule length
area served as the primary output feature. Other tubule related features (tubule count, tubule length, tubule
area, and node count) were processed as well. Plate-level raw data, provided by each assay source, were
received by EPA from each contractor and analyzed using the ToxCast Pipeline (tcpl). Normalized response
values for each assay endpoint were calculated as resp = log2(rval/bval) where rval, bval, and pval correspond
to the raw value, the plate level DMSO control median, and the plate level positive/negative control median,
respectively.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay
plate ID (apid), of the corrected values (cval) for neutral control wells (wilt = n).)

-------
Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Numberof samples tested: 48	Number of chemicals tested: 46

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
14

28

6

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

2

3

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

3

14

quadratic-polynomialfpoly2) model: 7

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

18

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	7042

Neutral control median absolute deviation, by plate: nmad	612.314

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	9.35%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3024

VALA_TU BI PS_Agon ist_Tu bu leLength

1.	General Information

1.1	Assay Title: VALA Sciences human iPS-derived Endothelial Tubule Formation (TUBIPS) Agonism Assay

1.2	Assay Summary: VALA_TUBIPS_Agonist is a cell-based, multiplexed assay that uses human iPS-derived
endothelial cells (iPSC). Measurements were taken 72 hours after chemical dosing in a 96-well plate.
VALA_TUBIPS_Agonist_TubuleLength is the one of two assay components measured from the
VALA_TUBIPS_Agonist assay. It is designed to make measurements of tubulogenesis, a form of morphology
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. This
VALA_TUBIPS1 assay was designed to detect chemical activation of tubulogenesis, i.e. agonism. Data from the
assay component VALA_TUBIPS_Agonist_TubuleLength was analyzed at the endpoint,
VALA_TUBIPS_Agonist_TubuleLength, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of morphology reporter, gain-of-signal activity can be used to
understand developmental effects. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the neurodevelopment intended target family, where the subfamily is tubulogenesis.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Vala Sciences, a Contract Research Organization (CRO), provides cell-based assay services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Endothelial cell culture, treatments, and assay endpoint measurements were conducted
independently by VALA sciences (VALA).

1.9	Assay Throughput: 384-well plate. VALA systems offer medium-to-high throughput chemical screening
amenability in a 96 well format for both neurogenic- or angiogenic-specific endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Blood vessel growth supports angiogenesis, which is critical for development, maturation, and
reproductive function. Dysregulation of angiogenesis can lead to birth defects, macular degeneration, impaired
wound healing, and tumor growth. This assay measures compound effects on blood vessel formation in basal
medium, which lacks growth factors that support vessel growth. Tube formation agonists will increase vessel
growth in these conditions.

Chemical-induced perturbations to cellular key events across angiogenic and neurogenic outcomes in vitro can
inform on cell-based prioritization of neurovascular unit (NVU) hazard potential.

2.2	Scientific Principles: The developing neurovascular unit (NVU) gives rise to the preliminary interaction of the
fetal central nervous system and circulatory system and ultimately the microvasculature forming a blood-brain-
barrier (BBB). As the NVU comprises multiple cell types interacting at various stages of development, a
methodology combining high-throughput results across pertinent cell-based assays is needed to investigate
potential chemical-induced disruption to the development of this complex cell system. With each in vitro model,
different characteristics of the developing NVU can be recapitulated to identify the biological complexity

-------
necessary for NVU development, i.e. vascular remodeling, barrier integrity measurement, transporter kinetics
for influx/efflux of xenobiotics, cell-cell interactions for tissue development, etc. ArunA and VALA assay sources
comprise several neurogenic and angiogenic endpoints, contribute to the development of the neurovascular
toxicity signature. The migration of endothelial cells is a critical part of angiogenesis. The scratch assay is widely
utilized in studies of angiogenesis, to quantify the migration of endothelial cells, and serves to complement
information obtained with the angiogenesis "tube formation" assays. These assays are relevant to embryology,
and also to toxicity/pathology. In vitro tubule formation was monitored in human endothelial cells co-cultured
with primary human dermal fibroblast (HDF) cells through VALA Sciences. Endothelial cells isolated from human
umbilical cord (HUVECs) assemble into branched networks, which represent the initiation of blood vessel
formation, when cultured onto a fibroblast or gel layer (Sarkanen et a I, 2010).

2.3	Experimental System: adherent iPSC cell line used. The VALA tubule formation assay used cryopreserved human
endothelial (HUVEC) cells (CAI catalog 200-05f) co-cultured with primary human dermal fibroblast (HDF) cells
through VALA Sciences.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: This HUVEC-HDF assay was designed to detect chemical suppression of tubulogenesis, i.e.
antagonism. To measure tubulogenesis antagonism, HUVEC cells were co-cultured with HDF cells at a 1:3 ratio
for 3-days in 10 ng/ml VEGF (i.e., favorable conditions with growth factors), then changed to assay medium
containing 10 ng/ml VEGF and ToxCast compounds for an additional 3-day period. Neutral control for tubule
formation was 0.1% DMSO vehicle while the positive control for tubulogenesis inhibition was suramin (100 uM).
Endpoints were cell titer and tubule length. Cells were then fixed and visualized by nuclear stain and CD31-
immunofluorescence. Cell counts were output for HUVEC and HDFs with mean total tubule length area served
as the primary output feature. Other tubule related features (tubule count, tubule length, tubule area, and node
count) were processed as well.

Baseline median absolute deviation for the assay (bmad): 21.88
Response cutoff threshold used to determine hit calls: 65.64
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: Cell motility is fundamental to a variety of biological processes, ranging from development of neural-
crest-derived structures in embryology, wound healing, angiogenesis, and cell migration in the development,
growth, and metastasis of tumors. To quantify the effects of test compounds on cell motility in a format enabling
high throughput format, Vala Sciences Inc has developed a protocol for "scratch assays" compatible with cells
plated in 384 well dishes. In these experiments, cells are seeded, and allowed to form confluent monolayers.
The monolayer is then "scratched" in a reproducible manner, so that cells are removed from the center of the
well, creating a semi-rectangular empty space. The cells are then further incubated, for 7 to 48 hrs, in the
presence of test compounds. During this time period, cells from the margins of the wells migrate into the center.
The number of cells can then be quantified via methods of high content analysis (identification of nuclei, cell
shape, and expression of relevant biomarkers). If migration is inhibited, fewer cells enter the wound area.
Additionally, using these same endothelial cells, angiogenesis ('tubule' formation) assays recapitulate
microvascular vessels and large veins ("macro vessels"), which form capillary-like structures in vitro. Cells from
macro- and micro- vascular endothelial cells often differ in their tubule formation rates and characteristics,
which emphasizes the need to test compounds against cells derived from both cell types. Since cell motility is

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0129 uM
Key positive control:
suramin

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 uM
Neutral vehicle control:

DMSO

-------
an important component of the ability of cells to form new blood vessels, the cell migration assays will provide
information which complements the information obtained from the tubule assays in the context of
angiogenesis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Endpoints were cell titer and tubule length. Cells were then fixed and visualized by nuclear stain
and CD31-immunofluorescence. Cell counts were output for HUVEC and HDFs with mean total tubule length
area served as the primary output feature. Other tubule related features (tubule count, tubule length, tubule
area, and node count) were processed as well. Plate-level raw data, provided by each assay source, were
received by EPA from each contractor and analyzed using the ToxCast Pipeline (tcpl). Normalized response
values for each assay endpoint were calculated as resp = log2(rval/bval) where rval, bval, and pval correspond
to the raw value, the plate level DMSO control median, and the plate level positive/negative control median,
respectively.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

-------
4: pval.apid.medpcbyconc.min (Calculate the positive control value (pval) as the plate-wise minimum, by
assay plate ID (apid), of the medians of corrected value (cval) of gain-of-signal single- or multiple-
concentration positive control wells (wilt = p or c) by apid, well type, and concentration.), 5: resp.pc
(Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference between
the corrected (cval) and baseline (bval) values divided the difference between the positive control (pval)
and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 11: bval.apid.nwlls.med
(Calculate the baseline value (bval) as the plate-wise median, by assay plate ID (apid), of the corrected
values (cval) for neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

-------
Number of samples tested: 48

Number of chemicals tested: 46

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
0

Inactive hit count: 0
-------
3.4 Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability

of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	17544

Neutral control median absolute deviation, by plate: nmad	3109.012

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	17.55%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

-------
The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4 Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-

research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3025

VALA_TU BI PS_Antagon ist_Cel ICou nt

1.	General Information

1.1	Assay Title: Viability Assessment in the VALA Sciences human iPS-derived Endothelial Tubule Formation (TUBIPS)
Antagonism Assay

1.2	Assay Summary: VALA_TUBIPS_Agonist is a cell-based, multiplexed assay that uses human iPS-derived
endothelial cells (iPSC). Measurements were taken 72 hours after chemical dosing in a 96-well plate.
VALA_TUBIPS_Antagonist_CellCount is the one of two assay components measured from the
VALA_TUBIPS_Antagonist assay. It is designed to make measurements of viability, a form of viability reporter,
as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the assay
component VALA_TUBIPS_Antagonist_CellCount was analyzed at the endpoint,
VALA_TUBIPS_Antagonist_CellCount, in the positive analysis fitting direction relative to DMSO as the negative
control and baseline of activity. Using a type of viability reporter, loss-of-signal activity can be used to
understand viability. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cell cycle intended target family, where the subfamily is cell count.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Vala Sciences, a Contract Research Organization (CRO), provides cell-based assay services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Endothelial cell culture, treatments, and assay endpoint measurements were conducted
independently by VALA sciences (VALA).

1.9	Assay Throughput: 384-well plate. VALA systems offer medium-to-high throughput chemical screening
amenability in a 96 well format for both neurogenic- or angiogenic-specific endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in fluorescence intensity related to the number of DAPI labelled nuceli is indicative of the
viability of the system.

Chemical-induced perturbations to cellular key events across angiogenic and neurogenic outcomes in vitro can
inform on cell-based prioritization of neurovascular unit (NVU) hazard potential.

2.2	Scientific Principles: The developing neurovascular unit (NVU) gives rise to the preliminary interaction of the
fetal central nervous system and circulatory system and ultimately the microvasculature forming a blood-brain-
barrier (BBB). As the NVU comprises multiple cell types interacting at various stages of development, a
methodology combining high-throughput results across pertinent cell-based assays is needed to investigate
potential chemical-induced disruption to the development of this complex cell system. With each in vitro model,
different characteristics of the developing NVU can be recapitulated to identify the biological complexity
necessary for NVU development, i.e. vascular remodeling, barrier integrity measurement, transporter kinetics
for influx/efflux of xenobiotics, cell-cell interactions for tissue development, etc. ArunA and VALA assay sources
comprise several neurogenic and angiogenic endpoints, contribute to the development of the neurovascular

-------
toxicity signature. The migration of endothelial cells is a critical part of angiogenesis. The scratch assay is widely
utilized in studies of angiogenesis, to quantify the migration of endothelial cells, and serves to complement
information obtained with the angiogenesis "tube formation" assays. These assays are relevant to embryology,
and also to toxicity/pathology. In vitro tubule formation was monitored in human endothelial cells co-cultured
with primary human dermal fibroblast (HDF) cells through VALA Sciences. Endothelial cells isolated from human
umbilical cord (HUVECs) assemble into branched networks, which represent the initiation of blood vessel
formation, when cultured onto a fibroblast or gel layer (Sarkanen et al, 2010).

2.3	Experimental System: adherent iPSC cell line used. The VALA tubule formation assay used cryopreserved human
endothelial (HUVEC) cells (CAI catalog 200-05f) co-cultured with primary human dermal fibroblast (HDF) cells
through VALA Sciences.

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: This HUVEC-HDF assay was designed to detect chemical suppression of tubulogenesis, i.e.
antagonism. To measure tubulogenesis antagonism, HUVEC cells were co-cultured with HDF cells at a 1:3 ratio
for 3-days in 10 ng/ml VEGF (i.e., favorable conditions with growth factors), then changed to assay medium
containing 10 ng/ml VEGF and ToxCast compounds for an additional 3-day period. Neutral control for tubule
formation was 0.1% DMSO vehicle while the positive control for tubulogenesis inhibition was suramin (100 uM).
Endpoints were cell titer and tubule length. Cells were then fixed and visualized by nuclear stain and CD31-
immunofluorescence. Cell counts were output for HUVEC and HDFs with mean total tubule length area served
as the primary output feature. Other tubule related features (tubule count, tubule length, tubule area, and node
count) were processed as well.

Baseline median absolute deviation for the assay (bmad): 0.026
Response cutoff threshold used to determine hit calls: 0.077
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: Cell motility is fundamental to a variety of biological processes, ranging from development of neural-
crest-derived structures in embryology, wound healing, angiogenesis, and cell migration in the development,
growth, and metastasis of tumors. To quantify the effects of test compounds on cell motility in a format enabling
high throughput format, Vala Sciences Inc has developed a protocol for "scratch assays" compatible with cells
plated in 384 well dishes. In these experiments, cells are seeded, and allowed to form confluent monolayers.
The monolayer is then "scratched" in a reproducible manner, so that cells are removed from the center of the
well, creating a semi-rectangular empty space. The cells are then further incubated, for 7 to 48 hrs, in the
presence of test compounds. During this time period, cells from the margins of the wells migrate into the center.
The number of cells can then be quantified via methods of high content analysis (identification of nuclei, cell
shape, and expression of relevant biomarkers). If migration is inhibited, fewer cells enter the wound area.
Additionally, using these same endothelial cells, angiogenesis ('tubule' formation) assays recapitulate
microvascular vessels and large veins ("macro vessels"), which form capillary-like structures in vitro. Cells from
macro- and micro- vascular endothelial cells often differ in their tubule formation rates and characteristics,
which emphasizes the need to test compounds against cells derived from both cell types. Since cell motility is
an important component of the ability of cells to form new blood vessels, the cell migration assays will provide
information which complements the information obtained from the tubule assays in the context of
angiogenesis.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0129 uM
Key positive control:

NA

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 uM
Neutral vehicle control:

DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Endpoints were cell titer and tubule length. Cells were then fixed and visualized by nuclear stain
and CD31-immunofluorescence. Cell counts were output for HUVEC and HDFs with mean total tubule length
area served as the primary output feature. Other tubule related features (tubule count, tubule length, tubule
area, and node count) were processed as well. Plate-level raw data, provided by each assay source, were
received by EPA from each contractor and analyzed using the ToxCast Pipeline (tcpl). Normalized response
values for each assay endpoint were calculated as resp = log2(rval/bval) where rval, bval, and pval correspond
to the raw value, the plate level DMSO control median, and the plate level positive/negative control median,
respectively.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay
plate ID (apid), of the corrected values (cval) for neutral control wells (wilt = n).)

-------
Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Numberof samples tested: 48	Number of chemicals tested: 46

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
2

28

18

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

4
9

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

8

1

quadratic-polynomialfpoly2) model: 2

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

22

2

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	12593.5

Neutral control median absolute deviation, by plate: nmad	203.116

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	1.61%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3027

VALA_M IG H UVl_ScratchOn ly_Cel ICou nt

1.	General Information

1.1	Assay Title: Viability Assessment in the VALA Sciences human Umbilical Vein Endothelial Cell Migration
(MIGHUV) Assay, Scratch Wound Inhibited Recovery

1.2	Assay Summary: VALA_MIGHUVl_ScratchOnly is a cell-based, multiplexed assay that uses umbilical vein-
derived endothelial cells (HUVEC). Measurements were taken 24 hours after chemical dosing in a 96-well plate.
VALA_MIGHUVl_ScratchOnly_CellCount is the one of two assay components measured from the
VALA_MIGHUVl_ScratchOnly assay. It is designed to make measurements of viability, a form of viability
reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the
assay component VALA_MIGHUVl_ScratchOnly_CellCount was analyzed at the endpoint,
VALA_MIGHUVl_ScratchOnly_CellCount, in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of viability reporter, loss-of-signal activity can be used to
understand viability. To generalize the intended target to other relatable targets, this assay endpoint is
annotated to the cell cycle intended target family, where the subfamily is cell count.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Vala Sciences, a Contract Research Organization (CRO), provides cell-based assay services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Endothelial cell culture, treatments, and assay endpoint measurements were conducted
independently by VALA sciences (VALA).

1.9	Assay Throughput: 384-well plate. VALA systems offer medium-to-high throughput chemical screening
amenability in a 96 well format for both neurogenic- or angiogenic-specific endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in fluorescence intensity related to the number of DAPI labelled nuceli is indicative of the
viability of the system.

Chemical-induced perturbations to cellular key events across angiogenic and neurogenic outcomes in vitro can
inform on cell-based prioritization of neurovascular unit (NVU) hazard potential.

2.2	Scientific Principles: The developing neurovascular unit (NVU) gives rise to the preliminary interaction of the
fetal central nervous system and circulatory system and ultimately the microvasculature forming a blood-brain-
barrier (BBB). As the NVU comprises multiple cell types interacting at various stages of development, a
methodology combining high-throughput results across pertinent cell-based assays is needed to investigate
potential chemical-induced disruption to the development of this complex cell system. With each in vitro model,
different characteristics of the developing NVU can be recapitulated to identify the biological complexity
necessary for NVU development, i.e. vascular remodeling, barrier integrity measurement, transporter kinetics
for influx/efflux of xenobiotics, cell-cell interactions for tissue development, etc. ArunA and VALA assay sources
comprise several neurogenic and angiogenic endpoints, contribute to the development of the neurovascular

-------
toxicity signature. The migration of endothelial cells is a critical part of angiogenesis. The scratch assay is widely
utilized in studies of angiogenesis, to quantify the migration of endothelial cells, and serves to complement
information obtained with the angiogenesis "tube formation" assays. These assays are relevant to embryology,
and also to toxicity/pathology.

2.3	Experimental System: adherent HUVEC cell line used. The VALA migration assay used cryopreserved human
umbilical vein endothelial cells (HUVEC) from Cell Applications, Inc. (CAI; San Diego; catalog 200-05f).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Cells were seeded on 384-well microplates (day-O) at a density of 9000 cells per well for
detection of wound area recovery or 4000 cells per well for detection of beta-catenin (CTNB) immunoreactivity
and incubated in Endothelial Cell Growth Medium (CAI, San Diego; catalog 211-500). After 24 hours of culture,
the HUVEC plates were scratched using a custom scratch head with the Bravo liquid handling system. This
created a 'wound' area without cells in each well. Test chemical was added within 30 min, each tested in 3-
replicates across a 6-point concentration response (0.1 % DMSO vehicle control). 'ScratchOnly' Plates were
incubated for a 24 h 'wound recovery' period and then fixed with 4% paraformaldehyde. Hoechst dye was used
to visualize cell nuclei in the wound area, and CTNB immunofluorescence for visualizing CTNB-positive nuclei
(beta-Catenin (L54E2) Mouse mAb (IF Preferred) 2677; Cell Signaling Technology, Danvers, MA). Each HUVEC
plate included neutral vehicle control (0.1 % DMSO) treated wells and Cytochalasin D (actin cytoskeletal
disrupter) or BlO-acetoxime (synthetic GSK3 inhibitor) as positive reference controls for endothelial migration
and nuclear translocation of CTNB, respectively.

Baseline median absolute deviation for the assay (bmad): 0.048
Response cutoff threshold used to determine hit calls: 0.143
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: Cell motility is fundamental to a variety of biological processes, ranging from development of neural-
crest-derived structures in embryology, wound healing, angiogenesis, and cell migration in the development,
growth, and metastasis of tumors. To quantify the effects of test compounds on cell motility in a format enabling
high throughput format, Vala Sciences Inc has developed a protocol for "scratch assays" compatible with cells
plated in 384 well dishes. In these experiments, cells are seeded, and allowed to form confluent monolayers.
The monolayer is then "scratched" in a reproducible manner, so that cells are removed from the center of the
well, creating a semi-rectangular empty space. The cells are then further incubated, for 7 to 48 hrs, in the
presence of test compounds. During this time period, cells from the margins of the wells migrate into the center.
The number of cells can then be quantified via methods of high content analysis (identification of nuclei, cell
shape, and expression of relevant biomarkers). If migration is inhibited, fewer cells enter the wound area.
Additionally, using these same endothelial cells, angiogenesis ('tubule' formation) assays recapitulate
microvascular vessels and large veins ("macro vessels"), which form capillary-like structures in vitro. Cells from
macro- and micro- vascular endothelial cells often differ in their tubule formation rates and characteristics,
which emphasizes the need to test compounds against cells derived from both cell types. Since cell motility is
an important component of the ability of cells to form new blood vessels, the cell migration assays will provide
information which complements the information obtained from the tubule assays in the context of
angiogenesis.

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0129 nM
Key positive control:

BlO-acetoxime

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Plate-level raw data, provided by each assay source, were received by EPA from each contractor
and analyzed using the ToxCast Pipeline (tcpl). Normalized response values for each assay endpoint were
calculated as resp = log2(rval/bval) where rval, bval, and pval correspond to the raw value, the plate level DMSO
control median, and the plate level positive/negative control median, respectively.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay
plate ID (apid), of the corrected values (cval) for neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp

-------
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 48	Number of chemicals tested: 46

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
2

Inactive hit count: Oihitc 0.9
8

WINING MODEL SELECTION

NA hit count: hitc^O
38

Number of sample-assay endpoints with winning hill model:
gain-loss (gnls) model:

2
8

-------
power(pow) model:
linear-polynomial (polyl) model:

0

1

quadratic-polynomialfpoly2) model: 2

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

34

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

-------
Neutral control well median response value, by plate: nmed

5918

Neutral control median absolute deviation, by plate: nmad	204.599

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	3.58%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1- ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 1.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical

-------
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3028

VALA_MIGH U Vl_ScratchOn ly_Wou nd Area

1.	General Information

1.1	Assay Title: VALA Sciences human Umbilical Vein Endothelial Cell Migration (MIGHUV) Assay, Area of Scratch
Wound Inhibited Recovery

1.2	Assay Summary: VALA_MIGHUVl_ScratchOnly is a cell-based, multiplexed assay that uses umbilical vein-
derived endothelial cells (HUVEC). Measurements were taken 24 hours after chemical dosing in a 96-well plate.
VALA_MIGHUVl_ScratchOnly_WoundArea is the one of two assay components measured from the
VALA_MIGHUVl_ScratchOnly assay. It is designed to make measurements of endothelial migration, a form of
morphology reporter, as detected with fluorescence intensity signals by HCS Fluorescent Imaging technology.
Data from the assay component VALA_MIGHUVl_ScratchOnly_WoundArea was analyzed at the endpoint,
VALA_MIGHUVl_ScratchOnly_WoundArea, in the positive analysis fitting direction relative to DMSO as the
negative control and baseline of activity. Using a type of morphology reporter, gain-of-signal activity can be used
to understand developmental effects. To generalize the intended target to other relatable targets, this assay
endpoint is annotated to the neurodevelopment intended target family, where the subfamily is migration.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Vala Sciences, a Contract Research Organization (CRO), provides cell-based assay services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Endothelial cell culture, treatments, and assay endpoint measurements were conducted
independently by VALA sciences (VALA).

1.9	Assay Throughput: 384-well plate. VALA systems offer medium-to-high throughput chemical screening
amenability in a 96 well format for both neurogenic- or angiogenic-specific endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Capillary growth supports angiogenesis, which is critical for development, maturation, and
reproductive function. Dysregulation of angiogenesis can lead to birth defects, macular degeneration, impaired
wound healing, and tumor growth. This assay measures compound effects on capillary network formation in
basal medium, which lacks growth factors that support capillary growth. Tube formation agonists will increase
capillary growth in these conditions.

Chemical-induced perturbations to cellular key events across angiogenic and neurogenic outcomes in vitro can
inform on cell-based prioritization of neurovascular unit (NVU) hazard potential.

2.2	Scientific Principles: The developing neurovascular unit (NVU) gives rise to the preliminary interaction of the
fetal central nervous system and circulatory system and ultimately the microvasculature forming a blood-brain-
barrier (BBB). As the NVU comprises multiple cell types interacting at various stages of development, a
methodology combining high-throughput results across pertinent cell-based assays is needed to investigate
potential chemical-induced disruption to the development of this complex cell system. With each in vitro model,
different characteristics of the developing NVU can be recapitulated to identify the biological complexity

-------
necessary for NVU development, i.e. vascular remodeling, barrier integrity measurement, transporter kinetics
for influx/efflux of xenobiotics, cell-cell interactions for tissue development, etc. ArunA and VALA assay sources
comprise several neurogenic and angiogenic endpoints, contribute to the development of the neurovascular
toxicity signature. The migration of endothelial cells is a critical part of angiogenesis. The scratch assay is widely
utilized in studies of angiogenesis, to quantify the migration of endothelial cells, and serves to complement
information obtained with the angiogenesis "tube formation" assays. These assays are relevant to embryology,
and also to toxicity/pathology.

2.3	Experimental System: adherent HUVEC cell line used. The VALA migration assay used cryopreserved human
umbilical vein endothelial cells (HUVEC) from Cell Applications, Inc. (CAI; San Diego; catalog 200-05f).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Cells were seeded on 384-well microplates (day-O) at a density of 9000 cells per well for
detection of wound area recovery or 4000 cells per well for detection of beta-catenin (CTNB) immunoreactivity
and incubated in Endothelial Cell Growth Medium (CAI, San Diego; catalog 211-500). After 24 hours of culture,
the HUVEC plates were scratched using a custom scratch head with the Bravo liquid handling system. This
created a 'wound' area without cells in each well. Test chemical was added within 30 min, each tested in 3-
replicates across a 6-point concentration response (0.1 % DMSO vehicle control). 'ScratchOnly' Plates were
incubated for a 24 h 'wound recovery' period and then fixed with 4% paraformaldehyde. Hoechst dye was used
to visualize cell nuclei in the wound area, and CTNB immunofluorescence for visualizing CTNB-positive nuclei
(beta-Catenin (L54E2) Mouse mAb (IF Preferred) 2677; Cell Signaling Technology, Danvers, MA). Each HUVEC
plate included neutral vehicle control (0.1 % DMSO) treated wells and Cytochalasin D (actin cytoskeletal
disrupter) or BlO-acetoxime (synthetic GSK3 inhibitor) as positive reference controls for endothelial migration
and nuclear translocation of CTNB, respectively.

Baseline median absolute deviation for the assay (bmad): 2.233
Response cutoff threshold used to determine hit calls: 6.699
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: Cell motility is fundamental to a variety of biological processes, ranging from development of neural-
crest-derived structures in embryology, wound healing, angiogenesis, and cell migration in the development,
growth, and metastasis of tumors. To quantify the effects of test compounds on cell motility in a format enabling
high throughput format, Vala Sciences Inc has developed a protocol for "scratch assays" compatible with cells
plated in 384 well dishes. In these experiments, cells are seeded, and allowed to form confluent monolayers.
The monolayer is then "scratched" in a reproducible manner, so that cells are removed from the center of the
well, creating a semi-rectangular empty space. The cells are then further incubated, for 7 to 48 hrs, in the
presence of test compounds. During this time period, cells from the margins of the wells migrate into the center.
The number of cells can then be quantified via methods of high content analysis (identification of nuclei, cell
shape, and expression of relevant biomarkers). If migration is inhibited, fewer cells enter the wound area.
Additionally, using these same endothelial cells, angiogenesis ('tubule' formation) assays recapitulate
microvascular vessels and large veins ("macro vessels"), which form capillary-like structures in vitro. Cells from
macro- and micro- vascular endothelial cells often differ in their tubule formation rates and characteristics,
which emphasizes the need to test compounds against cells derived from both cell types. Since cell motility is
an important component of the ability of cells to form new blood vessels, the cell migration assays will provide

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0129 nM
Key positive control:

cytochalasin D

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
information which complements the information obtained from the tubule assays in the context of
angiogenesis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Plate-level raw data, provided by each assay source, were received by EPA from each contractor
and analyzed using the ToxCast Pipeline (tcpl). Normalized response values for each assay endpoint were
calculated as resp = log2(rval/bval) where rval, bval, and pval correspond to the raw value, the plate level DMSO
control median, and the plate level positive/negative control median, respectively.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 11:
bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 15: pval.apid.medncbyconc.min
(Calculate the positive control value (pval) as the plate-wise minimum, by assay plate ID (apid), of the

-------
medians of the corrected values (cval) for gain-of-signal single- or multiple-concentration negative
control wells (wilt = m or o) by apid, well type, and concentration.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 48	Number of chemicals tested: 46

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
6

31

11

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

2
2

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

2

11

quadratic-polynomialfpoly2) model: 4

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

2

24

1

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	11.053

Neutral control median absolute deviation, by plate: nmad	1.976

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	18.8%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3029

VALA_MIGHUV2_CellCount

1.	General Information

1.1	Assay Title: Viability Assessment in the VALA Sciences human Umbilical Vein Endothelial Cell Migration
(MIGHUV) Assay, Area of Scratch Wound after Recovery

1.2	Assay Summary: VALA_MIGHUV2_BCatenin is a cell-based, multiplexed assay that uses umbilical vein-derived
endothelial cells (HUVEC). Measurements were taken 24 hours after chemical dosing in a 96-well plate.
VALA_MIGHUV2_CellCount is the one of two assay components measured from the VALA_MIGHUV2 assay. It
is designed to make measurements of viability, a form of viability reporter, as detected with fluorescence
intensity signals by HCS Fluorescent Imaging technology. Data from the assay component
VALA_MIGHUV2_CellCount was analyzed at the endpoint, VALA_MIGHUV2_CellCount, in the positive analysis
fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of viability
reporter, loss-of-signal activity can be used to understand viability. To generalize the intended target to other
relatable targets, this assay endpoint is annotated to the cell cycle intended target family, where the subfamily
is cell count.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Vala Sciences, a Contract Research Organization (CRO), provides cell-based assay services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Endothelial cell culture, treatments, and assay endpoint measurements were conducted
independently by VALA sciences (VALA).

1.9	Assay Throughput: 384-well plate. VALA systems offer medium-to-high throughput chemical screening
amenability in a 96 well format for both neurogenic- or angiogenic-specific endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.
l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Changes in fluorescence intensity related to the number of DAPI labelled nuceli is indicative of the
viability of the system.

Chemical-induced perturbations to cellular key events across angiogenic and neurogenic outcomes in vitro can
inform on cell-based prioritization of neurovascular unit (NVU) hazard potential.

2.2	Scientific Principles: NVA score

2.3	Experimental System: adherent HUVEC cell line used. The VALA migration assay used cryopreserved human
umbilical vein endothelial cells (HUVEC) from Cell Applications, Inc. (CAI; San Diego; catalog 200-05f).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5

Exposure Regime: Cells were seeded on 384-well microplates (day-0) at a density of 9000 cells per well for
detection of wound area recovery or 4000 cells per well for detection of beta-catenin (CTNB) immunoreactivity

-------
and incubated in Endothelial Cell Growth Medium (CAI, San Diego; catalog 211-500). After 24 hours of culture,
the HUVEC plates were scratched using a custom scratch head with the Bravo liquid handling system. This
created a 'wound' area without cells in each well. Test chemical was added within 30 min, each tested in 3-
replicates across a 6-point concentration response (0.1 % DMSO vehicle control). 'ScratchOnly' Plates were
incubated for a 24 h 'wound recovery' period and then fixed with 4% paraformaldehyde. Hoechst dye was used
to visualize cell nuclei in the wound area, and CTNB immunofluorescence for visualizing CTNB-positive nuclei
(beta-Catenin (L54E2) Mouse mAb (IF Preferred) 2677; Cell Signaling Technology, Danvers, MA). Each HUVEC
plate included neutral vehicle control (0.1 % DMSO) treated wells and Cytochalasin D (actin cytoskeletal
disrupter) or BlO-acetoxime (synthetic GSK3 inhibitor) as positive reference controls for endothelial migration
and nuclear translocation of CTNB, respectively.

Baseline median absolute deviation for the assay (bmad): 0.049
Response cutoff threshold used to determine hit calls: 0.146
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6	Response: Cell motility is fundamental to a variety of biological processes, ranging from development of neural-
crest-derived structures in embryology, wound healing, angiogenesis, and cell migration in the development,
growth, and metastasis of tumors. To quantify the effects of test compounds on cell motility in a format enabling
high throughput format, Vala Sciences Inc has developed a protocol for "scratch assays" compatible with cells
plated in 384 well dishes. In these experiments, cells are seeded, and allowed to form confluent monolayers.
The monolayer is then "scratched" in a reproducible manner, so that cells are removed from the center of the
well, creating a semi-rectangular empty space. The cells are then further incubated, for 7 to 48 hrs, in the
presence of test compounds. During this time period, cells from the margins of the wells migrate into the center.
The number of cells can then be quantified via methods of high content analysis (identification of nuclei, cell
shape, and expression of relevant biomarkers). If migration is inhibited, fewer cells enter the wound area.
Additionally, using these same endothelial cells, angiogenesis ('tubule' formation) assays recapitulate
microvascular vessels and large veins ("macro vessels"), which form capillary-like structures in vitro. Cells from
macro- and micro- vascular endothelial cells often differ in their tubule formation rates and characteristics,
which emphasizes the need to test compounds against cells derived from both cell types. Since cell motility is
an important component of the ability of cells to form new blood vessels, the cell migration assays will provide
information which complements the information obtained from the tubule assays in the context of
angiogenesis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0129 nM
Key positive control:

BlO-acetoxime

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
NA

Additionally, this assay was annotated to the intended target family of cell cycle.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Plate-level raw data, provided by each assay source, were received by EPA from each contractor
and analyzed using the ToxCast Pipeline (tcpl). Normalized response values for each assay endpoint were
calculated as resp = log2(rval/bval) where rval, bval, and pval correspond to the raw value, the plate level DMSO
control median, and the plate level positive/negative control median, respectively.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

7: resp.log2 (Transform the response values to log-scale (base 2).), 9: resp.fc (Calculate the normalized
response (resp) as the fold change, i.e. the ratio of the corrected (cval) and baseline (bval) values; resp =
cval/bal.), 11: bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay
plate ID (apid), of the corrected values (cval) for neutral control wells (wilt = n).)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 27: ow_bidirectional_loss (Multiply winning model hitcall (hitc) by -1 for models fit in the
positive analysis direction. Typically used for endpoints where only negative responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with

-------
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 19: viability.gnls (Flag series with
an active hit call (hitc >= 0.9) if denoted as cell viability assay with winning model is gain-loss (gnls); if hitc
>= 0.9, modl=="gnls" and cell_viability_assay == 1, then flag.), 20: no.med.gt.3bmad (Flag series where
no median response values are greater than baseline as defined by 3 times the baseline median absolute
deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg is the
number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as

active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 48	Number of chemicals tested: 46

ACTIVITY HIT CALLS

Active hit count: hitc>0.9
5

Inactive hit count: Oihitc 0.9
14

WINING MODEL SELECTION

NA hit count: hitc^O
29

Number of sample-assay endpoints with winning hill model:

1
6

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

0

13

quadratic-polynomialfpoly2) model: 3

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

23

2

-------
For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed

6484

Neutral control median absolute deviation, by plate: nmad

246.853

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100

3.72%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed
Positive control well median absolute deviation, by plate: pmad

NA

NA

Z Prime Factor for median positive and neutral control across all plates:

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

NA

-------
Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmedj /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrt(mmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5. Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

• Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

-------
•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www,epa,gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3030

VALA_M IG H U V2_Bcaten i n

1.	General Information

1.1	Assay Title: VALA Sciences human Umbilical Vein Endothelial Cell Migration (MIGHUV) Assay, Nuclear
Translocation of beta-Catenin after Recovery

1.2	Assay Summary: VALA_MIGHUV2_BCatenin is a cell-based, multiplexed assay that uses umbilical vein-derived
endothelial cells (HUVEC). Measurements were taken 24 hours after chemical dosing in a 96-well plate.
VALA_MIGHUV2_Bcatenin is the one of two assay components measured from the VALA_MIGHUV2 assay. It is
designed to make measurements of nuclear translocation of CTNB, a form of morphology reporter, as detected
with fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the assay component
VALA_MIGHUV2_BCatenin was analyzed at the endpoint, VALA_MIGHUV2_BCatenin, in the positive analysis
fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of morphology
reporter, gain-of-signal activity can be used to understand developmental effects. To generalize the intended
target to other relatable targets, this assay endpoint is annotated to the neurodevelopment intended target
family, where the subfamily is migration.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Vala Sciences, a Contract Research Organization (CRO), provides cell-based assay services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Endothelial cell culture, treatments, and assay endpoint measurements were conducted
independently by VALA sciences (VALA).

1.9	Assay Throughput: 384-well plate. VALA systems offer medium-to-high throughput chemical screening
amenability in a 96 well format for both neurogenic- or angiogenic-specific endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Capillary growth supports angiogenesis, which is critical for development, maturation, and
reproductive function. Dysregulation of angiogenesis can lead to birth defects, macular degeneration, impaired
wound healing, and tumor growth. This assay measures compound effects on capillary network formation in
complete medium, which contains growth factors that support capillary endothelial cell proliferation. Tube
formation antagonists will decrease capillary growth in these conditions.

Chemical-induced perturbations to cellular key events across angiogenic and neurogenic outcomes in vitro can
inform on cell-based prioritization of neurovascular unit (NVU) hazard potential.

2.2	Scientific Principles: The developing neurovascular unit (NVU) gives rise to the preliminary interaction of the
fetal central nervous system and circulatory system and ultimately the microvasculature forming a blood-brain-
barrier (BBB). As the NVU comprises multiple cell types interacting at various stages of development, a
methodology combining high-throughput results across pertinent cell-based assays is needed to investigate
potential chemical-induced disruption to the development of this complex cell system. With each in vitro model,
different characteristics of the developing NVU can be recapitulated to identify the biological complexity

-------
necessary for NVU development, i.e. vascular remodeling, barrier integrity measurement, transporter kinetics
for influx/efflux of xenobiotics, cell-cell interactions for tissue development, etc. ArunA and VALA assay sources
comprise several neurogenic and angiogenic endpoints, contribute to the development of the neurovascular
toxicity signature. The migration of endothelial cells is a critical part of angiogenesis. The scratch assay is widely
utilized in studies of angiogenesis, to quantify the migration of endothelial cells, and serves to complement
information obtained with the angiogenesis "tube formation" assays. These assays are relevant to embryology,
and also to toxicity/pathology.

2.3	Experimental System: adherent HUVEC cell line used. The VALA migration assay used cryopreserved human
umbilical vein endothelial cells (HUVEC) from Cell Applications, Inc. (CAI; San Diego; catalog 200-05f).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Cells were seeded on 384-well microplates (day-O) at a density of 9000 cells per well for
detection of wound area recovery or 4000 cells per well for detection of beta-catenin (CTNB) immunoreactivity
and incubated in Endothelial Cell Growth Medium (CAI, San Diego; catalog 211-500). After 24 hours of culture,
the HUVEC plates were scratched using a custom scratch head with the Bravo liquid handling system. This
created a 'wound' area without cells in each well. Test chemical was added within 30 min, each tested in 3-
replicates across a 6-point concentration response (0.1 % DMSO vehicle control). 'ScratchOnly' Plates were
incubated for a 24 h 'wound recovery' period and then fixed with 4% paraformaldehyde. Hoechst dye was used
to visualize cell nuclei in the wound area, and CTNB immunofluorescence for visualizing CTNB-positive nuclei
(beta-Catenin (L54E2) Mouse mAb (IF Preferred) 2677; Cell Signaling Technology, Danvers, MA). Each HUVEC
plate included neutral vehicle control (0.1 % DMSO) treated wells and Cytochalasin D (actin cytoskeletal
disrupter) or BlO-acetoxime (synthetic GSK3 inhibitor) as positive reference controls for endothelial migration
and nuclear translocation of CTNB, respectively.

Baseline median absolute deviation for the assay (bmad): 3.722
Response cutoff threshold used to determine hit calls: 11.166
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: Cell motility is fundamental to a variety of biological processes, ranging from development of neural-
crest-derived structures in embryology, wound healing, angiogenesis, and cell migration in the development,
growth, and metastasis of tumors. To quantify the effects of test compounds on cell motility in a format enabling
high throughput format, Vala Sciences Inc has developed a protocol for "scratch assays" compatible with cells
plated in 384 well dishes. In these experiments, cells are seeded, and allowed to form confluent monolayers.
The monolayer is then "scratched" in a reproducible manner, so that cells are removed from the center of the
well, creating a semi-rectangular empty space. The cells are then further incubated, for 7 to 48 hrs, in the
presence of test compounds. During this time period, cells from the margins of the wells migrate into the center.
The number of cells can then be quantified via methods of high content analysis (identification of nuclei, cell
shape, and expression of relevant biomarkers). If migration is inhibited, fewer cells enter the wound area.
Additionally, using these same endothelial cells, angiogenesis ('tubule' formation) assays recapitulate
microvascular vessels and large veins ("macro vessels"), which form capillary-like structures in vitro. Cells from
macro- and micro- vascular endothelial cells often differ in their tubule formation rates and characteristics,
which emphasizes the need to test compounds against cells derived from both cell types. Since cell motility is
an important component of the ability of cells to form new blood vessels, the cell migration assays will provide

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0129 nM
Key positive control:

cytochalasin D

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
information which complements the information obtained from the tubule assays in the context of
angiogenesis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Plate-level raw data, provided by each assay source, were received by EPA from each contractor
and analyzed using the ToxCast Pipeline (tcpl). Normalized response values for each assay endpoint were
calculated as resp = log2(rval/bval) where rval, bval, and pval correspond to the raw value, the plate level DMSO
control median, and the plate level positive/negative control median, respectively.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 11:
bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 15: pval.apid.medncbyconc.min
(Calculate the positive control value (pval) as the plate-wise minimum, by assay plate ID (apid), of the

-------
medians of the corrected values (cval) for gain-of-signal single- or multiple-concentration negative
control wells (wilt = m or o) by apid, well type, and concentration.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 48	Number of chemicals tested: 46

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
5

10

33

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

2

16

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

3

8

quadratic-polynomialfpoly2) model: 4

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

2

1

12

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	0.068

Neutral control median absolute deviation, by plate: nmad	0.007

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.26%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 2.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------
Assay Endpoint ID: 3031

VALA_MIGHUV2_WoundArea

1.	General Information

1.1	Assay Title: VALA Sciences human Umbilical Vein Endothelial Cell Migration (MIGHUV) Assay, Area of Scratch
Wound after Recovery

1.2	Assay Summary: VALA_MIGHUV2_BCatenin is a cell-based, multiplexed assay that uses umbilical vein-derived
endothelial cells (HUVEC). Measurements were taken 24 hours after chemical dosing in a 96-well plate.
VALA_MIGHUV2_WoundArea is the one of two assay components measured from the VALA_MIGHUV2 assay.
It is designed to make measurements of endothelial migration, a form of morphology reporter, as detected with
fluorescence intensity signals by HCS Fluorescent Imaging technology. Data from the assay component
VALA_MIGHUV2_WoundArea was analyzed at the endpoint, VALA_MIGHUV2_WoundArea, in the positive
analysis fitting direction relative to DMSO as the negative control and baseline of activity. Using a type of
morphology reporter, gain-of-signal activity can be used to understand developmental effects. To generalize
the intended target to other relatable targets, this assay endpoint is annotated to the neurodevelopment
intended target family, where the subfamily is migration.

1.3	Date of Document Creation: September 05 2024

1.4	Authors and Contact Information:

US Environmental Protection Agency (EPA) Center for Computational Toxicology and Exposure (CCTE)
109 T.W. Alexander Drive (Mail Code D143-02)

Research Triangle Park, NC 27711

1.5	Assay Source: Vala Sciences, a Contract Research Organization (CRO), provides cell-based assay services.

1.6	Date of Assay Development: For date of assay development, see Section 6: Bibliography.

1.7	References: For complete list of references, see Section 6: Bibliography.

1.8	Proprietary Elements: Endothelial cell culture, treatments, and assay endpoint measurements were conducted
independently by VALA sciences (VALA).

1.9	Assay Throughput: 384-well plate. VALA systems offer medium-to-high throughput chemical screening
amenability in a 96 well format for both neurogenic- or angiogenic-specific endpoints.

l.lOStatus: The assay is fully developed, and data are publicly available in ToxCast's invitroDB.

l.llAbbreviations:

AIC: Akaike Information Criterion	ToxCast: US EPA's Toxicity Forecaster Program

AOP: Adverse Outcome Pathway	tcpl: ToxCast Data Analysis Pipeline R Package

CV: Coefficient of Variation	SSMD: Strictly Standardized Mean Difference

DMSO: Dimethyl Sulfoxide

2.	Test Method Description

2.1	Purpose: Capillary growth supports angiogenesis, which is critical for development, maturation, and
reproductive function. Dysregulation of angiogenesis can lead to birth defects, macular degeneration, impaired
wound healing, and tumor growth. This assay measures compound effects on capillary network formation in
complete medium, which contains growth factors that support capillary endothelial cell proliferation. Tube
formation antagonists will decrease capillary growth in these conditions.

Chemical-induced perturbations to cellular key events across angiogenic and neurogenic outcomes in vitro can
inform on cell-based prioritization of neurovascular unit (NVU) hazard potential.

2.2	Scientific Principles: The developing neurovascular unit (NVU) gives rise to the preliminary interaction of the
fetal central nervous system and circulatory system and ultimately the microvasculature forming a blood-brain-
barrier (BBB). As the NVU comprises multiple cell types interacting at various stages of development, a
methodology combining high-throughput results across pertinent cell-based assays is needed to investigate
potential chemical-induced disruption to the development of this complex cell system. With each in vitro model,
different characteristics of the developing NVU can be recapitulated to identify the biological complexity

-------
necessary for NVU development, i.e. vascular remodeling, barrier integrity measurement, transporter kinetics
for influx/efflux of xenobiotics, cell-cell interactions for tissue development, etc. ArunA and VALA assay sources
comprise several neurogenic and angiogenic endpoints, contribute to the development of the neurovascular
toxicity signature. The migration of endothelial cells is a critical part of angiogenesis. The scratch assay is widely
utilized in studies of angiogenesis, to quantify the migration of endothelial cells, and serves to complement
information obtained with the angiogenesis "tube formation" assays. These assays are relevant to embryology,
and also to toxicity/pathology.

2.3	Experimental System: adherent HUVEC cell line used. The VALA migration assay used cryopreserved human
umbilical vein endothelial cells (HUVEC) from Cell Applications, Inc. (CAI; San Diego; catalog 200-05f).

2.4	Metabolic Competence: Xenobiotic biotransformation potential has not been characterized.

2.5	Exposure Regime: Cells were seeded on 384-well microplates (day-O) at a density of 9000 cells per well for
detection of wound area recovery or 4000 cells per well for detection of beta-catenin (CTNB) immunoreactivity
and incubated in Endothelial Cell Growth Medium (CAI, San Diego; catalog 211-500). After 24 hours of culture,
the HUVEC plates were scratched using a custom scratch head with the Bravo liquid handling system. This
created a 'wound' area without cells in each well. Test chemical was added within 30 min, each tested in 3-
replicates across a 6-point concentration response (0.1 % DMSO vehicle control). 'ScratchOnly' Plates were
incubated for a 24 h 'wound recovery' period and then fixed with 4% paraformaldehyde. Hoechst dye was used
to visualize cell nuclei in the wound area, and CTNB immunofluorescence for visualizing CTNB-positive nuclei
(beta-Catenin (L54E2) Mouse mAb (IF Preferred) 2677; Cell Signaling Technology, Danvers, MA). Each HUVEC
plate included neutral vehicle control (0.1 % DMSO) treated wells and Cytochalasin D (actin cytoskeletal
disrupter) or BlO-acetoxime (synthetic GSK3 inhibitor) as positive reference controls for endothelial migration
and nuclear translocation of CTNB, respectively.

Baseline median absolute deviation for the assay (bmad): 4.114
Response cutoff threshold used to determine hit calls: 12.343
Detection technology used: HCS Fluorescent Imaging (Fluorescence)

2.6 Response: Cell motility is fundamental to a variety of biological processes, ranging from development of neural-
crest-derived structures in embryology, wound healing, angiogenesis, and cell migration in the development,
growth, and metastasis of tumors. To quantify the effects of test compounds on cell motility in a format enabling
high throughput format, Vala Sciences Inc has developed a protocol for "scratch assays" compatible with cells
plated in 384 well dishes. In these experiments, cells are seeded, and allowed to form confluent monolayers.
The monolayer is then "scratched" in a reproducible manner, so that cells are removed from the center of the
well, creating a semi-rectangular empty space. The cells are then further incubated, for 7 to 48 hrs, in the
presence of test compounds. During this time period, cells from the margins of the wells migrate into the center.
The number of cells can then be quantified via methods of high content analysis (identification of nuclei, cell
shape, and expression of relevant biomarkers). If migration is inhibited, fewer cells enter the wound area.
Additionally, using these same endothelial cells, angiogenesis ('tubule' formation) assays recapitulate
microvascular vessels and large veins ("macro vessels"), which form capillary-like structures in vitro. Cells from
macro- and micro- vascular endothelial cells often differ in their tubule formation rates and characteristics,
which emphasizes the need to test compounds against cells derived from both cell types. Since cell motility is
an important component of the ability of cells to form new blood vessels, the cell migration assays will provide

ASSAY DESIGN SUMMARY

Nominal number of tested concentrations:
6

Standard minimum concentration tested:

0.0129 nM
Key positive control:

cytochalasin D

Target (nominal) number of replicates:

3

Standard maximum concentration tested:

100 nM
Neutral vehicle control:

DMSO

-------
information which complements the information obtained from the tubule assays in the context of
angiogenesis.

2.7	Quality and Acceptance Criteria: Each assay may utilize different acceptance criteria and quality assurance
methods as it pertains to the individual assay platform and implementation. Pre-processing transformations
may indicate issues in plates or wells by setting well quality (wllq) values to 0. Analytical QC calls per sample and
substance can also be considered to understand the applicability domain of the chemicals for screening.

2.8	Technical Limitations: ToxCast data can provide initial (screening) information about the capacity for a chemical
to illicit a biological response; caution is advised with extrapolation of these results to organism-level responses.
The potential for a chemical to elicit adverse health outcomes in living systems is a function of multiple factors,
and this assay is not intended to provide predictive details regarding long term or indirect adverse effects in
complex biological systems but can aid in the prioritization of compound selection for more resource intensive
toxicity studies. See Section 4.4. for more information on the chemical applicability of the assay.

2.9	Related Assays: For related assays, consult the following assay lists or intended target families. This assay is
present in the following assay lists:

NA

Additionally, this assay was annotated to the intended target family of neurodevelopment.

3. Data Interpretation

The ToxCast Data Analysis Pipeline (tcpl) R package includes processing functionality for two screening
paradigms: (1) single-concentration ("SC") and (2) multiple-concentration ("MC") screening. SC screening
consists of testing chemicals at one concentration, often for the purpose of identifying potentially active
chemicals to test in the multiple-concentration format. MC screening consists of testing chemicals across a
concentration range, such that the modeled activity can give an estimate of potency, efficacy, etc. MC data is
the focus of this documentation, with SC data processing metrics to be incorporated in the future.

3.1	Responses captured in prediction model: See Section 2.6 for additional information on responses measured.

3.2	Data Analysis: Plate-level raw data, provided by each assay source, were received by EPA from each contractor
and analyzed using the ToxCast Pipeline (tcpl). Normalized response values for each assay endpoint were
calculated as resp = log2(rval/bval) where rval, bval, and pval correspond to the raw value, the plate level DMSO
control median, and the plate level positive/negative control median, respectively.

Prior to the data processing, all the data must go through pre-processing to transform the heterogeneous data
into a uniform format before it can be loaded into a database. Level 0 pre-processing is done in R by
vendor/dataset-specific scripts with all manual transformations to the data documented with justification.
Common examples of manual transformations include fixing a sample ID typo or changing well quality
value(wllq) to 0 after identifying problems such a plate row/column missing an assay reagent.

Once data is loaded into the database, tcpl utilizes generalized processing functions provided to process,
normalize, model, qualify, and visualize the data. To promote reproducibility, all method assignments must
occur through the database and should come from the available list of methods for each processing level.
Assigned multiple concentration processing methods include:

Level 2: Component-specific corrections include:

1: none (Use corrected response value (cval) as is; cval = cval. No additional mc2 methods needed for
component-specific corrections.)

Level 3: Endpoint-specific normalization include:

5: resp.pc (Calculate the normalized response (resp) as a percent of control, i.e. the ratio of the difference
between the corrected (cval) and baseline (bval) values divided the difference between the positive
control (pval) and baseline (bval) values multiplied by 100; resp = (cval-bval)/(pval-bval)*100.), 11:
bval.apid.nwlls.med (Calculate the baseline value (bval) as the plate-wise median, by assay plate ID
(apid), of the corrected values (cval) for neutral control wells (wilt = n).), 15: pval.apid.medncbyconc.min
(Calculate the positive control value (pval) as the plate-wise minimum, by assay plate ID (apid), of the

-------
medians of the corrected values (cval) for gain-of-signal single- or multiple-concentration negative
control wells (wilt = m or o) by apid, well type, and concentration.)

Level 4: Baseline and required tcplFit2 parameters defined by:

2: bmad.aeid.lowconc.nwells (Calculate the baseline median absolute value (bmad) as the median
absolute deviation of normalized response values (resp) for neutral control wells (wilt = n). Calculate one
standard deviation of the normalized response for neutral control wells (wilt = n); onesd = sqrt(sum((resp
- mean resp)A2)/sample size - 1). Onesd is used to establish BMR and therefore required for tcplfit2
processing.)

Level 5: Possible cutoff thresholds, where higher value for endpoint is selected, include:

1: bmad3 (Add a cutoff value of 3 multiplied by the baseline median absolute deviation (bmad) as defined
at Level 4.), 28: ow_bidirectional_gain (Multiply winning model hitcall (hitc) by -1 for models fit in the
negative analysis direction. Typically used for endpoints where only positive responses are biologically
relevant.)

Level 6: Cautionary flagging include:

5: modi.directionality.fail (Flag series if model directionality is questionable, i.e. if the winning model
direction was opposite, more responses (resp) would have exceeded the cutoff (coff). If loss was winning
directionality (top < 0), flag if count(resp < -l*coff) < 2*count(resp > coff). If gain was winning
directionality (top > 0), flag if count(resp > coff) < 2*count(resp < -l*coff).), 6: singlept.hit.high (Flag
single-point hit that's only at the highest cone tested, where series is an active hit call (hitc >= 0.9) with
the median response observed above baseline occurring only at the highest tested concentration tested.
), 7: singlept.hit.mid (Flag single-point hit that's not at the highest cone tested, where series is an active
hit call (hitc >= 0.9) with the median response observed above baseline occurring only at one
concentration and not the highest concentration tested.), 8: multipoint.neg (Flag multi-point miss, where
series is an inactive hit call (hitc < 0.9) with multiple median responses observed above baseline.), 9:
bmd.high (Flag series if modeled benchmark dose (BMD) is greater than AC50 (concentration at 50
percent maximal response). This is indicates high variability in baseline response in excess of more than
half of the maximal response.), 10: noise (Flag series as noisy if the quality of fit as calculated by the root
mean square error (rmse) for the series is greater than the cutoff (coff); rmse > coff.), 11: border (Flag
series if borderline activity is suspected based on modeled top parameter (top) relative to cutoff (coff);
| top | <= 1.2(coff) or | top | >= 0.8(coff).), 13: low.nrep (Flag series if the average number of replicates
per concentration is less than 2; nrep < 2.), 14: low.nconc (Flag series if 4 concentrations or less were
tested; nconc <= 4.), 15: gnls.lowconc (Flag series where winning model is gain-loss (gnls) and the gain
AC50 is less than the minimum tested concentration, and the loss AC50 is less than the mean tested
concentration.), 17: efficacy.50 (Flag low efficacy hits if series has an active hit call (hitc >= 0.9) and
efficacy values (e.g. top and maximum median response) less than 50 percent; intended for biochemical
assays. If hitc >= 0.9 and coff >= 5, then flag when top < 50 or max_med < 50. If hitc >= 0.9 and coff < 5,
then flag when top < log2(1.5) or max_med < log2(1.5).), 18: ac50.lowconc (Flag series with an active hit
call (hitc >= 0.9) if AC50 (concentration at 50 percent maximal response) is less than the lowest
concentration tested;if hitc >= 0.9 and ac50 < 10Alogc_min, then flag.), 20: no.med.gt.3bmad (Flag series
where no median response values are greater than baseline as defined by 3 times the baseline median
absolute deviation (bmad); nmed_gtbl_pos and nmed_gtbl_neg both = 0, where nmed_gtbl_pos/_neg
is the number of medians greater than 3*bmad/less than -3*bmad.)

The following is an aggregate endpoint summary of the number of samples and chemicals tested, as well as
active or inactive hit calls (hitc) and predicted winning models for all samples tested in this endpoint.

SAMPLE AND CHEMICAL COVERAGE

Number of samples tested: 48	Number of chemicals tested: 46

ACTIVITY HIT CALLS

Inactive hit count: 0
-------
Active hit count: hitc>0.9
6

23

19

WINING MODEL SELECTION

Number of sample-assay endpoints with winning hill model:

0
4

gain-loss (gnls) model:
power(pow) model:
linear-polynomial (polyl) model:

2

16

quadratic-polynomialfpoly2) model: 6

exponential-2 (exp2) model:
exponential-3 (exp3) model:
exponential-4 (exp4) model:
exponential-5 (exp5) model:

0

0

3

17

For each concentration series, several point-of-departure (POD) estimates are calculated for the winning model.
The major estimates include: (1) the activity concentration at the specified benchmark response (BMR) (bmd),
(2) the activity concentration at 50% of the maximal response (ac50), (3) the activity concentration at the
efficacy cutoff (acc), (4) the activity concentration at 10% of the maximal response, and (5) the concentration at
5% of the maximal response.

3.3	Prediction Model: All statistical analyses were conducted using R programming language, employing the tcpl
package to generate model parameters and confidence intervals. Each chemical concentration response series
is fit to ten predictive models, encoded by the dependency package tcplfit2. The models include the constant,
Hill, gain-loss, two polynomials (i.e. linear and quadratic), power, and four exponential variants. The
polynomials, power, and exponential models are all based on BMDExpress2. The winning model (modi) is
selected based on the lowest AIC value and is used to determine the activity (or hit call) for the concentration
series. If two models have equal AIC values, then the simpler model (i.e. model with fewer parameters) wins. In
invitrodb, levels 4 and 5 capture model fit information. mc4 captures summary values calculated for each
concentration series, whereas mc4_param stores the estimated model parameters for all models fit to data in
long format. mc5 captures the winning model selected and the activity hit call, whereas mc5_param stores the
estimated model parameters for the selected winning model in long format. Activity for each concentration-
response series is determined by calculating a continuous hit-call for the winning model, which is the product
of three proportional weights. The first weight reflects whether there is at least one median response outside
the efficacy cutoff band. Second, the top (or maximal change in the predicted response) is larger than the cutoff.
The last weight reflects whether the AIC of the winning model is less than the constant model, i.e. the winning
model is better fit than a flat line.

The continuous hit call value (hitc), fit category (fitc), and cautionary flags (mc6) can be used to understand the
goodness-of-fit, enabling the user to decide the stringency with which to filter and interpret results. Hitc may
be further binarized into active or inactive, depending on the level of stringency required by the user; herein,
hitc greater than or equal to 0.90 is active, hitc between 0 and 0.90 is inactive, and hitc less than 0 is not
applicable, but different thresholds may be used. Fitc was summarize curve behavior relative activity, efficacy,
and potency comparisons between the AC50 and the concentration range screened. Cautionary flags on fitting
were developed in previous versions of tcpl and have been stored at level 6. These flags are programmatically
generated and indicate characteristics of a curve that need extra attention or potential anomalies in the curve
or data. Users may review these filtered groupings to understand high-confidence curves.

3.4	Software: The ToxCast Data Analysis Pipeline (tcpl) is an R package that manages, curve-fits, plots, and stores
ToxCast data to populate its linked MySQL database, invitrodb. Data for invitrodb v4.2 was processed using the
tcpl v3.2. See Section 7: Supporting Information on the ToxCast program and tcpl R package.

-------
4. Test Method Performance

4.1 Robustness: The following assay performance metrics surmise the robustness of the method i.e. the reliability
of the experimental results and the prediction capability of the model used.

NEUTRAL CONTROL (well type = "n")

Neutral control well median response value, by plate: nmed	10.043

Neutral control median absolute deviation, by plate: nmad	1.101

Coefficient of variation (CV%) in neutral control wells: (nmad/nmed)*100	11.05%

POSITIVE CONTROL (well type = "p")

Positive control well median response value, by plate: pmed	NA

Positive control well median absolute deviation, by plate: pmad	NA

Z Prime Factor for median positive and neutral control across all plates:	NA

(1 - ((3 * (pmad + nmad)) / absfpmed - nmed))

Strictly standardized mean difference (SSMD) for positive compared to neutral control wells:	NA

((pmed - nmed) /sqrtfpmad2 + nmad2)

Positive control signal-to-noise: ((pmed-nmed)/nmad)	NA

Positive control signal-to-background: (pmed/nmed)	NA

NEGATIVE CONTROL (well type = "m")

Negative control well median, by plate: mmed	NA

Negative control well median absolute deviation value, by plate: mmad	NA

Z Prime Factor for median negative and neutral control across all plates:	NA

(1 - ((3 * (mmad + nmad)) / absfmmed - nmed))

Strictly standardized mean difference (SSMD) for negative compared to neutral control wells:	NA

((mmed - nmed) /sqrtfmmad2 + nmad2)

Signal-to-noise (median across all plates, using negative control wells):	NA

((mmed-nmed)/nmad)

Signal-to-background (median across all plates, using negative control wells):	NA

(mmed/nmed)

4.2	Reference Chemical Information: Reference chemical curation is ongoing, and this section will be updated as
more information becomes available.

4.3	Performance Measures and Predictive Capacity: The performance and predictivity for a given assay may be
evaluated with a variety of performance statistics but is dependent upon available data. Predictive capacity (i.e.
false negative, false positive rates) will be assessed when reference chemical information is available. Ideally,
assays will have sufficient data on reference chemicals (i.e. positive and negative controls) to enable estimation
of accuracy statistics, such as sensitivity and specificity.

ToxCast targets may align to a range of event types in the Adverse Outcome Pathway (AOP) framework, however
each assay technology may have specific limitations, which may require user discretion for more complex
interpretations of the data.

The median root mean squared error (RMSE) across all winning models for active hits was calculated as: 3.

4.4	Chemical Library Scope and Limitations: The ToxCast Chemical Library was designed to capture a large spectrum
of structurally and physicochemically diverse compounds. This chemical inventory incorporates toxicity data-
rich chemicals, chemicals spanning major use-categories, and chemicals with exposure potential, including but

-------
not limited to pesticides, antimicrobials, fragrances, green chemistry alternatives, food additives, toxicity
reference compounds and failed pharmaceuticals. In addition to environmental or exposure concerns, chemical
selection criteria also consider practical constraints, such as commercial availability, dimethyl sulfoxide (DMSO)
solubility and stability, and suitability for testing in automated or semi-automated systems (e.g., low volatility
and moderate LogP values). Under these constraints, there were three major, interrelated drivers for chemical
selection: availability of animal toxicity data or mechanistic knowledge, exposure potential, and EPA regulatory
interest. The first driver would provide the necessary in vivo and mechanistic data to anchor and validate
subsequent prediction modeling efforts, whereas the latter two were intended to provide coverage of the
chemical landscape to which humans and ecosystems are potentially exposed and for which toxicity data are
mostly lacking. Analytical QC calls per sample and substance should be considered to understand the
applicability domain of the chemicals for screening.

5.	Potential Regulatory Applications

5.1 Context of Use: Examples of end use scenarios could include, but are not limited to:

•	Support Category Formation and Read-Across: The outcomes from the assay could be used to
substantiate a hypothesis for grouping substances together for the purposes of read-across,

•	Priority Setting: The assay might help prioritize substances within an inventory for more detailed
evaluation,

•	Screening Level Assessment of a Biomarker or Mechanistic Activity or Response: The screening level
assessment may be sufficient to identify a hazard and provide a gauge of potency; or

•	Integrated approaches to testing and assessment (IATA): The assay may form one component of an
IATA.

6.	Bibliography: Zurlinden TJ, Saili KS, Baker NC, Toimela T, Heinonen T, Knudsen TB. A cross-platform approach to
characterize and screen potential neurovascular unit toxicants. Reprod Toxicol. 2020 Jun 24;96:300-315. doi:
10.1016/j.reprotox.2020.06.010. Epub ahead of print. PMID: 32590145.

7.	Supporting Information:

More information on the ToxCast program can be found at: https://www.epa.gov/chemical-research/toxicity-
forecasting. The most recent version of downloadable data can be found at: https://www.epa.gov/chemical-
research/exploring-toxcast-data-downloadable-data. The ToxCast Data Analysis Pipeline (tcpl) R package is
available on CRAN or GitHub. Check out tcpl's vignette for comprehensive documentation describing ToxCast
data processing, retrieval, and interpretation.

-------